My point is that LLMs, by virtue of how they work, cannot properly evaluate novel research.
Edit: consider the following hypothetical:
A couple of biologists travel to a remote location and discover a frog with an unusual method of attracting prey. This frog secretes its own blood onto leaves, and then captures the flies that land on the blood.
This is quite plausible given the many, many ways evolution shapes predator-prey relationships, but (to my knowledge) it has not been shown before.
The biologists may have extensive documentation of this observation, but there is simply no way that an LLM would be able to evaluate this documentation.