You really just need to augment with tight prompting and know how to extract information and links to peer reviewed literature and well sourced information. Once again, technology here is a lever. Separating wheat from chaff has been key in academic and information pursuits forever and it is becoming ever more important.
And you're gonna review that voluminous academic literature in a field you're not familiar with, right?...
That is my point: an LLM can be great if you know the field and can spot errors. Or, to a lesser extent, if you have some automatic feedback loop that the model can't easily game ("does this code pass unit tests?"). It's a lot less great if there's a risk that you won't detect the early drift.