Skip to main content
PaperQA2 Language Model Released September 19

PaperQA2 Language Model Released September 19

September 2024

Of Interest to the Information Community

More Details Here

From the Paper's Abstract

Language models are known to “hallucinate” incorrect information, and it is unclear if they are sufficiently accurate and reliable for use in scientific research. We developed a rigorous human-AI comparison methodology to evaluate language model agents on real-world literature search tasks covering information retrieval, summarization, and contradiction detection tasks. We show that PaperQA2, a frontier language model agent optimized for improved factuality, matches or exceeds subject matter expert performance on three realistic literature research tasks without any restrictions on humans (i.e., full access to internet, search tools, and time).

Full Text