Less LLM, More Documents: Searching for Improved RAG

This abstract has open access

Abstract Summary

Retrieval-Augmented Generation (RAG) couples document retrieval with large language models (LLMs). While scaling generators improves accuracy, it also raises cost and limits deployability. We explore an orthogonal axis: enlarging the retriever¡¯s corpus to reduce reliance on large LLMs. Experimental results show that corpus scaling consistently strengthens RAG and can often serve as a substitute for increasing model size, though with diminishing returns at larger scales. Small- and mid-sized generators paired with larger corpora often rival much larger models with smaller corpora; mid-sized models tend to gain the most, while tiny and large models benefit less. Our analysis shows that improvements arise primarily from increased coverage of answer-bearing passages, while utilization efficiency remains largely unchanged. These findings establish a principled corpus¨Cgenerator trade-off: investing in larger corpora offers an effective path to stronger RAG, often comparable to enlarging the LLM itself.

Abstract ID :

NKDR57

Submission Type

Submission Topics

Search and ranking

Associated Sessions

RAG: Retrieval Utility, Scaling & Infrastructure

Author
Co-Authors

Carnegie Mellon University

Yibo Kong

Carnegie Mellon University

Yunfan Long

Student

,

Carnegie Mellon University

Abstracts With Same Type

Abstract ID

Abstract Title

Abstract Topic

Submission Type

Primary Author

NKDR52

An Empirical Study of Model Casing in Learned Sparse Retrieval

Search and ranking

Full papers

Emmanouil Georgios Lionis

NKDR58

Breaking Flat: A Generalised Query Performance PredictionEvaluation Framework

Full papers

Ms. PAYEL SANTRA

NKDR51

Bribery-Resistant Ranking Systems: A Multipartite User-Agnostic Framework for AI Act Compliance

Search and rankingSocietally-motivated IR research

Full papers

Martim Baltazar

NKDR15

Contradictions in Context: Challenges forRetrieval-Augmented Generation in Healthcare

ApplicationsMachine Learning and Large Language Models

Full papers

Saeedeh Javadi

NKDR49

Cross-Sensory Brain Passage Retrieval: Scaling Beyond Visual to Audio

Societally-motivated IR researchUser aspects in IR

Full papers

Niall McGuire

NKDR177

Event-aware Video Corpus Moment Retrieval

ApplicationsSearch and ranking

Full papers

Danyang Hou

NKDR184

Event-aware Video Corpus Moment Retrieval

ApplicationsEvaluation research

Full papers

Danyang Hou

NKDR193

Event-aware Video Corpus Moment Retrieval

ApplicationsSearch and ranking

Full papers

Danyang Hou

NKDR39

ExpertMix: Aspect and Severity Detection in ConversationalComplaints

ApplicationsMachine Learning and Large Language Models

Full papers

Sarmistha Das

View All Abstracts

90 visits