Most conventional RAG pipelines rely on relevance-based retrieval, which often misaligns with utility --- that is, whether the retrieved passages actually improve generation quality. The limitations of existing utility-driven retrieval approaches for RAG are that, firstly, they are resource-intensiv...
Machine Learning and Large Language ModelsFull papers