Abstract
Different learned sparse retrieval (LSR) models offer different trade-offs between effectiveness and efficiency. However, while there are standardized and interoperable tools to assess LSR effectiveness, there is no agreed-upon methodology for evaluating efficiency, and datasets with high-quality relevance judgments are too large for repeated efficiency experiments, e.g., across different hardware. To promote the evaluation of LSR~models for both effectiveness and efficiency, we introduce the \lsrBenchmark, which measures the retrieval effectiveness and the efficiency of each step in an LSR~pipeline (document embedding, indexing, query embedding, and retrieval). To ensure tractability and extensibility, we apply current corpus subsampling methods to eleven TREC tasks, precompute embeddings with eleven LSR~models per task, and provide eight retrieval systems as baselines. For the benchmark's hosted version, a modular~API and tools for evaluating effectiveness and efficiency make submitting new approaches easy. Our experiments show that the chosen embedding model significantly affects the efficiency of a retrieval system and that LSR is more effective but less efficient than BM25, an efficiency gap our benchmark helps to track as new LSR models are published.