UserSimCRS v2: Simulation-Based Evaluation for Conversational Recommender Systems

This abstract has open access

Abstract Summary

Resources for simulation-based evaluation of conversational recommender systems (CRSs) are scarce. The UserSimCRS toolkit was introduced to address this gap. In this work, we present UserSimCRS v2, a significant upgrade aligning the toolkit with state-of-the-art research. Key extensions include an enhanced agenda-based user simulator, introduction of large language model-based simulators, integration for a wider range of CRSs and datasets, and new LLM-as-a-judge evaluation utilities. We demonstrate these extensions in a case study.

Abstract ID :

NKDR128

Submission Type

Resource

Submission Topics

Associated Sessions

Resource I: Interactive And Conversational Search

Author
Co-Authors

Nolwenn Bernard

TH Köln

Krisztian Balog

Professor

,

University Of Stavanger

Abstracts With Same Type

Abstract ID

Abstract Title

Abstract Topic

Submission Type

Primary Author

NKDR132

An Open SERP Mining Infrastructure for the Archive Query Log

Resource

Mr. Jan Heinrich Merker

NKDR140

Beyond the Click: A Framework for Inferring Cognitive Traces in Search

User aspects in IR

Resource

Saber Zerhoudi

NKDR136

BioGraphletQA: Knowledge-Anchored Generation of ComplexQuestion Answering Datasets

Resource

Richard Jonker

NKDR129

CitiLink-Minutes: A Multilayer Annotated Dataset ofMunicipal Meeting Minutes

Machine Learning and Large Language Models Societally-motivated IR research

Resource

Ricardo Campos

NKDR131

ClaimPT: A Portuguese Dataset of Annotated Claims in News Articles

Machine Learning and Large Language Models Societally-motivated IR research

Resource

Ricardo Campos

NKDR93

CoRECT: A Framework for Evaluating Embedding CompressionTechniques at Scale

Evaluation research Machine Learning and Large Language Models Search and ranking

Resource

Laura Caspari

NKDR130

Evaluating the Efficiency and Effectiveness of Learned Sparse Retrieval with the lsr_benchmark

Resource

Maik Fröbe

NKDR119

FaE: A Resource of Logs, Profiles, and Rankings for Academic Expert Finding

Resource

Marjan Azimi

NKDR125

FoodNexus: Massive Food Knowledge for Recommender Systems

Evaluation research Recommender systems

Resource

Ludovico Boratto

View All Abstracts

126 visits