Accepted Papers

This heading text can be changed from Forms > User instructions

Evaluating Large Language Models as Domain-SpecificRetrieval Agents: A Study ...

Large Language Models are increasingly used as retrieval and reasoning agents in specialized domains. This study evaluates their performance on cybersecurity Capture-the-Flag challenges, reframed as structured retrieval tasks where models must infer information from textual and code-based evidence. ...

IR evaluationLarge Language ModelsSystem aspects
Short papers

Omed Abed

Display #