Accepted Papers

This heading text can be changed from Forms > User instructions

Towards Quantitative Summarization Evaluation: AnIntegrated Atomic-Based Eval...

Despite the dramatic advances in Large Language Models (LLMs), traditional summarization benchmarks are critically saturated, failing to differentiate state-of-the-art models or reflect real-world user needs. Existing evaluation datasets suffer from fundamental constraints in instructional diversity...

Applications Evaluation research
Full papers

Yan Lei

Display #