Context Engineering for Agentic Data Science

This abstract has open access
Abstract Summary
We demonstrate CEDAR, an application for automating data science (DS) tasks with an agentic setup. Solving DS problems with LLMs is an underexplored area that has immense market value. The challenges are manifold: task complexities, data sizes, computational limitations, and context restrictions. We show that these can be alleviated via effective context engineering. We first impose structure into the initial prompt with DS-specific input fields, that serve as instructions for the agentic system. The solution is then materialized as an enumerated sequence of interleaved plan and code blocks generated by separate LLM agents, providing a readable structure to the context at any step of the workflow. Function calls for generating these intermediate texts, and for corresponding Python code, ensure that data stays local, and only aggregate statistics and associated instructions are injected into LLM prompts. Fault tolerance and context management are introduced via iterative code generation and smart history rendering. The viability of our agentic data scientist is demonstrated using canonical Kaggle challenges.
Abstract ID :
NKDR168
Submission Type
Submission Topics

Associated Sessions

Senior Scientist
,
Fraunhofer IIS

Abstracts With Same Type

Abstract ID
Abstract Title
Abstract Topic
Submission Type
Primary Author
NKDR143
Applications Machine Learning and Large Language Models Recommender systems Search and ranking
Demos
Trung Vo
NKDR166
Applications Machine Learning and Large Language Models Search and ranking Societally-motivated IR research
Demos
Rodrigo Silva
NKDR156
Applications Machine Learning and Large Language Models Search and ranking System aspects
Demos
Quang Hieu Vu
NKDR159
Applications Machine Learning and Large Language Models Search and ranking
Demos
Rodrigo Duarte
NKDR160
Applications Conversational search and recommender systems Societally-motivated IR research
Demos
Markos Dimitsas
NKDR27
Evaluation researchMachine Learning and Large Language ModelsRecommender systems
Demos
Lukas Wegmeth
2 visits