Creating Specialized RAG-Based Search Engines Using the Open Web Index

This abstract has open access
Abstract Summary
This paper presents a concept and supporting technology for building RAG-based specialized search engines using open-source frameworks and open web data. The Open Web Index (OWI) provides openly accessible web data, while the modular MOSAIC framework is designed to integrate topical OWI partitions obtained to create search applications tailored to specific use cases. MOSAIC-RAG extends this framework with features based on Large Language Models (LLM), such as summarization or re-ranking. Using this infrastructure, special-purpose and domain-specific search applications can easily be developed and experimented with. For demonstration purposes, we present three example applications in the topical domains of science, health, and arts.
Abstract ID :
NKDR169
Submission Type
Submission Topics

Associated Sessions

Post-doctoral Researcher
,
Graz University Of Technology
University Of Passau
Leipzig University
PhD Candidate
,
Radboud University
Graz University of Technology
Postdoctoral researcher
,
University Of Passau
University Of Kassel, Hessian.AI, And ScaDS.AI
University of Passau

Abstracts With Same Type

Abstract ID
Abstract Title
Abstract Topic
Submission Type
Primary Author
NKDR143
Applications Machine Learning and Large Language Models Recommender systems Search and ranking
Demos
Trung Vo
NKDR166
Applications Machine Learning and Large Language Models Search and ranking Societally-motivated IR research
Demos
Rodrigo Silva
NKDR168
Demos
Rishiraj Saha Roy
NKDR156
Applications Machine Learning and Large Language Models Search and ranking System aspects
Demos
Quang Hieu Vu
NKDR159
Applications Machine Learning and Large Language Models Search and ranking
Demos
Rodrigo Duarte
NKDR160
Applications Conversational search and recommender systems Societally-motivated IR research
Demos
Markos Dimitsas
NKDR27
Evaluation researchMachine Learning and Large Language ModelsRecommender systems
Demos
Lukas Wegmeth
1 visits