OpenSearchCon 2024 Session: RAGElo: An Elo Rating-based Evaluation Toolkit for RAG · OpenSearch

OpenSearch Core

OpenSearch Dashboards

OpenSearch Data Prepper

Performance Benchmarks

Search

Observability

Security Analytics

Machine Learning and AI

Forum

Providers

Events

Projects

Members

Documentation Library

OpenSearch and Dashboards

OpenSearch Benchmark

OpenSearch Data Prepper

Clients

Visit the OpenSearch Blog

Platform

OpenSearch

OpenSearch is a powerful search and analytics engine built on Apache Lucene.

OpenSearch Dashboards

Our data visualization toolset is a flexible, fully integrated solution for visually exploring and querying your data.

OpenSearch Data Prepper

A server-side data collector designed to enrich, transform, and aggregate data for downstream analytics with OpenSearch.

Capabilities

Machine Learning and AI

Vector Database

Anomaly Detection

Search

E-Commerce

Document Search

Observability

Performance Monitoring

Log Analysis

Security Analytics

Threat Intelligence

Event Correlation

Performance Benchmarks

View key performance metrics across different workloads

View Data

Platform Drawer Icon

OpenSearch Blog

Most Recent Articles
Generative AI: OpenSearch's journey as an open-source search engine	Mar 26
OpenSearch as a SIEM Solution	Mar 20
GPU-accelerated vector search in OpenSearch: A new frontier	Mar 18
Solution Provider Highlight - Enhancing anomaly detection in Amazon OpenSearc...	Mar 07
Tracking the evolution of OpenSearch performance	Mar 06
Efficient large-scale filtering with bitmap filtering in OpenSearch	Feb 25
Reduce costs with disk-based vector search	Feb 19
From chaos to clarity: Revolutionizing OpenSearch clients and documentation u...	Feb 13
Introducing reciprocal rank fusion for hybrid search	Feb 12
Explore OpenSearch 2.19	Feb 11

Featured Post

Blog Drawer Icon

OpenSearch Community

Forum

Find answers to your questions, help others in the community, and join the conversation.

Solution Providers

Find open-source providers offering solutions and services.

Events

Community Meetings, Development Backlog & Triage, in-person, and virtual events.

Projects

Highlights of projects built by the community.

Members

Community member profiles.

User Groups

Join the OpenSearch Project Meetup Network

Community Resources

Slack

Speak with other developers in the OpenSearch community in our public Slack.

Github Project Organization

Join us for in-person and virtual events to learn the latest about the project.

OpenSearchCon

Europe: Apr 2025

OpenSearchCons in 2024

North America (San Francisco): September 24-26

India (Bengaluru): June 26

Europe (Berlin): May 6-7

Community Drawer Icon

Documentation Library

OpenSearch and Dashboards

Build your OpenSearch solution using core tooling and visualizations.

OpenSearch Benchmark

Measure performance metrics for your OpenSearch cluster.

OpenSearch Data Prepper

Filter, mutate, and sample your data for ingestion into OpenSearch.

Clients

Interact with OpenSearch from your application using language APIs.

OpenSearch Project Roadmap

OpenSearch Project Roadmap

Read our blog on 2024-2025 project development plan and beyond!

Documentation Drawer Icon

Search Drawer Icon

Retrieval Augmented Generation (RAG) has become the workhorse of Large Language Models (LLMs) for Question Answering and Chat grounded in private data sets. On the R side, search engines provide many different retrieval strategies for finding relevant information; vector search, BM25, hybrid search, re-ranking, etc. On the G side, prompt engineering is more like an art than a science; small variations in the prompt can lead to wildly different results. When combined with agent-style generation, where the LLM is in charge of deciding the query, search filters, and retrieval strategy based on the user intent, the number of possible solution variations becomes astronomical. On top of all of this, standard evaluation techniques of comparing to “gold standard” answers are not always feasible, as the answer might not be known or might be too expensive to obtain. This is where RAGElo comes in. RAGElo creates an Elo ranking system for the different RAG solutions. Here, powerful LLMs employ reasoning techniques to evaluate pairs of answers alongside a set of questions, taking into account the information retrieved by the search engine.

Details

Tuesday, May 7 10:55am-11:35am in Asgabat

Track: Community

Speakers

Fernando Rejon Barrera photograph

Fernando Rejon Barrera

CTO at Zeta Alpha

Jakub Zavrel photograph

Jakub Zavrel

View All Sessions

View All Speakers