OpenSearchCon 2024 North America Session: Luna — LLM-powered Unstructured Analytics (aka “RAG-supercharged”)

Platform

Capabilities

Community

Documentation

Most Recent Articles
Generative AI: OpenSearch's journey as an open-source search engine	Mar 26
OpenSearch as a SIEM Solution	Mar 20
GPU-accelerated vector search in OpenSearch: A new frontier	Mar 18
Solution Provider Highlight - Enhancing anomaly detection in Amazon OpenSearc...	Mar 07
Tracking the evolution of OpenSearch performance	Mar 06
Efficient large-scale filtering with bitmap filtering in OpenSearch	Feb 25
Reduce costs with disk-based vector search	Feb 19
From chaos to clarity: Revolutionizing OpenSearch clients and documentation u...	Feb 13
Introducing reciprocal rank fusion for hybrid search	Feb 12
Explore OpenSearch 2.19	Feb 11

RAG is all the rage. In enterprise settings such financial services, healthcare, pharma, and more, users want accurate and explainable answers from LLMs. Unfortunately, LLM hallucinate, and thus can be a liability. To address this, developers are using retrieval-augmented generation (RAG) — an approach where LLMs synthesize answers from external data to limit hallucinations. We argue that pure search-based RAG approaches are insufficient and that we need a more general approach inspired by the tenets that made relational databases successful. We call this Luna for LLM-powered unstructured analytics. With Luna, users ask questions in free-form natural language, and the system uses LLMs to automatically generate and execute a query plan. The query plan can include analytics functions, hybrid search queries, LLM-based data processing, and more. In our case, the plan uses OpenSearch and LLMs to compute answers from complex, unstructured documents such as PDFs, HTML, and presentations. In this talk, we outline customers use cases and how Luna handles three styles of questions, “hunt-and-peck”, “sweep-and-harvest”, and “data integration,” that arise from these use cases. We describe the Luna planner, which uses LLMs to automatically construct query plans and translate them into search pipelines in OpenSearch. We describe how we extend OpenSearch search pipelines to support external Python-native operators that easily integrate with LLMs and their libraries. We show that combining the Luna planner with OpenSearch and its extended set of search and analytics functionality, we can achieve higher accuracy and answer a richer spectrum of questions than existing RAG approaches.