Bringing intelligence to OpenSearch: Introducing the OpenSearch agent server

Real-world OpenSearch deployments serve diverse users: developers querying logs, analysts exploring metrics, engineers optimizing search, and business users seeking insights. A single generalist assistant trying to handle these different tasks sacrifices depth in each area. What if you could match the right specialist assistant to each task?

The OpenSearch agent server, released as experimental in OpenSearch 3.6, is a multi-agent orchestration platform that enables you to build specialized AI agents that work together within OpenSearch. The agent server platform provides infrastructure for creating focused agents—each with distinct expertise and tools—that collaborate through an intelligent routing layer.

A default agent serves as a general assistant for broad queries, while specialized agents handle specific domains such as search relevance tuning. The first specialized agent available is the Automated Relevance Tuning (ART) agent. For more information, see Introducing OpenSearch Relevance Agent.

In this post, we’ll describe the agent server architecture, its key features, and how to start building your own agents.

TL;DR

The OpenSearch agent server is a multi-agent orchestration platform (experimental in OpenSearch 3.6) that routes queries to specialized AI agents.

Routes tasks to the best-fit specialist agent
Built on MCP server, orchestration, and AG-UI streaming
First specialist: Automated Relevance Tuning (ART) agent
Includes Bedrock LLM integration and OBO token security
Install: pip install opensearch-agent-server

Architecture and core concepts

The agent server is built on three foundational components: a standalone Model Context Protocol (MCP) server, a multi-agent orchestration layer, and the Agent-User Interface (AG-UI) protocol for real-time streaming, as shown in the following image.

The OpenSearch MCP server

The foundational layer is a standalone OpenSearch MCP server. This server connects to your OpenSearch cluster and exposes search, aggregation, and index management operations as reusable tools accessible to all agents.

Multi-agent orchestration

The orchestration layer routes incoming requests to the right agent based on context and intent. Agents register their capabilities at startup, and the router matches requests to the most appropriate specialist. If no specialized agent matches the request, a default agent handles general OpenSearch queries. This context-aware routing ensures that users always get a response from the agent best equipped to help.

AG-UI protocol

The AG-UI protocol handles real-time streaming responses between OpenSearch Dashboards and agents. This enables a responsive conversational experience in which users receive results continuously rather than waiting for complete responses.

Because all agents share the same MCP server, you don’t need to reimplement OpenSearch operations for each new agent.

Agent server capabilities

The agent server includes the following built-in capabilities for production use.

Flexible LLM integration

The agent server supports large language model (LLM) integration through Amazon Bedrock, allowing you to use powerful foundation models for agent reasoning. Additional model providers will be supported in the future.

Security with on-behalf-of token passing

Security is handled through on-behalf-of (OBO) token passing from OpenSearch Dashboards. When enabled, the agent server receives the authenticated user’s identity through OBO tokens, ensuring that all OpenSearch operations enforce user-level permissions rather than running under a service account. This preserves proper access controls throughout the request chain.

Production-ready resilience

The platform includes built-in retry logic with exponential backoff for resilient LLM and OpenSearch interactions, plus structured observability logging to track agent behavior and diagnose issues in production.

MCP

MCP provides the standardized interface between agents and OpenSearch. It exposes cluster operations as composable tools that agents can orchestrate without reimplementing low-level functionality. New agents can immediately use the full capabilities of OpenSearch through a well-defined, secure abstraction layer.

Getting started

To get started with the agent server, follow these steps.

Prerequisites

Before running the server, install the following tools and configure Amazon Bedrock credentials:

Java 21+
Node.js 20.x
Python 3.12+
uv
Amazon Bedrock credentials for LLM inference
OpenSearch 3.6+ (the cluster the MCP server connects to)
OpenSearch Dashboards 3.6+ (required for the chat UI and AG-UI integration)

Copy the environment template and add your Bedrock settings:

cp .env.example .env

Add the following to your .env file:

AWS_ACCESS_KEY_ID=your_access_key
AWS_SECRET_ACCESS_KEY=your_secret_key
AWS_REGION=us-east-1
BEDROCK_INFERENCE_PROFILE_ARN=arn:aws:bedrock:...

The rest of the defaults in .env.example are preconfigured for local development.

Running the quickstart

The quickstart script configures the full development stack using one command. It sets up OpenSearch with streaming plugins, OpenSearch Dashboards, the MCP server, the agent server, and adds sample data:

./scripts/quickstart.sh

Installing from PyPI

If you already have an OpenSearch cluster running and don’t need the full quickstart setup, you can install and run the agent server directly from PyPI:

pip install opensearch-agent-server

Configure your environment:

export OPENSEARCH_URL=https://localhost:9200
export OPENSEARCH_USERNAME=admin
export OPENSEARCH_PASSWORD=admin
export AG_UI_AUTH_ENABLED=false

Start the agent server and MCP server together in a single process:

opensearch-agent-server --with-mcp

This command starts both the OpenSearch MCP server (port 3001) and the agent server (port 8001). To stop both servers, press Ctrl/Cmd+C. You can verify that both services are running by running the following commands:

curl http://localhost:8001/health    # {"status": "ok"}
curl http://localhost:8001/agents    # list registered agents

For more options, including customizing the MCP server port and configuration, see the OpenSearch agent server README.

Configure OpenSearch Dashboards

To make the chat assistant available in OpenSearch Dashboards, add the following to your OpenSearch-Dashboards/config/opensearch_dashboards.yml:

# Enable new UI header (required for chat button to appear)
uiSettings:
  overrides:
    "home:useNewHomePage": true

# Send page context (app ID, filters, queries) to the agent
contextProvider:
  enabled: true

# Connect Dashboards to the Agent Server
chat:
  enabled: true
  agUiUrl: "http://localhost:8001/runs"

For a complete example, including OBO token forwarding for authenticated MCP tool calls and other optional settings, see opensearch_dashboards.example.yml.

Start (or restart) OpenSearch Dashboards to apply the config:

./bin/opensearch-dashboards

Once Dashboards is running, the chat icon appears in the top-right header.

Your first interaction

Once the script completes, go to OpenSearch Dashboards at http://localhost:5601 and open the chat interface in the upper-right corner. Then try the agents:

Try the default agent: Ask a question such as “How is my cluster health?” The default agent will use the MCP server to query your cluster and stream back results in real time.
Try the ART agent: When you navigate to the Search Relevance page, the same chat interface automatically routes questions to the ART agent. Try asking “What are my most popular queries?” You’ll notice different tools being used for search relevance purposes to answer this question.

What’s next

We’re actively working on the following enhancements that will enable entirely new classes of agents and use cases:

Agentic memory: Enabling agents to maintain context across conversations and learn from past interactions.
Multimodal support: Enabling agents that work with images, documents, and other rich data types beyond text.

Get involved

The OpenSearch agent server is an open-source project, and we welcome community contributions. You can explore the code in the OpenSearch agent server repository, review open issues for areas where you can contribute, and share your ideas for new agents or platform improvements. Whether you’re building a custom agent for your use case, enhancing the orchestration layer, or adding new capabilities, your contributions help shape the future of intelligent interactions in OpenSearch.

Resources

For more information, see the following resources:

AI Summary

The OpenSearch agent server, released as experimental in OpenSearch 3.6, is a multi-agent orchestration platform that enables you to build specialized AI agents that work together within OpenSearch.

It sets up OpenSearch with streaming plugins, OpenSearch Dashboards, the MCP server, the agent server, and adds sample data:.

If you already have an OpenSearch cluster running and don't need the full quickstart setup, you can install and run the agent server directly from PyPI: Configure your environment: Start the agent server and MCP server together in a single process: This command starts both the OpenSearch MCP server (port 3001) and the agent server (port 8001).

Authors

Mingshi Liu

Mingshi Liu is a software development engineer at AWS working on OpenSearch.

View all posts
Jiaping Zeng

Jiaping Zeng is a Software Engineer at AWS working on the OpenSearch Project. He primarily works on OpenSearch's AI/ML and semantic search features.

View all posts
Nate Po Hong Lau

Nate Po Hong Lau is a software development engineer at AWS working on OpenSearch.

View all posts

Bringing intelligence to OpenSearch: Introducing the OpenSearch agent server

TL;DR

Architecture and core concepts

The OpenSearch MCP server

Multi-agent orchestration

AG-UI protocol

Agent server capabilities

Flexible LLM integration

Security with on-behalf-of token passing

Production-ready resilience

MCP

Getting started

Prerequisites

Running the quickstart

Installing from PyPI

Configure OpenSearch Dashboards

Your first interaction

What’s next

Get involved

Resources

Authors

OpenSearch is a community-driven, Apache 2.0-licensed open source search and analytics suite that makes it easy to ingest, search, visualize, and analyze data.

Participate

Providers

Resources

Bringing intelligence to OpenSearch: Introducing the OpenSearch agent server

TL;DR

Architecture and core concepts

The OpenSearch MCP server

Multi-agent orchestration

AG-UI protocol

Agent server capabilities

Flexible LLM integration

Security with on-behalf-of token passing

Production-ready resilience

MCP

Getting started

Prerequisites

Running the quickstart

Installing from PyPI

Configure OpenSearch Dashboards

Your first interaction

What’s next

Get involved

Resources

Share or Summarize with AI

Authors

Participate

Providers

Resources