Blog

Introducing Root Cause Analysis with Open Distro for Elasticsearch

By , , October 30, 2019June 18th, 2025No Comments

This post was imported from the Open Distro For Elasticsearch blog, a predecessor project of OpenSearch. Information reflected in this post may not be current or accurate.

If you’re interested in the operational behavior of your Elasticsearch cluster, then root cause analysis can help you identify fundamental issues that affect availability and performance of the cluster. Root cause analysis (RCA) is a problem solving technique used to examine symptoms of problems you’re interested in solving and to work backwards from those symptoms to the causes of the problems.

We are building a root cause analysis engine for Open Distro for Elasticsearch. This smart engine along with Performance Analyzer will help users improve availability and performance of their Elasticsearch clusters. While we design the root cause analysis engine, you can weigh in with your feedback on the design proposal. We’d love it if you add your comments and use cases so that Open Distro for Elasticsearch will support your needs too!

Open Distro for Elasticsearch comes with a Performance Analyzer plugin that helps compute and expose diagnostic metrics for Elasticsearch clusters. This useful tool enables Elasticsearch users to measure and understand bottlenecks in their clusters. Open Distro for Elasticsearch also bundles a light weight client – PerfTop. You can learn more about debugging with PerfTop which provides real time visualization of Performance Analyzer’s diagnostic metrics.

The root cause analysis framework extends on the Performance Analyzer architecture by building a data flow graph that computes root causes.

If you have any questions, please feel free to reach out to us. You can tag us on GitHub (Alolita @alolita, Partha @aesgithub, Balaji @sendkb) with your questions.

We anticipate starting development on RCA framework in the next couple of weeks. We invite the community to collaborate with us on building this framework, and in making Elasticsearch manageability more seamless.

With your interest and feedback, the root cause analysis framework will become a valuable tool for everyone using Open Distro for Elasticsearch.

Look forward to your feedback!

Alolita Sharma, Partha Kanuparthy, Balaji Kannan

Authors

  • Alolita Sharma is a Principal Technologist at AWS where she leads open source observability engagement and development of OpenTelemetry, Prometheus components focused on metrics and log based observability. Alolita also serves as a member of the OpenTelemetry Governance Committee and as a board director of the Unicode Consortium. She contributes to open standards on the Unicode Technical Committee and W3C. She has served on the boards of the OSI and SFLC.in. Alolita has built and led engineering teams at Wikipedia, Twitter, PayPal and IBM. Two decades of doing open source continue to inspire her.

    View all posts
  • Partha Kanuparthy