Introducing Root Cause Analysis with Open Distro for Elasticsearch
If you’re interested in the operational behavior of your Elasticsearch cluster, root cause analysis can help you identify fundamental issues that affect the availability and performance of the cluster. Root cause analysis (RCA) is a problem-solving technique that examines the symptoms of a problem and works backwards from those symptoms to their causes.
We are building a root cause analysis engine for Open Distro for Elasticsearch. This engine, along with Performance Analyzer, will help users improve the availability and performance of their Elasticsearch clusters. While we design the root cause analysis engine, you can weigh in with your feedback on the design proposal. We’d love for you to add your comments and use cases so that Open Distro for Elasticsearch supports your needs too!
Open Distro for Elasticsearch comes with a Performance Analyzer plugin that computes and exposes diagnostic metrics for Elasticsearch clusters. This tool enables Elasticsearch users to measure and understand bottlenecks in their clusters. Open Distro for Elasticsearch also bundles a lightweight client, PerfTop. You can learn more about debugging with PerfTop, which provides real-time visualization of Performance Analyzer’s diagnostic metrics.
The root cause analysis framework extends the Performance Analyzer architecture by building a data flow graph that computes root causes.
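To make the idea of a data flow graph more concrete, here is a minimal sketch in Python. It is not the RCA framework's design or API: the `Node` class, the `evaluate` function, the node names, the thresholds, and the sample metric values are all hypothetical, chosen only to illustrate how metrics could flow into symptoms and symptoms into a root cause.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Node:
    """One vertex in a hypothetical data flow graph."""
    name: str
    # Receives the outputs of the upstream nodes, keyed by node name.
    compute: Callable[[Dict[str, object]], object]
    upstream: List[str] = field(default_factory=list)


def evaluate(nodes: List[Node]) -> Dict[str, object]:
    """Evaluate every node after its upstream nodes and return all outputs."""
    by_name = {n.name: n for n in nodes}
    results: Dict[str, object] = {}

    def visit(name: str, stack: tuple = ()) -> None:
        if name in results:
            return
        if name in stack:
            raise ValueError(f"cycle detected at {name}")
        node = by_name[name]
        for dep in node.upstream:
            visit(dep, stack + (name,))
        results[name] = node.compute({d: results[d] for d in node.upstream})

    for n in nodes:
        visit(n.name)
    return results


# Assumed sample data: metric nodes return made-up readings.
graph = [
    Node("heap_used_pct", lambda _: [91.0, 93.5, 95.2]),
    Node("gc_pause_ms", lambda _: [800, 1200, 1500]),
    # Symptom nodes turn raw metrics into boolean observations.
    Node("high_heap_symptom",
         lambda i: min(i["heap_used_pct"]) > 90.0,
         ["heap_used_pct"]),
    Node("long_gc_symptom",
         lambda i: max(i["gc_pause_ms"]) > 1000,
         ["gc_pause_ms"]),
    # The root cause node combines symptoms into a conclusion.
    Node("heap_pressure_root_cause",
         lambda i: i["high_heap_symptom"] and i["long_gc_symptom"],
         ["high_heap_symptom", "long_gc_symptom"]),
]

results = evaluate(graph)
print(results["heap_pressure_root_cause"])
```

In this sketch each node declares only what it depends on, so new symptoms or root causes could be added without changing the evaluation logic. The actual framework may organize its graph quite differently.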
We anticipate starting development on the RCA framework in the next couple of weeks. We invite the community to collaborate with us on building this framework and on making Elasticsearch manageability more seamless.
With your interest and feedback, the root cause analysis framework will become a valuable tool for everyone using Open Distro for Elasticsearch.
We look forward to your feedback!
Alolita Sharma, Partha Kanuparthy, Balaji Kannan