Link Search Menu Expand Document Documentation Menu

You're viewing version 2.17 of the OpenSearch documentation. This version is no longer maintained. For the latest version, see the current documentation. For information about OpenSearch version maintenance, see Release Schedule and Maintenance Policy.

Analyzers

The following sections list all analyzers that OpenSearch supports.

Built-in analyzers

The following table lists the built-in analyzers that OpenSearch provides. The last column of the table contains the result of applying the analyzer to the string It’s fun to contribute a brand-new PR or 2 to OpenSearch!.

Analyzer Analysis performed Analyzer output
Standard (default) - Parses strings into tokens at word boundaries
- Removes most punctuation
- Converts tokens to lowercase
[it’s, fun, to, contribute, a,brand, new, pr, or, 2, to, opensearch]
Simple - Parses strings into tokens on any non-letter character
- Removes non-letter characters
- Converts tokens to lowercase
[it, s, fun, to, contribute, a,brand, new, pr, or, to, opensearch]
Whitespace - Parses strings into tokens on white space [It’s, fun, to, contribute, a,brand-new, PR, or, 2, to, OpenSearch!]
Stop - Parses strings into tokens on any non-letter character
- Removes non-letter characters
- Removes stop words
- Converts tokens to lowercase
[s, fun, contribute, brand, new, pr, opensearch]
Keyword (no-op) - Outputs the entire string unchanged [It’s fun to contribute a brand-new PR or 2 to OpenSearch!]
Pattern - Parses strings into tokens using regular expressions
- Supports converting strings to lowercase
- Supports removing stop words
[it, s, fun, to, contribute, a,brand, new, pr, or, 2, to, opensearch]
Language Performs analysis specific to a certain language (for example, english). [fun, contribut, brand, new, pr, 2, opensearch]
Fingerprint - Parses strings on any non-letter character
- Normalizes characters by converting them to ASCII
- Converts tokens to lowercase
- Sorts, deduplicates, and concatenates tokens into a single token
- Supports removing stop words
[2 a brand contribute fun it's new opensearch or pr to]
Note that the apostrophe was converted to its ASCII counterpart.

Language analyzers

OpenSearch supports analyzers for various languages. For more information, see Language analyzers.

350 characters left

Have a question? .

Want to contribute? or .