Update by query
Introduced 1.0
You can include a query and a script as part of your update request so OpenSearch can run the script to update all of the documents that match the query.
Endpoints
POST <index1>, <index2>/_update_by_query
Path parameters
Parameter | Type | Description |
---|---|---|
<index> | String | Comma-separated list of indexes to update. To update all indexes, use * or omit this parameter. |
Query parameters
All parameters are optional.
Parameter | Type | Description |
---|---|---|
allow_no_indices | Boolean | Whether to ignore wildcards that don’t match any indexes. Default is true . |
analyzer | String | Analyzer to use in the query string. |
analyze_wildcard | Boolean | Whether the update operation should include wildcard and prefix queries in the analysis. Default is false . |
conflicts | String | Indicates to OpenSearch what should happen if the update by query operation runs into a version conflict. Valid options are abort and proceed . Default is abort . |
default_operator | String | Indicates whether the default operator for a string query should be AND or OR . Default is OR . |
df | String | The default field if a field prefix is not provided in the query string. |
expand_wildcards | String | Specifies the type of index that wildcard expressions can match. Supports comma-separated values. Valid values are all (match any index), open (match open, non-hidden indexes), closed (match closed, non-hidden indexes), hidden (match hidden indexes), and none (deny wildcard expressions). Default is open . |
from | Integer | The starting index to search from. Default is 0. |
ignore_unavailable | Boolean | Whether to exclude missing or closed indexes in the response and ignores unavailable shards during the search request. Default is false . |
lenient | Boolean | Specifies whether OpenSearch should accept requests if queries have format errors (for example, querying a text field for an integer). Default is false . |
max_docs | Integer | How many documents the update by query operation should process at most. Default is all documents. |
pipeline | String | ID of the pipeline to use to process documents. |
preference | String | Specifies which shard or node OpenSearch should perform the update by query operation on. |
q | String | Lucene query string’s query. |
request_cache | Boolean | Specifies whether OpenSearch should use the request cache. Default is whether it’s enabled in the index’s settings. |
refresh | Boolean | If true, OpenSearch refreshes shards to make the update by query operation available to search results. Valid options are true and false . Default is false . |
requests_per_second | Integer | Specifies the request’s throttling in sub-requests per second. Default is -1, which means no throttling. |
routing | String | Value used to route the update by query operation to a specific shard. |
scroll | Time | How long to keep the search context open. |
scroll_size | Integer | Size of the operation’s scroll request. Default is 1000. |
search_type | String | Whether OpenSearch should use global term and document frequencies calculating relevance scores. Valid choices are query_then_fetch and dfs_query_then_fetch . query_then_fetch scores documents using local term and document frequencies for the shard. It’s usually faster but less accurate. dfs_query_then_fetch scores documents using global term and document frequencies across all shards. It’s usually slower but more accurate. Default is query_then_fetch . |
search_timeout | Time | How long to wait until OpenSearch deems the request timed out. Default is no timeout. |
slices | String or integer | The number slices to split an operation into for faster processing, specified by integer. When set to auto OpenSearch it should decides how many the number of slices for the operation. Default is 1 , which indicates an operation will not be split. |
sort | List | A comma-separated list of <field> : <direction> pairs to sort by. |
_source | String | Whether to include the _source field in the response. |
_source_excludes | String | A comma-separated list of source fields to exclude from the response. |
_source_includes | String | A comma-separated list of source fields to include in the response. |
stats | String | Value to associate with the request for additional logging. |
terminate_after | Integer | The maximum number of documents OpenSearch should process before terminating the request. |
timeout | Time | How long the operation should wait from a response from active shards. Default is 1m . |
version | Boolean | Whether to include the document version as a match. |
wait_for_active_shards | String | The number of shards that must be active before OpenSearch executes the operation. Valid values are all or any integer up to the total number of shards in the index. Default is 1, which is the primary shard. |
wait_for_completion | boolean | When set to false , the response body includes a task ID and OpenSearch executes the operation asynchronously. The task ID can be used to check the status of the task or to cancel the task. Default is set to true . |
Request body options
To update your indexes and documents by query, you must include a query and a script in the request body that OpenSearch can run to update your documents. If you don’t specify a query, then every document in the index gets updated.
{
"query": {
"term": {
"oldValue": 20
}
},
"script" : {
"source": "ctx._source.oldValue += params.newValue",
"lang": "painless",
"params" : {
"newValue" : 10
}
}
}
Example requests
POST test-index1/_update_by_query
{
"query": {
"term": {
"oldValue": 10
}
},
"script" : {
"source": "ctx._source.oldValue += params.newValue",
"lang": "painless",
"params" : {
"newValue" : 20
}
}
}
Example response
{
"took": 21,
"timed_out": false,
"total": 1,
"updated": 1,
"deleted": 0,
"batches": 1,
"version_conflicts": 0,
"noops": 0,
"retries": {
"bulk": 0,
"search": 0
},
"throttled_millis": 0,
"requests_per_second": -1.0,
"throttled_until_millis": 0,
"failures": []
}
Response body fields
Field | Description |
---|---|
took | The amount of time in milliseconds OpenSearch needed to complete the operation. |
timed_out | Whether any update requests during the operation timed out. |
total | Total number of documents processed. |
updated | Total number of documents updated. |
batches | Number of scroll responses the request processed. |
version_conflicts | Number of conflicts the request ran into. |
noops | How many update requests OpenSearch ignored during the operation. This field always returns 0. |
retries | The number of bulk and search retry requests. |
throttled_millis | Number of throttled milliseconds during the request. |
requests_per_second | Number of requests executed per second during the operation. |
throttled_until_millis | The amount of time until OpenSearch executes the next throttled request. Always equal to 0 in an update by query request. |
failures | Any failures that occur during the request. |