当前位置:   article > 正文

elasticsearch之pipeline_elasticsearch pipeline

elasticsearch pipeline

一,测试pipeline,按逗号拆分字符串为数组,然后对数组的每个项去空格。

POST _ingest/pipeline/_simulate
{
  "pipeline" :
  {
    "description": "_description",
    "processors": [
      {
        "set" : {
          "field" : "field2",
          "value" : "_value"
        }
      },
      {
        "split": {
          "field": "words",
          "separator": "," 
        }
      },
      {
        "foreach" : {
          "field" : "words",
          "processor" : {
            "trim": {
              "field" : "_ingest._value"
            }
          }
        }
      }
    ]
  },
  "docs": [
    {
      "_index": "index",
      "_id": "id",
      "_source": {
        "foo": " bar ",
        "words":"hello , world , hello2  hadoop "
      }
    },
    {
      "_index": "index",
      "_id": "id",
      "_source": {
        "foo": "rab",
        "words":"hello, world , hello2 , flink "
      }
    }
  ]
}
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49

二,设置pipeline

PUT _ingest/pipeline/split_and_trim
{
  "description" : "describe pipeline",
  "processors": [
      {
        "set" : {
          "field" : "field2",
          "value" : "_value"
        }
      },
      {
        "split": {
          "field": "words",
          "separator": "," 
        }
      },
      {
        "foreach" : {
          "field" : "words",
          "processor" : {
            "trim": {
              "field" : "_ingest._value"
            }
          }
        }
      }
    ]
}
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28

三,创建index时设置pipeline

PUT twitter
{
    "settings" : {
        "index" : {
            "number_of_shards" : 3, 
            "number_of_replicas" : 2 ,
            "default_pipeline": "split_and_trim"
        }
    }
}
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10

四,更新时指定pipeline

POST twitter2/_update_by_query?pipeline=split_and_trim

五,索引文档时指定pipeline

PUT twitter2/_doc/2?pipeline=split_and_trim
{
  "words":"hadoop222222, good , flink , spark "
}
  • 1
  • 2
  • 3
  • 4
声明:本文内容由网友自发贡献,不代表【wpsshop博客】立场,版权归原作者所有,本站不承担相应法律责任。如您发现有侵权的内容,请联系我们。转载请注明出处:https://www.wpsshop.cn/w/我家自动化/article/detail/887736
推荐阅读
相关标签
  

闽ICP备14008679号