› ›

使用 Ingest Pipeline 解析数据

当您使用 Elasticsearch 作为输出时，您可以配置 Heartbeat 使用 Ingest Pipeline 来预处理文档，然后再在 Elasticsearch 中进行实际索引。当您想对数据进行一些额外的处理，但不需要 Logstash 的全部功能时，Ingest Pipeline 是一个方便的处理选项。例如，您可以在 Elasticsearch 中创建一个 Ingest Pipeline，它包含一个处理器用于删除文档中的字段，然后是另一个处理器用于重命名字段。

在 Elasticsearch 中定义 pipeline 后，您只需配置 Heartbeat 来使用该 pipeline。要配置 Heartbeat，您需要在 heartbeat.yml 文件中 elasticsearch 下的 parameters 选项中指定 pipeline ID。

output.elasticsearch:
  hosts: ["localhost:9200"]
  pipeline: my_pipeline_id

例如，假设您在一个名为 pipeline.json 的文件中定义了以下 pipeline：

{
    "description": "Test pipeline",
    "processors": [
        {
            "lowercase": {
                "field": "agent.name"
            }
        }
    ]
}

要在 Elasticsearch 中添加 pipeline，您需要运行：

curl -H 'Content-Type: application/json' -XPUT 'https://127.0.0.1:9200/_ingest/pipeline/test-pipeline' [email protected]

然后在 heartbeat.yml 文件中，您需要指定：

output.elasticsearch:
  hosts: ["localhost:9200"]
  pipeline: "test-pipeline"

运行 Heartbeat 时，agent.name 的值在索引之前会转换为小写。

有关定义预处理 pipeline 的更多信息，请参阅 Ingest Pipeline 文档。

« 在配置中使用环境变量避免 YAML 格式问题 »