elasticsearch学习笔记高级篇（十五）——实战搜索推荐

newbornzhao

2019-11-03

准备数据

PUT _bulk
{"index": {"_index": "test_index", "_id": "1"}}
{"test_field": "hello world"}
{"index": {"_index": "test_index", "_id": "2"}}
{"test_field": "hello win"}
{"index": {"_index": "test_index", "_id": "3"}}
{"test_field": "hello dog"}
{"index": {"_index": "test_index", "_id": "4"}}
{"test_field": "hello cat"}

搜索推荐：match_phrase_prefix

match_phrase_prefix原理跟match_phrase类似，唯一的区别就是把最后一个term作为前缀去搜索。属于search time

以搜索hello w为例。

hello就会去进行match搜索，搜索对应的文档，而w会作为前缀去扫描整个倒排索引，找到所有w开头的文档，然后，找到所有文档中，既包含hello,又包含w开头的字符的文档。
最后在这些文档中根据你的slop去计算，看在slop的范围内能不能让hello和w正好跟文档中的hello和w开头的单词的position匹配。
搜索代码如下：

GET /test_index/_search
{
  "query": {
    "match_phrase_prefix": {
      "test_field": "hello w"
    }
  }
}

输出结果：

{
  "took" : 40,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 2,
      "relation" : "eq"
    },
    "max_score" : 2.5133061,
    "hits" : [
      {
        "_index" : "test_index",
        "_type" : "_doc",
        "_id" : "1",
        "_score" : 2.5133061,
        "_source" : {
          "test_field" : "hello world"
        }
      },
      {
        "_index" : "test_index",
        "_type" : "_doc",
        "_id" : "2",
        "_score" : 2.5133061,
        "_source" : {
          "test_field" : "hello win"
        }
      }
    ]
  }
}

搜索推荐：ngram

ngram做搜索推荐与前缀匹配一个很大的区别是ngram是属于index time的，在index的时候就将此进行拆分，比如world就会拆分成w、o、r、l、d。但是搜索的本质与match_phrase_prefix是一样的。

以搜索hello w为例。

建立索引：

PUT test_index
{
  "settings": {
    "analysis": {
      "filter": {
        "autocomplete_filter": { 
            "type":     "edge_ngram",
            "min_gram": 1,
            "max_gram": 20
        }
      },
      "analyzer": {
        "autocomplete": {
            "type":      "custom",
            "tokenizer": "standard",
            "filter": [
                "lowercase",
                "autocomplete_filter" 
            ]
        }
      }
    }
  },
  "mappings": {
    "properties": {
      "test_field": {
          "type":     "text",
          "analyzer": "autocomplete",
          "search_analyzer": "standard"
      }
    }
  }
}

插入数据

PUT _bulk
{"index": {"_index": "test_index", "_id": "1"}}
{"test_field": "hello world"}
{"index": {"_index": "test_index", "_id": "2"}}
{"test_field": "hello win"}
{"index": {"_index": "test_index", "_id": "3"}}
{"test_field": "hello dog"}
{"index": {"_index": "test_index", "_id": "4"}}
{"test_field": "hello cat"}

查询

GET /test_index/_search
{
  "query": {
    "match_phrase": {
      "test_field": "hello w"
    }
  }
}

输出：

{
  "took" : 1,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 2,
      "relation" : "eq"
    },
    "max_score" : 1.1620307,
    "hits" : [
      {
        "_index" : "test_index",
        "_type" : "_doc",
        "_id" : "1",
        "_score" : 1.1620307,
        "_source" : {
          "test_field" : "hello world"
        }
      },
      {
        "_index" : "test_index",
        "_type" : "_doc",
        "_id" : "2",
        "_score" : 1.1620307,
        "_source" : {
          "test_field" : "hello win"
        }
      }
    ]
  }
}

elasticsearch test

安科网

elasticsearch学习笔记高级篇（十五）——实战搜索推荐

newbornzhao

准备数据

搜索推荐：match_phrase_prefix

以搜索hello w为例。

搜索推荐：ngram

以搜索hello w为例。

建立索引：

插入数据

查询

newbornzhao

相关推荐

Elasticsearch py客户端库安装及使用方法解析

ElasticSearch最全详细使用教程

十张图说清Elasticsearch原理！

ElasticSearch 交互使用

django 对接elasticsearch实现全文检索

Spring Boot 集成 Elasticsearch 实战

如何对 ElasticSearch 集群进行压力测试

操作ElasticSearch插件和可视化工具 Kibana

Elasticsearch实战 | match_phrase搜不出来，怎么办？

Elasticsearch聚合后分页深入详解

Elasticsearch大文件检索性能提升20倍实践（干货）

重磅 | 死磕Elasticsearch方法论认知清单（国庆更新版）

Elasticsearch实战 | 必要的时候，还得空间换时间!

Elasticsearch索引增量统计及定时邮件实现

如何在Linux下安装部署分布式全文搜索引擎

ElasticSearch的下载、安装使用

我也是才知道ElasticSearch条件更新是这么玩的

读写成功率达99.999%，提升ElasticSearch系统稳定性的秘密

es快照备份到minio

Elasticsearch是一把梭，用起来再说？！

newbornzhao