Installing Chinese Word Segmentation Plugins for ElasticSearch
1. ik
Installing with bin/plugin -install medcl/elasticsearch-analysis-ik/1.1.3 is no longer supported.
Instead, create an analysis-ik directory under elasticsearch/plugins,
then download the package directly from:
https://github.com/medcl/elasticsearch-rtf/tree/master/elasticsearch/plugins/analysis-ik
- cd plugins
- mkdir analysis-ik
- cd analysis-ik
- wget -O elasticsearch-analysis-ik-1.1.4.jar "https://github.com/medcl/elasticsearch-rtf/blob/master/elasticsearch/plugins/analysis-ik/elasticsearch-analysis-ik-1.1.4.jar?raw=true" --no-check-certificate
The dictionary is installed the same way as before (run from the elasticsearch root directory):
- cd config
- wget http://github.com/downloads/medcl/elasticsearch-analysis-ik/ik.zip --no-check-certificate
- unzip ik.zip
- rm ik.zip
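Because the jar is fetched from a GitHub page URL rather than a release archive, it can be worth confirming that the saved file really is a jar before restarting the node. A jar is a zip archive, so it should begin with the bytes "PK". The following is a minimal sketch; the function name is_jar is hypothetical and not part of any plugin.

```shell
#!/bin/sh
# Hypothetical helper: succeeds only if the file exists and starts with
# the zip magic bytes "PK" (a jar is a zip archive).
is_jar() {
    [ -f "$1" ] && [ "$(head -c 2 "$1")" = "PK" ]
}

# Usage, with the path taken from the steps above:
#   is_jar plugins/analysis-ik/elasticsearch-analysis-ik-1.1.4.jar \
#       || echo "download is not a jar archive" >&2
```

If the check fails, the saved file is most likely an HTML page, and the download should be repeated.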
2. mmseg
Download the package directly and install it:
- cd plugins
- mkdir analysis-mmseg
- cd analysis-mmseg
- wget -O elasticsearch-analysis-mmseg-1.1.2.jar "https://github.com/medcl/elasticsearch-rtf/blob/master/elasticsearch/plugins/analysis-mmseg/elasticsearch-analysis-mmseg-1.1.2.jar?raw=true" --no-check-certificate
Dictionary (run from the elasticsearch root directory):
- cd config
- mkdir mmseg
- cd mmseg
- wget https://github.com/medcl/elasticsearch-rtf/raw/master/elasticsearch/config/mmseg/chars.dic --no-check-certificate
- wget https://github.com/medcl/elasticsearch-rtf/raw/master/elasticsearch/config/mmseg/units.dic --no-check-certificate
- wget https://github.com/medcl/elasticsearch-rtf/raw/master/elasticsearch/config/mmseg/words-my.dic --no-check-certificate
- wget https://github.com/medcl/elasticsearch-rtf/raw/master/elasticsearch/config/mmseg/words.dic --no-check-certificate
3. paoding
Download the package directly and install it:
- cd plugins
- mkdir analysis-paoding
- cd analysis-paoding
- wget -O elasticsearch-analysis-paoding-1.0.1.jar "https://github.com/medcl/elasticsearch-rtf/blob/master/elasticsearch/plugins/analysis-paoding/elasticsearch-analysis-paoding-1.0.1.jar?raw=true" --no-check-certificate
Dictionary (run from the elasticsearch root directory):
- cd config
- mkdir paoding
- cd paoding
- wget https://github.com/downloads/medcl/elasticsearch-analysis-paoding/config.zip --no-check-certificate
- unzip config.zip
- cp -rp config/paoding/* .
- rm -rf config
- rm config.zip
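After the three installs, a quick check of the directory layout can catch a failed download before the node is restarted. The sketch below only assumes the directory names used in the steps above; the function names has_jar and check_layout are hypothetical. It looks for a file ending in .jar in each plugin directory, since wget may keep the ?raw=true suffix in the saved file name unless -O is given.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the directory holds at least one *.jar file.
has_jar() {
    for f in "$1"/*.jar; do
        [ -f "$f" ] && return 0
    done
    return 1
}

# Hypothetical helper: check_layout ES_HOME
# Prints one line per problem and returns non-zero if anything is missing.
check_layout() {
    es_home=$1
    rc=0
    for name in ik mmseg paoding; do
        if ! has_jar "$es_home/plugins/analysis-$name"; then
            echo "no .jar file in plugins/analysis-$name"
            rc=1
        fi
    done
    # Dictionary directories created with mkdir in the steps above.
    for name in mmseg paoding; do
        if [ ! -d "$es_home/config/$name" ]; then
            echo "missing directory config/$name"
            rc=1
        fi
    done
    return "$rc"
}

# Usage (the path is an assumption; substitute your install directory):
#   check_layout /opt/elasticsearch && echo "layout looks complete"
```

The ik dictionary is left out of the check because its unpacked layout depends on the contents of ik.zip.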
Complete elasticsearch configuration
Edit elasticsearch.yml:
index:
  analysis:
    tokenizer:
      mmseg_maxword:
        type: mmseg
        seg_type: "max_word"
      mmseg_complex:
        type: mmseg
        seg_type: "complex"
      mmseg_simple:
        type: mmseg
        seg_type: "simple"
    analyzer:
      mmseg:
        alias: [news_analyzer, mmseg_analyzer]
        type: org.elasticsearch.index.analysis.MMsegAnalyzerProvider
      ik:
        alias: [ik_analyzer]
        type: org.elasticsearch.index.analysis.IkAnalyzerProvider
      ik_max_word:
        type: ik
        use_smart: false
      ik_smart:
        type: ik
        use_smart: true
      paoding:
        alias: [paoding_analyzer]
        type: org.elasticsearch.index.analysis.PaodingAnalyzerProvider
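Once the node has been restarted with this configuration, the analyzers can be tried out through the _analyze endpoint. The sketch below assumes an ElasticSearch release from the same era as these plugins, where the analyzer is passed as a query parameter and the text as the request body; later releases expect a JSON body instead. The helper names, the host localhost:9200, and the index name test are all assumptions for illustration.

```shell
#!/bin/sh
# Hypothetical helper: build the _analyze URL for a host, index and analyzer.
analyze_url() {
    printf 'http://%s/%s/_analyze?analyzer=%s&pretty=true\n' "$1" "$2" "$3"
}

# Hypothetical helper: send sample text to an analyzer (needs a running node
# and an existing index).
analyze() {
    curl -s -XGET "$(analyze_url "$1" "$2" "$3")" -d "$4"
}

# Usage, assuming a node on localhost:9200 and an index named "test":
#   analyze localhost:9200 test ik_smart '中华人民共和国国歌'
#   analyze localhost:9200 test ik_max_word '中华人民共和国国歌'
#   analyze localhost:9200 test mmseg '中华人民共和国国歌'
```

Comparing the tokens returned for ik_smart and ik_max_word on the same text is a simple way to see the effect of the use_smart setting.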