- 授权协议: 未知
- 开发语言: Python
- 操作系统: 跨平台
- 软件首页: https://github.com/fxsjy/finalseg
- 软件文档: https://github.com/fxsjy/finalseg/blob/master/README.md
- 官方下载: https://github.com/fxsjy/finalseg/downloads
软件介绍
finalseg -> Chinese Words Segment Library in Python based on HMM Model 基于HMM模型的中文分词
用法
- 将finalseg目录放置于当前目录或者site-packages目录
- import finalseg
代码示例
import finalseg
sentence_list = [
"姚晨和老凌离婚了",
"他说的确实在理",
"长春市长春节讲话"
]
print u"=默认效果"
for sentence in sentence_list:
seg_list = finalseg.cut(sentence)
print "/ ".join(seg_list)
print u"\n=打开新词发现功能后的效果\n"
for sentence in sentence_list:
seg_list = finalseg.cut(sentence,find_new_word=True)
print "/ ".join(seg_list)
算法
- 算法是基于HMM模型,采用了Viterbi算法
- 可以选择是否打开新词发现功能
- 算法简单,只有89行纯Python代码 https://github.com/fxsjy/finalseg/blob/master/finalseg/__init__.py
性能
- 200 KB/Second
- Test Env: Intel(R) Core(TM) i7-2600 CPU @ 3.4GHz;《围城》.txt
例子
- 在线分词效果展示 https://finalseg.appspot.com/ (需要翻墙)
Rationality for Mortals
Gerd Gigerenzer / Oxford University Press, USA / 2008-05-02 / USD 65.00
Gerd Gigerenzer's influential work examines the rationality of individuals not from the perspective of logic or probability, but from the point of view of adaptation to the real world of human behavio......一起来看看 《Rationality for Mortals》 这本书的介绍吧!
