Skip to content

tsroten/pynlpir

Repository files navigation

PyNLPIR

image

image

PyNLPIR is a Python wrapper around the NLPIR/ICTCLAS Chinese segmentation software.

About

Easily segment text using NLPIR, one of the most widely-regarded Chinese text analyzers:

Features

  • Helper functions for common use cases
  • English/Chinese part of speech mapping
  • Support for UTF-8, GBK, and BIG5 encoded strings (and unicode of course!)
  • Access to NLPIR's C functions via ctypes
  • Includes a copy of NLPIR
  • Supports macOS (Intel), Linux, and Windows

Getting Started

About

A Python wrapper around the NLPIR/ICTCLAS Chinese segmentation software.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages