alphacep / text_search

Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup

Home Page:https://k2-fsa.github.io/text_search/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Installation

With pip

pip install fasttextsearch

For developers

pip install numpy

git clone https://github.com/danpovey/text_search
cd text_search

mkdir build
cd build
cmake ..
make -j
make test

# set PYTHONPATH so that you can use "import textsearch"

export PYTHONPATH=$PWD/../textsearch/python:$PWD/lib:$PYTHONPATH

Now you can use

python3 -c "import textsearch; print(textsearch.__file__)"

Caution: We did not use either python3 setup.py install or pip install. We only set the environment variable PYTHONPATH.

About

Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup

https://k2-fsa.github.io/text_search/


Languages

Language:Python 47.9%Language:C++ 31.0%Language:CMake 21.1%