rth/vtext
Simple NLP in Rust with Python bindings
Stargazers: 148, Watchers: 5, Issues: 18, Forks: 11
rth/vtext Issues
NLP pipeline design (updated 4 years ago, 10 comments)
Character n-grams (updated 4 years ago, 4 comments)
Fine-tune tokenizers (updated 4 years ago)
Standardize language option (updated 4 years ago)
Rename UnicodeSegmentTokenizer to UnicodeWordTokenizer (closed 4 years ago, 1 comment)
ENH Avoid copying tokens in tokenizers in Python (closed 4 years ago, 1 comment)
Add sentence splitter (closed 4 years ago, 8 comments)
Make to_ascii_lowercase optional (updated 5 years ago, 4 comments)
Better support of configuration parameters in vectorizers (closed 5 years ago, 2 comments)
General architecture feedback (updated 5 years ago, 2 comments)
Word n-grams (updated 5 years ago)
Implement IDF transforms (updated 5 years ago)
Build release wheels with LTO (updated 6 years ago)
Make estimators picklables (updated 6 years ago)
Better unicode support in tokenization rules (updated 6 years ago, 1 comment)
Multi-OS Python wheels (closed 6 years ago)
Support different hash functions in HashingVectorizer (closed 6 years ago, 2 comments)
Python wrappers (closed 6 years ago, 1 comment)