abrek-koc / vnlp

State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.

Home Page:https://vngrs.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

VNLP: Turkish NLP Tools

State-of-the-art, lightweight NLP tools for Turkish language.

Developed by VNGRS.

https://vngrs.com/

PyPI version Docs License

Functionality:

  • Sentence Splitter
  • Normalizer
    • Spelling/Typo correction
    • Convert numbers to word form
    • Deasciification
  • Stopword Remover:
    • Static
    • Dynamic
  • Stemmer: Morphological Analyzer & Disambiguator
  • Named Entity Recognizer (NER)
  • Dependency Parser
  • Part of Speech (POS) Tagger
  • Sentiment Analyzer
  • Turkish Word Embeddings
    • FastText
    • Word2Vec
    • SentencePiece Unigram Tokenizer
  • Text Summarization: In development progress...

Demo:

Installation

pip install vngrs-nlp

Documentation:

  • See the Documentation for the details about usage, classes, functions, datasets and evaluation metrics.

Usage Example:

Dependency Parser

from vnlp import DependencyParser
dep_parser = DependencyParser()

dep_parser.predict("Onun için yol arkadaşlarımızı titizlikle seçer, kendilerini iyice sınarız.")
[(1, 'Onun', 5, 'obl'),
(2, 'için', 1, 'case'),
(3, 'yol', 1, 'nmod'),
(4, 'arkadaşlarımızı', 5, 'obj'),
(5, 'titizlikle', 6, 'obl'),
(6, 'seçer', 7, 'acl'),
(7, ',', 10, 'punct'),
(8, 'kendilerini', 10, 'obj'),
(9, 'iyice', 8, 'advmod'),
(10, 'sınarız', 0, 'root'),
(11, '.', 10, 'punct')]

# Spacy's submodule Displacy can be used to visualize DependencyParser result.
import spacy
from vnlp import DependencyParser
dependency_parser = DependencyParser()
result = dependency_parser.predict("Oğuz'un kırmızı bir Astra'sı vardı.", displacy_format = True)
spacy.displacy.render(result, style="dep", manual = True)

About

State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.

https://vngrs.com

License:GNU Affero General Public License v3.0


Languages

Language:Python 94.7%Language:Jupyter Notebook 5.3%