Anastasia Nikiforova's starred repositories

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonLicense:Apache-2.0Stargazers:23345Issues:311Issues:974

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptLicense:Apache-2.0Stargazers:17905Issues:173Issues:2144

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14543Issues:265Issues:205

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language:PythonLicense:NOASSERTIONStargazers:13779Issues:201Issues:2308

nlp_course

YSDA course in Natural Language Processing

Language:Jupyter NotebookLicense:MITStargazers:9644Issues:368Issues:46

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustLicense:Apache-2.0Stargazers:8814Issues:121Issues:970

textdistance

📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

Language:PythonLicense:MITStargazers:3345Issues:64Issues:0

awesome-fastapi-projects

List of FastAPI projects! :sunglasses: :rocket:

parser

:rocket: State-of-the-art parsers for natural language.

Language:PythonLicense:MITStargazers:828Issues:17Issues:131

NLP-Cube

Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing

Language:HTMLLicense:Apache-2.0Stargazers:551Issues:31Issues:59

xllm

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

Language:PythonLicense:Apache-2.0Stargazers:365Issues:3Issues:11

compling_nlp_hse_course

Материалы курса по компьютерной лингвистике Школы Лингвистики НИУ ВШЭ

Language:Jupyter NotebookStargazers:171Issues:8Issues:1

Pytorch-tutorial-on-Google-colab

PyTorch Tutorial on google colaboratory.

pytorch_Highway_Networks

Highway Networks implement in pytorch

DeepNLP

Курс по глубокому обучению в обработке естественных языков для магистров компьютерной лингвистики Высшей Школы Экономики

Language:Jupyter NotebookStargazers:47Issues:6Issues:0

pytorch-cvt

Cross view training for sequence labeling in pytorch

Language:PythonStargazers:20Issues:0Issues:0

nfr

Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine translation.

Language:PythonLicense:Apache-2.0Stargazers:11Issues:3Issues:0

junky

Layers, datasets and utilities for PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:10Issues:2Issues:0

mordl

Morphological parser (POS, lemmata, NER etc.)

Language:PythonLicense:BSD-3-ClauseStargazers:5Issues:2Issues:0

corpuscula

Toolkit that simplifies corpus processing

Language:PythonLicense:BSD-3-ClauseStargazers:3Issues:2Issues:0

rucor_to_conllu

RuCor corpus to CoNLL-U format conversion

Language:Jupyter NotebookLicense:CC0-1.0Stargazers:1Issues:1Issues:0