Aleksei Dorkin's repositories

RuWiktionaryParser

Extraction of the Russian word forms and their segmentation from the Russian Wiktionary

Language:PythonLicense:MITStargazers:3Issues:1Issues:0

datasets

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

GNNs-Recipe

A recipe to study Graph Neural Networks (GNNs)

Stargazers:1Issues:0Issues:0

NeuralMachineTranslation

Neural machine translation using encoder decoder architecture with scaled dot product attention

Language:PythonLicense:MITStargazers:1Issues:1Issues:1

QualiaAnnotationUI

A prototype UI for annotation of qualia relations between FrameNet Lexical Units inferred from an external knowledge base

Language:HTMLLicense:MITStargazers:1Issues:1Issues:0

transformers

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

NeuralMorphemeSegmenter

Implementation of various neural network architectures for the purpose of morpheme segmentation

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

awesome-huggingface

🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.

License:Apache-2.0Stargazers:0Issues:0Issues:0

BabelNetExtractor

A scala tool to extract data from local BabelNet indices

Language:ScalaLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

biomedical

Tools for curating biomedical training data for large-scale language modeling

Language:PythonStargazers:0Issues:0Issues:0

c2xg

A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

CondViT-LRVSF

Official Implementation of Conditional ViT on LAION — Referred Visual Search — Fashion

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

course

The Hugging Face course

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

datasets-viewer

Viewer for the 🤗 datasets library.

Language:PythonStargazers:0Issues:0Issues:0

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepPavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FnSenseMapper

A tool to map FrameNet Lexical Units to BabelNet synsets using the distance between sentence embeddings of corresponding definitions

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

jiant

jiant is an nlp toolkit

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

lexicon-enhanced-lemmatization

Neural encoder-decoder model for lemmatization

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

neural-transducer

This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0

rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

SetSimilaritySearch

All-pair set similarity search on millions of sets in Python and on a laptop

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0