Aleksei Dorkin's repositories

ancient-lang-adapters

Code for submissions to SIGTYP 2024, EvaLatin 2024, and AXOLOTL 2024 shared tasks

Language:PythonLicense:MITStargazers:2Issues:2Issues:0

datasets

šŸ¤— The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

GNNs-Recipe

A recipe to study Graph Neural Networks (GNNs)

Stargazers:1Issues:0Issues:0

QualiaAnnotationUI

A prototype UI for annotation of qualia relations between FrameNet Lexical Units inferred from an external knowledge base

Language:HTMLLicense:MITStargazers:1Issues:1Issues:0

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-huggingface

šŸ¤— A list of wonderful open-source projects & applications integrated with Hugging Face libraries.

License:Apache-2.0Stargazers:0Issues:0Issues:0

BabelNetExtractor

A scala tool to extract data from local BabelNet indices

Language:ScalaLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

biomedical

Tools for curating biomedical training data for large-scale language modeling

Language:PythonStargazers:0Issues:0Issues:0

c2xg

A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

CondViT-LRVSF

Official Implementation of Conditional ViT on LAION ā€” Referred Visual Search ā€” Fashion

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

course

The Hugging Face course

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepPavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

diffusers

šŸ¤— Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FnSenseMapper

A tool to map FrameNet Lexical Units to BabelNet synsets using the distance between sentence embeddings of corresponding definitions

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

lexicon-enhanced-lemmatization

Neural encoder-decoder model for lemmatization

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

min-dalle

min(DALLĀ·E) is a fast, minimal port of DALLĀ·E Mini to PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

moondream

tiny vision language model

Stargazers:0Issues:0Issues:0

MorphyNet

MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)

Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

neural-transducer

This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0

rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

SetSimilaritySearch

All-pair set similarity search on millions of sets in Python and on a laptop

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:1Issues:0

stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0