Aleksei Dorkin's repositories

ancient-lang-adapters

Source code for the submissions to SIGTYP 2024, EvaLatin 2024, and AXOLOTL 2024 shared tasks

Language:PythonLicense:MITStargazers:2Issues:2Issues:0

datasets

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

GNNs-Recipe

A recipe to study Graph Neural Networks (GNNs)

Stargazers:1Issues:0Issues:0

sonajaht

Source code for "Sõnajaht: Definition Embeddings and Semantic Search for Reverse Dictionary Creation" published at *SEM 2024

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-huggingface

🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.

License:Apache-2.0Stargazers:0Issues:0Issues:0

BabelNetExtractor

A scala tool to extract data from local BabelNet indices

Language:ScalaLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

biomedical

Tools for curating biomedical training data for large-scale language modeling

Language:PythonStargazers:0Issues:0Issues:0

c2xg

A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

CondViT-LRVSF

Official Implementation of Conditional ViT on LAION — Referred Visual Search — Fashion

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

course

The Hugging Face course

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepPavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

lexicon-enhanced-lemmatization

Neural encoder-decoder model for lemmatization

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

moondream

tiny vision language model

Language:PythonStargazers:0Issues:0Issues:0

MorphyNet

MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)

Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

neural-transducer

This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

obsidian-semantic-search-plugin

Semantic search for Obsidian.md

Language:RustLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0

rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

SetSimilaritySearch

All-pair set similarity search on millions of sets in Python and on a laptop

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0