Aleksei Dorkin's starred repositories

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 18424 | Issues: 154 | Issues: 468

alpaca.cpp

Locally run an Instruction-Tuned Chat-Style LLM

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python | License: MIT | Stargazers: 8500 | Issues: 67 | Issues: 196

einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Language: Python | License: MIT | Stargazers: 8187 | Issues: 69 | Issues: 172

WantWords

An open-source online reverse dictionary.

awesome-totally-open-chatgpt

A list of totally open alternatives to ChatGPT

img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.

Language: Python | License: MIT | Stargazers: 3466 | Issues: 31 | Issues: 253

ChatReviewer

ChatReviewer: Use ChatGPT to analyze the strengths and weaknesses of papers and suggest improvements

Language: Python | License: NOASSERTION | Stargazers: 1246 | Issues: 3 | Issues: 27

safari

Convolutions for Sequence Modeling

Language: Assembly | License: Apache-2.0 | Stargazers: 851 | Issues: 35 | Issues: 38

bloomz.cpp

C++ implementation for BLOOM

rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 732 | Issues: 16 | Issues: 22

CREMA-D

Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)

Language: R | License: NOASSERTION | Stargazers: 327 | Issues: 10 | Issues: 7

Awesome-Sentence-Embedding

A curated list of research papers in Sentence Representation Learning and an STS leaderboard of sentence embeddings.

Binder

[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"

Language: Python | License: Apache-2.0 | Stargazers: 289 | Issues: 10 | Issues: 8

ChatGPT-RetrievalQA

A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses, with the option of training/evaluating on real human responses.

Language: Jupyter Notebook | Stargazers: 137 | Issues: 7 | Issues: 0

ubisoft-laforge-daft-exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Language: Python | License: Apache-2.0 | Stargazers: 118 | Issues: 8 | Issues: 17

MultiRD

Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"

Language: Python | License: MIT | Stargazers: 107 | Issues: 8 | Issues: 4

GlossBERT

GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)

Language: Python | License: MIT | Stargazers: 90 | Issues: 6 | Issues: 11

qdrant-azure

Qdrant Vector Database on Azure Cloud

Language: Shell | License: MIT | Stargazers: 85 | Issues: 167 | Issues: 16

CondViT-LRVSF

Implementation of Conditional ViT on LAION — Referred Visual Search — Fashion

Language: Python | License: CC-BY-4.0 | Stargazers: 37 | Issues: 3 | Issues: 0

lagonn

Source code and data for Like a Good Nearest Neighbor

Language: Python | License: Apache-2.0 | Stargazers: 28 | Issues: 6 | Issues: 1
Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 2 | Issues: 1

swissbert

The multilingual language model for Switzerland

Language: Jupyter Notebook | License: MIT | Stargazers: 25 | Issues: 1 | Issues: 1

defsent

DefSent: Sentence Embeddings using Definition Sentences

BertForRD

This is the code for the EMNLP 2020 Findings paper "BERT for Monolingual and Cross-Lingual Reverse Dictionary"

Truth-O-Meter-Making-ChatGPT-Truthful

Fact-checking of GPT and other LLMs

Language: Python | License: Apache-2.0 | Stargazers: 15 | Issues: 3 | Issues: 2

heygpt

A simple command-line interface tool that allows you to interact with ChatGPT from OpenAI or Azure.

Language: Rust | Stargazers: 7 | Issues: 0 | Issues: 0

i-like-paintings

A package for predicting painting appreciation from images using a linear regressor on top of a frozen CLIP model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 7 | Issues: 2 | Issues: 0
Language: Jupyter Notebook | Stargazers: 2 | Issues: 0 | Issues: 0