Sergey Vakhreev (JegernOUTT)

JegernOUTT

Geek Repo

Company:@smallcloudai

Location:Australia, Adelaide

Github PK Tool:Github PK Tool


Organizations
smallcloudai
trassir

Sergey Vakhreev's starred repositories

gpt4all

gpt4all: run open-source LLMs anywhere

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:34505Issues:346Issues:1664

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18226Issues:156Issues:467

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:11715Issues:135Issues:192

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:11238Issues:382Issues:3276

shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

Language:PythonLicense:MITStargazers:8351Issues:78Issues:285

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7942Issues:92Issues:347

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonLicense:Apache-2.0Stargazers:5816Issues:69Issues:267

DAMO-YOLO

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Language:PythonLicense:Apache-2.0Stargazers:3683Issues:137Issues:125

GPTQ-for-LLaMa

4 bits quantization of LLaMA using GPTQ

Language:PythonLicense:Apache-2.0Stargazers:2923Issues:42Issues:216

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:2455Issues:30Issues:135

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2063Issues:32Issues:93

lion-pytorch

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Language:PythonLicense:MITStargazers:1926Issues:14Issues:23

toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Language:PythonLicense:MITStargazers:1898Issues:37Issues:16

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1460Issues:26Issues:174

CodeTF

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

Language:PythonLicense:Apache-2.0Stargazers:1426Issues:21Issues:34

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language:PythonLicense:Apache-2.0Stargazers:1251Issues:34Issues:64

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language:PythonLicense:MITStargazers:889Issues:16Issues:40

Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language:PythonLicense:Apache-2.0Stargazers:727Issues:7Issues:35

bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Language:PythonLicense:Apache-2.0Stargazers:648Issues:13Issues:114
Language:PythonLicense:MITStargazers:554Issues:18Issues:16

recurrent-memory-transformer-pytorch

Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch

Language:PythonLicense:MITStargazers:382Issues:13Issues:19

Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

Language:PythonLicense:Apache-2.0Stargazers:259Issues:5Issues:26

refact-self-hosting

Refact.ai self-hosted server and Docker image

Language:PythonLicense:BSD-3-ClauseStargazers:222Issues:10Issues:11

SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Language:PythonLicense:BSD-3-ClauseStargazers:162Issues:3Issues:24

guanaco-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:73Issues:2Issues:0
Language:Jupyter NotebookStargazers:56Issues:2Issues:0

deblatting_python

[IJCV 2021] Python implementation of deblatting

Language:PythonLicense:MITStargazers:21Issues:2Issues:0

lexlms

LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development

Language:PythonLicense:MITStargazers:3Issues:1Issues:0