Beast code in Giters

mathon's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.035453 347 1715

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonApache-2.027398 247 6982

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.021695 197 3197

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.017743 157 1369

llama2.c

Inference Llama 2 in one file of pure C

Language:CMIT16683 189 215

immersive-translate

沉浸式双语网页翻译扩展 , 支持输入框翻译，鼠标悬停翻译， PDF, Epub, 字幕文件, TXT 文件翻译 - Immersive Dual Web Page Translation Extension

NOASSERTION13655 78 1441

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause11727 104 849

mamba

Mamba SSM architecture

Language:PythonApache-2.011347 98 376

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

10256 238 99

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonNOASSERTION9203 158 574

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause9133 96 626

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT6335 61 77

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonMIT5962 66 148

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookMIT5289 27 28

QuantsPlaybook

量化研究-券商金工研报复现

Language:Jupyter Notebook2344 73 4

DeepRL

Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone

MIT2267 99 6

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonMIT2050 24 156

FinRL-Trading

For trading. Please star.

Language:Jupyter NotebookMIT1945 97 41

randomfun

Notebooks and various random fun

Language:Jupyter Notebook1067 46 4

rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language:Jupyter NotebookNOASSERTION723 16 22

LongNet

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Language:PythonApache-2.0658 18 21

M5-methods

Data, Benchmarks, and methods submitted to the M5 forecasting competition

Language:Jupyter Notebook559 47 13

RL-book

Language:Python479 31 22

bigcode-dataset

Language:Jupyter NotebookApache-2.0336 9 39

hive-third-functions

Some useful custom hive udf functions, especial array, json, math, string functions.

Language:JavaApache-2.0219 17 8

aqt

Language:PythonApache-2.0207 6 25

Reinforcement-Learning-for-Market-Making

Using tabular and deep reinforcement learning methods to infer optimal market making strategies

Language:Jupyter Notebook140 40

VidToMe

Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)

Language:PythonMIT130 9 4

ffrecord

FireFlyer Record file format, writer and reader for DL training samples.

Language:PythonMIT107 5 8

DisCo-CLIP

Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".

Language:PythonApache-2.042 7 5