mathon (doldre)

Location: China

Home Page: luoxinchen.me

mathon's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 35453 | Issues: 347 | Issues: 1715

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language: Python | License: Apache-2.0 | Stargazers: 27398 | Issues: 247 | Issues: 6982

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 21695 | Issues: 197 | Issues: 3197

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 17743 | Issues: 157 | Issues: 1369

llama2.c

Inference Llama 2 in one file of pure C

immersive-translate

Immersive bilingual web page translation extension, supporting input box translation, mouse-hover translation, and translation of PDF, Epub, subtitle, and TXT files.

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 11727 | Issues: 104 | Issues: 849

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 11347 | Issues: 98 | Issues: 376

Awesome-Multimodal-Large-Language-Models

Latest Advances on Multimodal Large Language Models

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9203 | Issues: 158 | Issues: 574

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9133 | Issues: 96 | Issues: 626

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6335 | Issues: 61 | Issues: 77

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language: Python | License: MIT | Stargazers: 5962 | Issues: 66 | Issues: 148

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language: Jupyter Notebook | License: MIT | Stargazers: 5289 | Issues: 27 | Issues: 28

QuantsPlaybook

Quantitative research: reproductions of brokerage financial-engineering research reports

Language: Jupyter Notebook | Stargazers: 2344 | Issues: 73 | Issues: 4

DeepRL

Deep Reinforcement Learning Lab, a platform designed to make DRL technology accessible and fun for everyone

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language: Python | License: MIT | Stargazers: 2050 | Issues: 24 | Issues: 156

FinRL-Trading

For trading. Please star.

Language: Jupyter Notebook | License: MIT | Stargazers: 1945 | Issues: 97 | Issues: 41

randomfun

Notebooks and various random fun

Language: Jupyter Notebook | Stargazers: 1067 | Issues: 46 | Issues: 4

rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 723 | Issues: 16 | Issues: 22

LongNet

Implementation of plug-and-play attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Language: Python | License: Apache-2.0 | Stargazers: 658 | Issues: 18 | Issues: 21

M5-methods

Data, Benchmarks, and methods submitted to the M5 forecasting competition

Language: Jupyter Notebook | Stargazers: 559 | Issues: 47 | Issues: 13
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 336 | Issues: 9 | Issues: 39

hive-third-functions

Some useful custom Hive UDFs, especially array, JSON, math, and string functions.

Language: Java | License: Apache-2.0 | Stargazers: 219 | Issues: 17 | Issues: 8
Language: Python | License: Apache-2.0 | Stargazers: 207 | Issues: 6 | Issues: 25

Reinforcement-Learning-for-Market-Making

Using tabular and deep reinforcement learning methods to infer optimal market making strategies

Language: Jupyter Notebook | Stargazers: 140 | Issues: 4 | Issues: 0

VidToMe

Official PyTorch implementation of "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)

Language: Python | License: MIT | Stargazers: 130 | Issues: 9 | Issues: 4

ffrecord

FireFlyer Record file format, writer and reader for DL training samples.

Language: Python | License: MIT | Stargazers: 107 | Issues: 5 | Issues: 8

DisCo-CLIP

Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".

Language: Python | License: Apache-2.0 | Stargazers: 42 | Issues: 7 | Issues: 5