Jiashu's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:12639Issues:0Issues:0

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1481Issues:0Issues:0

matplotlib-curly-brace

Plot curly brace with matplotlib

Language:HTMLLicense:MITStargazers:44Issues:0Issues:0

Paper-Picture-Writing-Code

MLNLP: Paper Picture Writing Code

Language:TeXStargazers:985Issues:0Issues:0

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language:PythonLicense:Apache-2.0Stargazers:937Issues:0Issues:0

allo

Allo: A Programming Model for Composable Accelerator Design

Language:PythonLicense:Apache-2.0Stargazers:96Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:10756Issues:0Issues:0

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

License:MITStargazers:529Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:19934Issues:0Issues:0

GPUDB-Prefetch

Source code of our DaMoN@SIGMOD 2024 paper "How Does Software Prefetching Work on GPU Query Processing?"

Language:CudaStargazers:5Issues:0Issues:0

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language:PythonLicense:Apache-2.0Stargazers:5576Issues:0Issues:0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License:MITStargazers:2808Issues:0Issues:0

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1260Issues:0Issues:0

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileLicense:MITStargazers:1322Issues:0Issues:0

BurstGPT

A GPT-3.5 & GPT-4 Workload Trace to Optimize LLM Serving Systems

Language:PythonLicense:CC-BY-4.0Stargazers:82Issues:0Issues:0
Language:PythonLicense:MITStargazers:149Issues:0Issues:0

Vexless

A code base for Vexless

Language:PythonLicense:MITStargazers:4Issues:0Issues:0
Language:JsonnetLicense:Apache-2.0Stargazers:112Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35511Issues:0Issues:0

GEAR

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM

Language:PythonLicense:MITStargazers:114Issues:0Issues:0

retro

Official repo to On the Generalization Ability of Retrieval-Enhanced Transformers

Language:PythonLicense:Apache-2.0Stargazers:33Issues:0Issues:0

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:33121Issues:0Issues:0

ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Language:GoLicense:MITStargazers:76124Issues:0Issues:0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1588Issues:0Issues:0

contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Language:PythonLicense:NOASSERTIONStargazers:629Issues:0Issues:0

Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Language:PythonLicense:BSD-3-ClauseStargazers:4793Issues:0Issues:0

RETRO-pytorch

Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

Language:PythonLicense:Apache-2.0Stargazers:845Issues:0Issues:0

pytorch-model-train-template

pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用

Language:PythonStargazers:46Issues:0Issues:0
Language:PythonStargazers:152Issues:0Issues:0

AdaQP

Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training

Language:PythonLicense:MITStargazers:18Issues:0Issues:0