Zijie Tian (Zijie-Tian)

Company: Tsinghua University

Location: Beijing, China

Twitter: @zijie_tian

Zijie Tian's starred repositories

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 44992 · Issues: 300 · Issues: 647
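
For orientation, a minimal sketch of SAM's prompted-inference API; the checkpoint path and input image below are placeholders, not from this listing.

```python
# Minimal sketch of prompted inference with SAM (checkpoint path and image are placeholders).
import numpy as np
from segment_anything import SamPredictor, sam_model_registry

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h.pth")  # hypothetical local path
predictor = SamPredictor(sam)

image = np.zeros((480, 640, 3), dtype=np.uint8)  # stand-in for a real RGB image
predictor.set_image(image)

# One foreground point prompt; returns candidate masks with quality scores.
masks, scores, logits = predictor.predict(
    point_coords=np.array([[320, 240]]),
    point_labels=np.array([1]),
    multimask_output=True,
)
```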

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language: Python · License: Apache-2.0 · Stargazers: 28777 · Issues: 326 · Issues: 5247
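
A minimal sketch of the composable transformations the description refers to (grad, jit, vmap); the toy loss function is illustrative only.

```python
# Minimal sketch of JAX's composable transforms: grad, jit, and vmap.
import jax
import jax.numpy as jnp

def loss(w, x):
    return jnp.sum((x @ w) ** 2)

grad_loss = jax.jit(jax.grad(loss))          # differentiate, then JIT-compile
batched = jax.vmap(loss, in_axes=(None, 0))  # vectorize over a batch of inputs

w = jnp.ones((3,))
xs = jnp.ones((8, 3))
print(grad_loss(w, xs[0]), batched(w, xs))
```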

terraformer

CLI tool to generate Terraform files from existing infrastructure (reverse Terraform). Infrastructure to Code.

Language: Go · License: Apache-2.0 · Stargazers: 11989 · Issues: 140 · Issues: 775

ZLUDA

CUDA on AMD GPUs

Language: Rust · License: Apache-2.0 · Stargazers: 8090 · Issues: 114 · Issues: 147

trax

Trax — Deep Learning with Clear Code and Speed

Language: Python · License: Apache-2.0 · Stargazers: 7983 · Issues: 146 · Issues: 232
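
A hedged sketch of Trax's combinator style, based on the layer names in the trax.layers module as I recall them; the tiny classifier below is illustrative, not from this listing.

```python
# Sketch of Trax's combinator style: a tiny text classifier built from trax.layers.
from trax import layers as tl

model = tl.Serial(
    tl.Embedding(vocab_size=8192, d_feature=256),  # token ids -> embeddings
    tl.Mean(axis=1),                               # average over the sequence
    tl.Dense(2),                                   # two-class logits
    tl.LogSoftmax(),                               # log-probabilities
)
print(model)
```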

bypy

Python client for Baidu Yun / Baidu Netdisk (personal cloud storage; 百度云/百度网盘).

Language: Python · License: MIT · Stargazers: 7599 · Issues: 298 · Issues: 567
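
bypy can also be driven from Python rather than the command line; a hedged sketch with method names as I recall them from the project README (authorization must already have been done).

```python
# Sketch of using bypy as a library rather than a CLI (requires prior authorization).
from bypy import ByPy

bp = ByPy()
bp.list()               # list the remote app directory
bp.upload("notes.txt")  # hypothetical local file to upload
```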

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ · License: Apache-2.0 · Stargazers: 7119 · Issues: 82 · Issues: 1445
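
A hedged sketch of the Python API the description mentions, assuming the high-level LLM interface found in newer TensorRT-LLM releases; the model name is a placeholder and engine building happens under the hood.

```python
# Hedged sketch of TensorRT-LLM's high-level Python API (newer releases);
# the model name is a placeholder.
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
params = SamplingParams(max_tokens=64, temperature=0.8)

for out in llm.generate(["Explain TensorRT engines in one sentence."], params):
    print(out.outputs[0].text)
```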

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.

Language: Python · License: Apache-2.0 · Stargazers: 5860 · Issues: 67 · Issues: 268

script-commands

Script Commands let you tailor Raycast to your needs. Think of them as little productivity boosts throughout your day.

Language: Shell · License: MIT · Stargazers: 5847 · Issues: 46 · Issues: 206
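
Script Commands are ordinary scripts annotated with @raycast.* metadata comments; a hedged Python sketch with parameter names as I recall them from the repository's templates.

```python
#!/usr/bin/env python3

# Required parameters:
# @raycast.schemaVersion 1
# @raycast.title Word Count
# @raycast.mode compact

# Optional parameters:
# @raycast.icon 📝
# @raycast.argument1 { "type": "text", "placeholder": "text" }

import sys

# Print the word count of the first argument; Raycast shows stdout in compact mode.
print(len(sys.argv[1].split()) if len(sys.argv) > 1 else 0)
```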

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Language: Python · License: Apache-2.0 · Stargazers: 1958 · Issues: 50 · Issues: 78

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

sparsegpt

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Language: Python · License: Apache-2.0 · Stargazers: 653 · Issues: 16 · Issues: 31
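
For context, a toy illustration of one-shot unstructured pruning of a linear layer; this is plain magnitude pruning, not SparseGPT's Hessian-based weight reconstruction.

```python
# Illustration only: one-shot 50% magnitude pruning of a linear layer.
# SparseGPT itself uses Hessian-based reconstruction, not plain magnitude pruning.
import torch

layer = torch.nn.Linear(1024, 1024)
with torch.no_grad():
    w = layer.weight
    threshold = w.abs().flatten().kthvalue(w.numel() // 2).values  # median magnitude
    mask = w.abs() > threshold
    layer.weight.mul_(mask)  # zero out the smallest half of the weights
print(f"sparsity: {1 - mask.float().mean().item():.2f}")
```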

PiPPy

Pipeline Parallelism for PyTorch

Language: Python · License: BSD-3-Clause · Stargazers: 651 · Issues: 36 · Issues: 248

llm-inference-benchmark

LLM Inference benchmark

Language: Python · License: MIT · Stargazers: 265 · Issues: 2 · Issues: 2

local_llama

This repo showcases how to run a model locally and offline, free of OpenAI dependencies.

Language: Python · License: Apache-2.0 · Stargazers: 193 · Issues: 6 · Issues: 13
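
A generic sketch of fully local, offline inference (not this repo's actual pipeline); it assumes the model weights were downloaded beforehand to a hypothetical local directory.

```python
# Generic sketch of offline inference with Hugging Face transformers;
# the model directory is a placeholder and must already contain the weights.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_dir = "./models/llama-7b"  # hypothetical local directory
tok = AutoTokenizer.from_pretrained(model_dir, local_files_only=True)
model = AutoModelForCausalLM.from_pretrained(model_dir, local_files_only=True)

inputs = tok("Summarize this document:", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))
```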

ao

torchao: PyTorch Architecture Optimization (AO). Performant kernels that work with PyTorch.

Language: Python · License: BSD-3-Clause · Stargazers: 192 · Issues: 6 · Issues: 6

reason

A shell for research papers

Language: Rust · License: MIT · Stargazers: 189 · Issues: 4 · Issues: 5

gdGPT

Train LLMs (BLOOM, LLaMA, Baichuan2-7B, ChatGLM3-6B) with DeepSpeed pipeline mode. Faster than ZeRO/ZeRO++/FSDP.

Language: Python · License: Apache-2.0 · Stargazers: 86 · Issues: 1 · Issues: 8
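
A generic sketch of DeepSpeed pipeline parallelism to show what "pipeline mode" means here; this is not gdGPT's actual code, and it must run under the deepspeed launcher so the distributed process group exists.

```python
# Generic sketch of DeepSpeed pipeline parallelism (not gdGPT's actual code).
# Run under the `deepspeed` launcher so torch.distributed is initialized.
import deepspeed
import torch.nn as nn
from deepspeed.pipe import PipelineModule

deepspeed.init_distributed()

layers = [nn.Linear(1024, 1024) for _ in range(8)]
model = PipelineModule(layers=layers, num_stages=2)
# deepspeed.initialize(model=model, ...) then returns an engine whose
# engine.train_batch(data_iter) schedules micro-batches across the stages.
```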

cuda_scheduling_examiner_mirror

A tool for examining GPU scheduling behavior.

Language: Cuda · License: NOASSERTION · Stargazers: 62 · Issues: 11 · Issues: 2

Awesome-Resource-Efficient-LLM-Papers

A curated list of high-quality papers on resource-efficient LLMs 🌱

License: CC0-1.0 · Stargazers: 55 · Issues: 5 · Issues: 0

SparseFinetuning

Repository for sparse fine-tuning of LLMs via a modified version of the MosaicML llmfoundry.

Language: Python · License: Apache-2.0 · Stargazers: 35 · Issues: 5 · Issues: 0

structured_transposable_masks

Code for ICML 2021 submission

UnderstandingNLP

Natural Language Processing Analysis

Language: Jupyter Notebook · Stargazers: 30 · Issues: 4 · Issues: 4

frame

FRAME: Fast Roofline Analytical Modeling and Estimation

Language: Jupyter Notebook · License: MIT · Stargazers: 25 · Issues: 2 · Issues: 0
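
For context, the roofline model behind tools like this reduces to one formula: attainable throughput is the minimum of peak compute and memory bandwidth times arithmetic intensity. A generic sketch (not FRAME's API):

```python
# Generic roofline model (not FRAME's API): attainable throughput is capped by
# either peak compute or memory bandwidth times arithmetic intensity.
def roofline_gflops(peak_gflops, bandwidth_gbs, flops, bytes_moved):
    arithmetic_intensity = flops / bytes_moved  # FLOPs per byte
    return min(peak_gflops, bandwidth_gbs * arithmetic_intensity)

# Example: a kernel doing 2 FLOPs per 8 bytes on a 100 GFLOP/s, 50 GB/s device
# is bandwidth-bound at 50 * 0.25 = 12.5 GFLOP/s.
print(roofline_gflops(peak_gflops=100, bandwidth_gbs=50, flops=2, bytes_moved=8))
```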

MLCarbon

End-to-end carbon footprint modeling tool

Language: Python · Stargazers: 24 · Issues: 2 · Issues: 0
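
A back-of-the-envelope sketch of the operational-carbon arithmetic such tools build on; this is not MLCarbon's full model (which also covers embodied carbon), and the PUE and grid-intensity numbers are illustrative assumptions.

```python
# Back-of-the-envelope operational carbon estimate (not MLCarbon's full model):
# energy = power * time * PUE; emissions = energy * grid carbon intensity.
def training_co2_kg(gpu_count, gpu_power_kw, hours, pue=1.1, grid_kgco2_per_kwh=0.4):
    energy_kwh = gpu_count * gpu_power_kw * hours * pue
    return energy_kwh * grid_kgco2_per_kwh

# Example: 64 GPUs at 0.4 kW for 240 hours -> 6144 kWh * 1.1 * 0.4 ≈ 2703 kg CO2.
print(round(training_co2_kg(64, 0.4, 240), 1))
```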

Opara

Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs.

Language: Python · Stargazers: 16 · Issues: 2 · Issues: 0
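
A generic illustration of the idea of operator-level parallelism on a GPU: issuing independent operators on separate CUDA streams so they can overlap. This uses plain PyTorch streams, not Opara's scheduler.

```python
# Generic illustration of operator parallelism with CUDA streams in PyTorch
# (not Opara's scheduler): two independent matmuls issued on separate streams.
import torch

assert torch.cuda.is_available()
a = torch.randn(2048, 2048, device="cuda")
b = torch.randn(2048, 2048, device="cuda")

s1, s2 = torch.cuda.Stream(), torch.cuda.Stream()
with torch.cuda.stream(s1):
    x = a @ a
with torch.cuda.stream(s2):
    y = b @ b
torch.cuda.synchronize()  # wait for both streams before using x and y
print(x.shape, y.shape)
```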

model-evaluator

Evaluate Transformers from the Hub 🔥

Language: Python · License: Apache-2.0 · Stargazers: 12 · Issues: 8 · Issues: 16
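
A minimal sketch of computing a metric with the 🤗 evaluate library, which this kind of Hub evaluation tooling builds on (an assumption, not this repo's API).

```python
# Minimal metric computation with the `evaluate` library (illustrative values).
import evaluate

accuracy = evaluate.load("accuracy")
result = accuracy.compute(predictions=[0, 1, 1, 0], references=[0, 1, 0, 0])
print(result)  # {'accuracy': 0.75}
```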

wandbtocsv

CLI tool to export W&B metrics to a CSV file.

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 10 · Issues: 2 · Issues: 0
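
A generic sketch of the W&B public API that such an export boils down to; this is not necessarily how the tool is implemented, and the run path is a placeholder.

```python
# Generic sketch of exporting run metrics with the W&B public API;
# the run path is a hypothetical entity/project/run id.
import wandb

api = wandb.Api()
run = api.run("my-entity/my-project/abc123")
history = run.history()                 # pandas DataFrame of logged metrics
history.to_csv("run_metrics.csv", index=False)
```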

lowtime

A time-cost tradeoff problem solver

Language: Python · License: Apache-2.0 · Stargazers: 10 · Issues: 3 · Issues: 0