zhengjia (ZJLi2013)

ZJLi2013

Geek Repo

Company:null

Location:Detroit

Github PK Tool:Github PK Tool

zhengjia's starred repositories

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

Language:CLicense:NOASSERTIONStargazers:5763Issues:115Issues:222

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:4087Issues:34Issues:435

perfetto

Performance instrumentation and tracing for Android, Linux and Chrome (read-only mirror of https://android.googlesource.com/platform/external/perfetto/)

Language:C++License:Apache-2.0Stargazers:2592Issues:60Issues:741
Language:C++License:Apache-2.0Stargazers:1519Issues:42Issues:187

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:1119Issues:19Issues:50

FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

Language:C++License:NOASSERTIONStargazers:1110Issues:53Issues:143

cccl

CUDA Core Compute Libraries

Language:C++License:NOASSERTIONStargazers:967Issues:33Issues:1078

mip-splatting

[CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting

Language:PythonLicense:NOASSERTIONStargazers:914Issues:21Issues:36

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language:PythonLicense:Apache-2.0Stargazers:798Issues:8Issues:18

pixelsplat

[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacchi, and Vincent Sitzmann

Language:PythonLicense:MITStargazers:748Issues:21Issues:82

kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

Language:HTMLLicense:NOASSERTIONStargazers:655Issues:27Issues:204

PyProf

A GPU performance profiling tool for PyTorch models

Language:PythonLicense:Apache-2.0Stargazers:487Issues:20Issues:0

edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Language:PythonLicense:NOASSERTIONStargazers:415Issues:12Issues:5

omnitrace

Omnitrace: Application Profiling, Tracing, and Analysis

Language:C++License:MITStargazers:272Issues:17Issues:102

ViDAR

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

Language:PythonLicense:Apache-2.0Stargazers:233Issues:9Issues:34
Language:PythonLicense:Apache-2.0Stargazers:192Issues:12Issues:8

ai-matrix

To make it easy to benchmark AI accelerators

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:178Issues:31Issues:14

AMDMIGraphX

AMD's graph optimization engine.

Language:C++License:MITStargazers:168Issues:36Issues:1138

XCube

[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

Language:PythonLicense:NOASSERTIONStargazers:151Issues:12Issues:7

nann

A flexible, high-performance framework for large-scale retrieval problems based on TensorFlow.

Language:C++License:Apache-2.0Stargazers:136Issues:5Issues:15
Stargazers:116Issues:0Issues:0

nsight-training

Training material for Nsight developer tools

Language:CLicense:NOASSERTIONStargazers:113Issues:6Issues:2

vllm-rocm

vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:84Issues:2Issues:13

NeRF-HuGS

Reference implementation of CVPR 2024 (Oral) paper "NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation"

Language:PythonLicense:Apache-2.0Stargazers:31Issues:3Issues:3

GS-LoRA

Continual Forgetting for Pre-trained Vision Models (CVPR 2024)

Language:PythonLicense:MITStargazers:27Issues:3Issues:2