Tianheng Cheng (wondervictor)

Company: Huazhong University of Science and Technology

Location: China

Home Page: https://scholar.google.com/citations?user=PH8rJHYAAAAJ&hl

Twitter: @tiahch


Organizations
HRNet
hustvl
msra-alumni

Tianheng Cheng's starred repositories

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 20466 | Issues: 132 | Issues: 220

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 9864 | Issues: 64 | Issues: 11

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8481 | Issues: 80 | Issues: 33

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language: Python | License: AGPL-3.0 | Stargazers: 6859 | Issues: 38 | Issues: 168

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 5789 | Issues: 67 | Issues: 183

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language: Python | License: NOASSERTION | Stargazers: 2087 | Issues: 26 | Issues: 62

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language: Python | License: Apache-2.0 | Stargazers: 1769 | Issues: 6 | Issues: 236

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language: Python | License: Apache-2.0 | Stargazers: 1199 | Issues: 22 | Issues: 73

Chinese-LLaMA-Alpaca-3

Chinese Llama-Alpaca large language model project, phase three (Chinese Llama-3 LLMs), developed from Meta Llama 3

Language: Python | License: Apache-2.0 | Stargazers: 1040 | Issues: 16 | Issues: 41

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

CMMLU

CMMLU: Measuring massive multitask language understanding in Chinese

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 555 | Issues: 18 | Issues: 14

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language: Python | License: Apache-2.0 | Stargazers: 541 | Issues: 16 | Issues: 5

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language: Python | License: Apache-2.0 | Stargazers: 472 | Issues: 0 | Issues: 0

APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Language: Python | License: Apache-2.0 | Stargazers: 444 | Issues: 6 | Issues: 46

LLaMA-Pro

[ACL 2024] Progressive LLaMA with Block Expansion.

Language: Python | License: Apache-2.0 | Stargazers: 417 | Issues: 21 | Issues: 27

megalodon

Reference implementation of Megalodon 7B model

Language: Cuda | License: MIT | Stargazers: 397 | Issues: 9 | Issues: 6

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Vista

A Generalizable World Model for Autonomous Driving

Language: Python | License: Apache-2.0 | Stargazers: 269 | Issues: 0 | Issues: 0

MM-Vet

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)

Language: Python | License: Apache-2.0 | Stargazers: 190 | Issues: 2 | Issues: 5

HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Language: Python | License: BSD-3-Clause | Stargazers: 185 | Issues: 4 | Issues: 10

DiG

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Language: Python | License: MIT | Stargazers: 75 | Issues: 0 | Issues: 0

tiny-flash-attention

flash attention tutorial written in python, triton, cuda, cutlass

Language: Cuda | Stargazers: 65 | Issues: 0 | Issues: 0

GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Language: Python | License: Apache-2.0 | Stargazers: 57 | Issues: 3 | Issues: 7

License: MIT | Stargazers: 53 | Issues: 0 | Issues: 0

CCoT

[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"

Language: Python | License: MIT | Stargazers: 24 | Issues: 0 | Issues: 0

OnnxSlim

A Toolkit to Help Optimize Large Onnx Model

Language: Python | License: MIT | Stargazers: 14 | Issues: 0 | Issues: 0

Linearized-LLM

[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0