GeneZC's starred repositories
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! Supports many more LMs, such as miniGPT4, StableLM, and MOSS.
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Campus2025
A compilation of internet-industry campus recruitment information for the class of 2025
ring-flash-attention
Ring attention implementation with flash attention
arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark
long-context-attention
Sequence-Parallel Attention for Long-Context LLM Training and Inference
Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
llm-compression-intelligence
Official github repo for the paper "Compression Represents Intelligence Linearly"
Blockwise-Parallel-Transformer
Enables a context window 32 times longer than vanilla Transformers and up to 4 times longer than memory-efficient Transformers.
SiMT-Hallucination
Source code for the paper "On the Hallucination in Simultaneous Machine Translation"