charlesCXK

Xiaokang Chen's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.036389 347 1760

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.015346 100 784

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

11624 267 108

Startup-CTO-Handbook

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

NOASSERTION10123 84 11

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonMIT4277 63 93

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

MIT3367 28 83

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3114 127 18

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:Python2875 33 129

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonApache-2.01787 17 26

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonMIT1755 22 125

VideoSys

VideoSys: An easy and efficient system for video generation

Language:PythonApache-2.01608 27 69

CV_interviews_Q-A

CV算法岗知识点及面试问答汇总，主要分为计算机视觉、机器学习、图像处理和 C++基础四大块，一起努力向offers发起冲击！

1566 29 5

LGM

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Language:PythonMIT1555 33 69

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonNOASSERTION1319 4 137

OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Language:PythonNOASSERTION1208 23 43

CV_Interview

I hope this repo can help you a lot!

1198 14 5

coco-caption

Language:Jupyter NotebookNOASSERTION1119 24 54

Campus2024

2024届互联网校招信息汇总

959 28 13

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookNOASSERTION898 17 31

VisionLLM

VisionLLM Series

Language:PythonApache-2.0838 42 13

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

822 52 14

annotated-mamba

Annotated version of the Mamba paper

Language:Jupyter NotebookMIT445 22 3

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Language:Jupyter NotebookApache-2.0383 12 23

LLaVA-UHD

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Language:Python294 14 25

Plain-DETR

[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design

Language:PythonMIT192 14 25

Tube-Link

[ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS

Language:Python109 5 11

CAE

This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"

Language:Python80 2 2

GroupDETR

[ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

40 8 1

dual-teacher

Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"

Language:PythonNOASSERTION36 1 5

understand-ssl-part-aware

official code for TMLR Paper: "Understanding Self-Supervised Pretraining with Part-Aware Representation Learning"

Language:Python300