Xiaokang Chen (charlesCXK)

charlesCXK

Geek Repo

Company:Peking University

Location:Beijing

Home Page:charlesCXK.github.io

Github PK Tool:Github PK Tool


Organizations
Atten4Vis
HRNet

Xiaokang Chen's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36389Issues:347Issues:1760

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:15346Issues:100Issues:784

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Startup-CTO-Handbook

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4277Issues:63Issues:93

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1787Issues:17Issues:26

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1755Issues:22Issues:125

VideoSys

VideoSys: An easy and efficient system for video generation

Language:PythonLicense:Apache-2.0Stargazers:1608Issues:27Issues:69

CV_interviews_Q-A

CV算法岗知识点及面试问答汇总,主要分为计算机视觉、机器学习、图像处理和 C++基础四大块,一起努力向offers发起冲击!

LGM

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Language:PythonLicense:MITStargazers:1555Issues:33Issues:69

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonLicense:NOASSERTIONStargazers:1319Issues:4Issues:137

OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Language:PythonLicense:NOASSERTIONStargazers:1208Issues:23Issues:43

CV_Interview

I hope this repo can help you a lot!

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1119Issues:24Issues:54

Campus2024

2024届互联网校招信息汇总

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:898Issues:17Issues:31

VisionLLM

VisionLLM Series

Language:PythonLicense:Apache-2.0Stargazers:838Issues:42Issues:13

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

annotated-mamba

Annotated version of the Mamba paper

Language:Jupyter NotebookLicense:MITStargazers:445Issues:22Issues:3

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:383Issues:12Issues:23

LLaVA-UHD

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Plain-DETR

[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design

Language:PythonLicense:MITStargazers:192Issues:14Issues:25

Tube-Link

[ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS

CAE

This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"

GroupDETR

[ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

dual-teacher

Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"

Language:PythonLicense:NOASSERTIONStargazers:36Issues:1Issues:5

understand-ssl-part-aware

official code for TMLR Paper: "Understanding Self-Supervised Pretraining with Part-Aware Representation Learning"

Language:PythonStargazers:3Issues:0Issues:0