Czm369's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:131895Issues:1030Issues:7408

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:34861Issues:347Issues:1678

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:27127Issues:246Issues:6926

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:22702Issues:312Issues:383

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:18631Issues:294Issues:1304

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language:PythonLicense:Apache-2.0Stargazers:14663Issues:111Issues:155

MOSS

An open-source tool-augmented conversational language model from Fudan University

Language:PythonLicense:Apache-2.0Stargazers:11831Issues:123Issues:353

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Language:PythonLicense:MITStargazers:10895Issues:122Issues:207

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10263Issues:192Issues:2089

kornia

Geometric Computer Vision Library for Spatial AI

Language:PythonLicense:Apache-2.0Stargazers:9484Issues:129Issues:894

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:9327Issues:62Issues:102

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:8856Issues:156Issues:531

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8038Issues:91Issues:353

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6676Issues:59Issues:137

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5214Issues:59Issues:86

mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Language:PythonLicense:Apache-2.0Stargazers:4898Issues:62Issues:1561

stylegan2-ada-pytorch

StyleGAN2-ADA - Official PyTorch implementation

Language:PythonLicense:NOASSERTIONStargazers:3952Issues:50Issues:260

pyllama

LLaMA: Open and Efficient Foundation Language Models

Language:PythonLicense:GPL-3.0Stargazers:2792Issues:34Issues:93

mmyolo

OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.

Language:PythonLicense:GPL-3.0Stargazers:2755Issues:34Issues:374

learn-nlp-with-transformers

we want to create a repo to illustrate usage of transformers in chinese

Semi-supervised-learning

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

Language:PythonLicense:MITStargazers:1215Issues:21Issues:152

ONE-PEACE

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Language:PythonLicense:Apache-2.0Stargazers:855Issues:12Issues:50

DrivingDiffusion

Layout-Guided multi-view driving scene video generation with latent diffusion model

Language:PythonLicense:MITStargazers:496Issues:18Issues:10

bert4pytorch

超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新

syn-rep-learn

Learning from synthetic data - code and models

Language:PythonLicense:Apache-2.0Stargazers:262Issues:11Issues:5

Drive-WM

[CVPR 2024] A world model for autonomous driving.

Language:PythonLicense:Apache-2.0Stargazers:214Issues:20Issues:3

PolarFormer

[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers

Language:PythonLicense:MITStargazers:153Issues:8Issues:15

LingoQA

Official GitHub repository for the paper "LingoQA: Video Question Answering for Autonomous Driving"

Language:PythonLicense:NOASSERTIONStargazers:74Issues:10Issues:3