James Chang's repositories
accelerated_features
Do you need robust and fast local feature extraction? You are in the right place!
corenet
CoreNet: A library for training deep neural networks
CuMo
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Diffusion2GAN
Website source files for Diffusion2GAN Project.
faceswap
Deepfakes Software For All
gpt-pilot
The first real AI developer
HuixiangDou
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 最接近GPT-4V表现的可商用开源模型
LLM-Inheritune
This is the official repository for Inheritune.
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Megatron-LM
Ongoing research training transformer models at scale
MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
PLLaVA
Official repository for the paper PLLaVA
pykan
Kolmogorov Arnold Networks
RecAI
Bridging LLM and Recommender System.
Rewrite-the-Stars
[CVPR 2024] Rewrite the Stars
RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
RSCaMa
RSCaMa: Remote Sensing Image Change Captioning with State Space Model
SegMamba
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation
SRFormer
Official code for "SRFormer: Permuted Self-Attention for Single Image Super-Resolution" (ICCV 2023)
StoryDiffusion
Create Magic Story!
suno-music-generator
基于 suno.ai 实现的文字快速创作音乐网站 (A text-based rapid music creation website based on suno.ai )
TriForce
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Vitron
A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Yi
A series of large language models trained from scratch by developers @01-ai