James Chang's starred repositories
DouglasOrr.github.io
Doug's Diversions
Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
SuperCLUE-Video
中文原生多层次文生视频测评基准
MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
suno-music-generator
基于 suno.ai 实现的文字快速创作音乐网站 (A text-based rapid music creation website based on suno.ai )
Swin-UMamba
Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining
Rewrite-the-Stars
[CVPR 2024] Rewrite the Stars
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Diffusion2GAN
Website source files for Diffusion2GAN Project.
Valuate-and-Enhance-Multimodal-Cooperation
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
HuixiangDou
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
AI-Paper-Collector
MLNLP社区用来更好进行论文搜索的工具。Fully-automated scripts for collecting AI-related papers