James Chang (strategist922)

Company: Microsoft

Location: Taipei, Taiwan

Organizations
THUKElab

James Chang's starred repositories

gpt-pilot

The first real AI developer

Language: Python | License: NOASSERTION | Stargazers: 28871 | Issues: 266 | Issues: 479

IC-Light

More relighting!

Language: Python | License: Apache-2.0 | Stargazers: 3723 | Issues: 40 | Issues: 56

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 1564 | Issues: 25 | Issues: 48

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language: Python | License: MIT | Stargazers: 1380 | Issues: 23 | Issues: 33

AI-Paper-Collector

A tool from the MLNLP community for better paper search. Fully automated scripts for collecting AI-related papers

Language: Python | License: GPL-3.0 | Stargazers: 1106 | Issues: 14 | Issues: 76

HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Language: Python | License: BSD-3-Clause | Stargazers: 1069 | Issues: 15 | Issues: 25

llama3.np

llama3.np is a pure NumPy implementation of the Llama 3 model.

Language: Python | License: MIT | Stargazers: 904 | Issues: 13 | Issues: 4

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language: Python | License: Apache-2.0 | Stargazers: 495 | Issues: 10 | Issues: 23

MS-MARCO-Web-Search

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

SegMamba

SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

PeRF

[TPAMI 2024] PERF: Panoramic Neural Radiance Field from a Single Panorama

Language: Python | License: NOASSERTION | Stargazers: 191 | Issues: 13 | Issues: 5

SRFormer

Official code for "SRFormer: Permuted Self-Attention for Single Image Super-Resolution" (ICCV 2023)

Language: Python | License: NOASSERTION | Stargazers: 189 | Issues: 9 | Issues: 32

Rewrite-the-Stars

[CVPR 2024] Rewrite the Stars

Language: Python | License: Apache-2.0 | Stargazers: 189 | Issues: 2 | Issues: 15

Swin-UMamba

Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining

Language: Python | License: Apache-2.0 | Stargazers: 179 | Issues: 2 | Issues: 13

suno-music-generator

A website for quickly creating music from text, built on suno.ai

Language: TypeScript | License: Apache-2.0 | Stargazers: 167 | Issues: 2 | Issues: 4

NAAF

Implementation of our CVPR 2022 paper, Negative-Aware Attention Framework for Image-Text Matching.

CuMo

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Language: Python | License: Apache-2.0 | Stargazers: 94 | Issues: 1 | Issues: 9

VidProM

VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

M2PT

[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Language: Python | License: Apache-2.0 | Stargazers: 74 | Issues: 8 | Issues: 1

Diffusion2GAN

Website source files for Diffusion2GAN Project.

Language: JavaScript | Stargazers: 63 | Issues: 8 | Issues: 0
Language: HTML | Stargazers: 55 | Issues: 0 | Issues: 0

Valuate-and-Enhance-Multimodal-Cooperation

The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024

Language: Python | Stargazers: 21 | Issues: 0 | Issues: 0

SuperCLUE-Video

A native-Chinese, multi-level benchmark for evaluating text-to-video generation

Language: Python | License: MIT | Stargazers: 6 | Issues: 2 | Issues: 0
Language: JavaScript | Stargazers: 4 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 3 | Issues: 2 | Issues: 0