James Chang (strategist922)

strategist922

Geek Repo

Company:Microsoft

Location:Taipei, Taiwan

Github PK Tool:Github PK Tool


Organizations
THUKElab

James Chang's starred repositories

llama3.np

llama3.np is pure NumPy implementation for Llama 3 model.

Language:PythonLicense:MITStargazers:735Issues:0Issues:0

DouglasOrr.github.io

Doug's Diversions

Language:HTMLStargazers:7Issues:0Issues:0

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:215Issues:0Issues:0

IC-Light

More relighting!

Language:PythonLicense:Apache-2.0Stargazers:3202Issues:0Issues:0

VidProM

VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

Stargazers:75Issues:0Issues:0

SuperCLUE-Video

中文原生多层次文生视频测评基准

Stargazers:13Issues:0Issues:0

PeRF

[TPAMI 2024] PERF: Panoramic Neural Radiance Field from a Single Panorama

Language:PythonLicense:NOASSERTIONStargazers:172Issues:0Issues:0
Language:HTMLStargazers:31Issues:0Issues:0

MS-MARCO-Web-Search

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

License:MITStargazers:242Issues:0Issues:0

suno-music-generator

基于 suno.ai 实现的文字快速创作音乐网站 (A text-based rapid music creation website based on suno.ai )

Language:TypeScriptLicense:Apache-2.0Stargazers:137Issues:0Issues:0

Swin-UMamba

Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining

Language:PythonLicense:Apache-2.0Stargazers:157Issues:0Issues:0

SegMamba

SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

Language:PythonStargazers:245Issues:0Issues:0

Rewrite-the-Stars

[CVPR 2024] Rewrite the Stars

Language:PythonLicense:Apache-2.0Stargazers:138Issues:0Issues:0

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonLicense:MITStargazers:1055Issues:0Issues:0

Diffusion2GAN

Website source files for Diffusion2GAN Project.

Language:JavaScriptStargazers:53Issues:0Issues:0

Valuate-and-Enhance-Multimodal-Cooperation

The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024

Language:PythonStargazers:18Issues:0Issues:0

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Language:Jupyter NotebookStargazers:1514Issues:0Issues:0
Language:JavaScriptStargazers:4Issues:0Issues:0

gpt-pilot

The first real AI developer

Language:PythonLicense:MITStargazers:28607Issues:0Issues:0

NAAF

Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.

Language:PythonStargazers:99Issues:0Issues:0

CuMo

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Language:PythonLicense:Apache-2.0Stargazers:77Issues:0Issues:0

HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Language:PythonLicense:BSD-3-ClauseStargazers:928Issues:0Issues:0

AI-Paper-Collector

MLNLP社区用来更好进行论文搜索的工具。Fully-automated scripts for collecting AI-related papers

Language:PythonLicense:GPL-3.0Stargazers:1102Issues:0Issues:0

M2PT

[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Language:PythonLicense:Apache-2.0Stargazers:69Issues:0Issues:0
Language:PythonLicense:MITStargazers:2Issues:0Issues:0
Language:PythonLicense:MITStargazers:5Issues:0Issues:0
Language:PythonLicense:MITStargazers:64Issues:0Issues:0
Language:PythonLicense:MITStargazers:130Issues:0Issues:0

SRFormer

Official code for "SRFormer: Permuted Self-Attention for Single Image Super-Resolution" (ICCV 2023)

Language:PythonLicense:NOASSERTIONStargazers:186Issues:0Issues:0