Shuchang Zhou (zsc)

zsc

Geek Repo

Location:Beijing

Home Page:https://zsc.github.io/

Github PK Tool:Github PK Tool


Organizations
megvii-research

Shuchang Zhou's starred repositories

libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Language:PythonLicense:Apache-2.0Stargazers:162Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:64Issues:0Issues:0

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonLicense:NOASSERTIONStargazers:1599Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:106Issues:0Issues:0

ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

License:MITStargazers:409Issues:0Issues:0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Stargazers:1634Issues:0Issues:0

ml-4m

4M: Massively Multimodal Masked Modeling

Language:PythonLicense:Apache-2.0Stargazers:1444Issues:0Issues:0

Assemble-Them-All

[SIGGRAPH Asia 2022] Assemble Them All: Physics-Based Planning for Generalizable Assembly by Disassembly

Language:C++License:MITStargazers:130Issues:0Issues:0

get-haized

A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.

Stargazers:72Issues:0Issues:0

AEC-Challenge

AEC Challenge

License:MITStargazers:361Issues:0Issues:0

TokenHMR

[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation

Language:PythonLicense:NOASSERTIONStargazers:179Issues:0Issues:0

CraftsMan

CraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner

Language:PythonStargazers:349Issues:0Issues:0

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Language:PythonLicense:Apache-2.0Stargazers:816Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:630Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:28353Issues:0Issues:0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:6958Issues:0Issues:0

suno-api

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

Language:TypeScriptLicense:LGPL-3.0Stargazers:1017Issues:0Issues:0

myo_sim

Musculoskeletal Models in MuJoCo

Language:PythonLicense:Apache-2.0Stargazers:71Issues:0Issues:0

ScreenAI

Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"

Language:PythonLicense:MITStargazers:245Issues:0Issues:0

lightplane

Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.

Language:PythonLicense:NOASSERTIONStargazers:235Issues:0Issues:0

PuLID

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Language:PythonLicense:Apache-2.0Stargazers:1005Issues:0Issues:0

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Language:PythonLicense:MITStargazers:1148Issues:0Issues:0

Scrapegraph-ai

Python scraper based on AI

Language:PythonLicense:MITStargazers:13537Issues:0Issues:0

Memary

The Memory Layer For Autonomous Agents

Language:Jupyter NotebookLicense:MITStargazers:1174Issues:0Issues:0

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14325Issues:0Issues:0

demucs_batch-multigpu

[Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:18Issues:0Issues:0

emo-visual-data

😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。

Stargazers:84Issues:0Issues:0

APISR

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Language:PythonLicense:GPL-3.0Stargazers:779Issues:0Issues:0

mujoco_menagerie

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1179Issues:0Issues:0

HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:337Issues:0Issues:0