Yu Wu (吴俣) (MarkWuNLP)



Company: Microsoft Research

Location: Beijing, China

Home Page: https://scholar.google.co.jp/citations?user=aQizmzsAAAAJ&hl=en


Yu Wu (吴俣)'s starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 67850 | Issues: 570 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 67608 | Issues: 559 | Issues: 710

lama-cleaner

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

Language: Python | License: Apache-2.0 | Stargazers: 15066 | Issues: 120 | Issues: 336

GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 7652 | Issues: 99 | Issues: 198

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python | License: Apache-2.0 | Stargazers: 6827 | Issues: 71 | Issues: 577

Fengshenbang-LM

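Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.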

Language: Python | License: Apache-2.0 | Stargazers: 4001 | Issues: 57 | Issues: 294

YaLM-100B

Pretrained language model with 100B parameters

Language: Python | License: Apache-2.0 | Stargazers: 3734 | Issues: 48 | Issues: 28

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language: Python | License: MIT | Stargazers: 3428 | Issues: 57 | Issues: 70

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language: Python | License: Apache-2.0 | Stargazers: 2820 | Issues: 51 | Issues: 150

NUWA

A unified 3D Transformer Pipeline for visual synthesis

evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Language: Python | License: Apache-2.0 | Stargazers: 1971 | Issues: 46 | Issues: 291

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language: Python | License: MIT | Stargazers: 1922 | Issues: 39 | Issues: 43

Chain-of-ThoughtsPapers

A trend that starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language: Python | License: MIT | Stargazers: 1161 | Issues: 24 | Issues: 85

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Diffusion-LM

Diffusion-LM

Language: Python | License: Apache-2.0 | Stargazers: 1034 | Issues: 17 | Issues: 71

wit

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

roformer

Rotary Transformer

Language: Python | License: Apache-2.0 | Stargazers: 783 | Issues: 8 | Issues: 8

audio-dataset

Audio Dataset for training CLAP and other models

prize

A prize for finding tasks that cause large language models to show inverse scaling

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language: Jupyter Notebook | Stargazers: 555 | Issues: 23 | Issues: 29

BeatNet

BeatNet is a state-of-the-art real-time and offline joint music beat, downbeat, tempo, and meter tracking system using CRNN and particle filtering (implementation of the ISMIR 2021 paper).

Language: Python | License: CC-BY-4.0 | Stargazers: 316 | Issues: 9 | Issues: 27

stopes

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

Language: Python | License: MIT | Stargazers: 247 | Issues: 20 | Issues: 40

Squeezeformer

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Language: Python | License: Apache-2.0 | Stargazers: 245 | Issues: 15 | Issues: 4

vocoder-benchmark

A repository for benchmarking neural vocoders by their quality and speed.

Language: Python | License: NOASSERTION | Stargazers: 201 | Issues: 18 | Issues: 6

DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023

Language: Python | License: MIT | Stargazers: 194 | Issues: 7 | Issues: 3

Language: Python | License: Apache-2.0 | Stargazers: 73 | Issues: 4 | Issues: 5

asr2k

asr2k

Language: Python | License: MIT | Stargazers: 48 | Issues: 16 | Issues: 0