yangwenwen's starred repositories

awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

License:Apache-2.0Stargazers:71Issues:0Issues:0

speech-adapters

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding

Language:PythonLicense:MITStargazers:39Issues:0Issues:0

Speech-Prompts-Adapters

This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.

Stargazers:95Issues:0Issues:0

TigerBot

TigerBot: A multi-language multi-task LLM

Language:PythonLicense:Apache-2.0Stargazers:2221Issues:0Issues:0

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:17754Issues:0Issues:0

LLMsNineStoryDemonTower

【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。

Stargazers:1570Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35251Issues:0Issues:0

llm-foundry

LLM training code for Databricks foundation models

Language:PythonLicense:Apache-2.0Stargazers:3793Issues:0Issues:0

LLMZoo

⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡

Language:PythonLicense:Apache-2.0Stargazers:2891Issues:0Issues:0

zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

Language:PythonLicense:MITStargazers:2561Issues:0Issues:0

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonLicense:NOASSERTIONStargazers:799Issues:0Issues:0

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

Stargazers:199Issues:0Issues:0

Leaderboard

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Language:PythonStargazers:406Issues:0Issues:0

wer_are_we

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

Stargazers:1863Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3568Issues:0Issues:0

tta

Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"

Language:PythonLicense:Apache-2.0Stargazers:109Issues:0Issues:0

MPNet

MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf

Language:PythonLicense:MITStargazers:283Issues:0Issues:0

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Language:PythonLicense:Apache-2.0Stargazers:6159Issues:0Issues:0

neural_sp

End-to-end ASR/LM implementation with PyTorch

Language:PythonLicense:Apache-2.0Stargazers:586Issues:0Issues:0

audio_visual_speech_enhancement

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

Language:PythonLicense:Apache-2.0Stargazers:101Issues:0Issues:0

SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Language:PythonLicense:Apache-2.0Stargazers:634Issues:0Issues:0

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CLicense:NOASSERTIONStargazers:1932Issues:0Issues:0

VisualizeMNIST

This project is real-time visualization of a network recognizing digits from user's input.

Language:ProcessingLicense:GPL-3.0Stargazers:550Issues:0Issues:0

Lipreading-DenseNet3D

DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

Language:PythonStargazers:117Issues:0Issues:0

D3D

The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild

Language:PythonStargazers:25Issues:0Issues:0

Faceswap-Deepfake-Pytorch

Faceswap with Pytorch or DeepFake with Pytorch

Language:PythonStargazers:511Issues:0Issues:0

speech_separation

Include some core functions and model to handle speech separation

Language:PythonLicense:MITStargazers:153Issues:0Issues:0

Looking-to-Listen-at-the-Cocktail-Party

Executable code based on Google articles

Language:PythonLicense:MITStargazers:162Issues:0Issues:0

awesome-Face_Recognition

papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face Deblurring; Face Generation && Face Synthesis; Face Transfer; Face Anti-Spoofing; Face Retrieval;

Stargazers:4409Issues:0Issues:0

facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Language:PythonLicense:MITStargazers:4266Issues:0Issues:0