Puyuan Peng (jasonppy)

jasonppy

Geek Repo

Company:The University of Texas at Austin

Location:Austin, TX

Home Page:https://jasonppy.github.io/

Twitter:@PuyuanPeng

Github PK Tool:Github PK Tool

Puyuan Peng's repositories

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7515Issues:89Issues:126

PromptingWhisper

Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

syllable-discovery

Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model

Language:PythonLicense:BSD-3-ClauseStargazers:27Issues:3Issues:0

word-discovery

Word Discovery in Visually Grounded, Self-Supervised Speech Models

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:24Issues:4Issues:5

FaST-VGS-Family

Transformer-based visually grounded speech models

Language:PythonLicense:BSD-3-ClauseStargazers:19Issues:4Issues:3
Language:JavaScriptStargazers:4Issues:2Issues:0

moment_detr

[NeurIPS 2021] Moment-DETR code and QVHighlights dataset

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

academicpages

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:1Issues:0

HERO_Video_Feature_Extractor

Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

MAE-AST-Public

Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer

Language:PythonStargazers:0Issues:1Issues:0

para-nmt-50m

Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations"

Language:PythonStargazers:0Issues:1Issues:0

vqwordseg

Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

zerospeech2021_baseline

BERT and LSTM baseline models of the ZeroSpeech Challenge 2021

Language:PythonStargazers:0Issues:1Issues:0