jasonppy

followers

following

stars

The University of Texas at Austin

Austin, TX

https://jasonppy.github.io/

Puyuan Peng's repositories

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookNOASSERTION7515 89 126

PromptingWhisper

Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

Language:Python132 4 8

syllable-discovery

Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model

Language:PythonBSD-3-Clause27 30

word-discovery

Word Discovery in Visually Grounded, Self-Supervised Speech Models

Language:Jupyter NotebookBSD-3-Clause24 4 5

FaST-VGS-Family

Transformer-based visually grounded speech models

Language:PythonBSD-3-Clause19 4 3

VoiceCraft_web

Language:JavaScript4 20

cs61bSpring2018

Language:Java1 10

jasonppy.github.io

Language:HTML1 20

moment_detr

[NeurIPS 2021] Moment-DETR code and QVHighlights dataset

Language:PythonMIT1 20

academicpages

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT010

HERO_Video_Feature_Extractor

Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"

Language:PythonMIT010

MAE-AST-Public

Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer

Language:Python010

para-nmt-50m

Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations"

Language:Python010

vqwordseg

Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.

Language:Jupyter NotebookMIT010

yt-dl

Language:Python010

zerospeech2021_baseline

BERT and LSTM baseline models of the ZeroSpeech Challenge 2021

Language:Python010