Changhan Wang (kahne)

Company: @facebookresearch

Location: New York, NY

Home Page: changhan.me

Changhan Wang's starred repositories

algorithm-visualizer

🎆 Interactive Online Platform that Visualizes Algorithms from Code

Language: JavaScript | License: MIT | Stargazers: 46105 | Issues: 1231 | Issues: 112

tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Language: Python | License: Apache-2.0 | Stargazers: 14868 | Issues: 465 | Issues: 1247

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python | License: NOASSERTION | Stargazers: 13558 | Issues: 202 | Issues: 2257

vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

Language: C++ | License: NOASSERTION | Stargazers: 8400 | Issues: 349 | Issues: 1264

awesome-self-supervised-learning

A curated list of awesome self-supervised methods

openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language: Python | License: AGPL-3.0 | Stargazers: 2389 | Issues: 75 | Issues: 209

wer_are_we

Attempt at tracking the state of the art and recent results (bibliography) on speech recognition.

magnitude

A fast, efficient universal vector embedding utility package.

Language: Python | License: MIT | Stargazers: 1610 | Issues: 38 | Issues: 84

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

alfred

alfred-py: A deep learning utility library for **human**. For more details on using the library, see: https://zhuanlan.zhihu.com/p/341446046

Language: Python | License: GPL-3.0 | Stargazers: 883 | Issues: 21 | Issues: 27

epitran

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

Language: Python | License: MIT | Stargazers: 575 | Issues: 22 | Issues: 92

voxpopuli

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

Language: Python | License: NOASSERTION | Stargazers: 491 | Issues: 18 | Issues: 22

Phonetisaurus

Phonetisaurus G2P

Language: Shell | License: BSD-3-Clause | Stargazers: 430 | Issues: 33 | Issues: 55

COMET

A Neural Framework for MT Evaluation

Language: Python | License: Apache-2.0 | Stargazers: 394 | Issues: 17 | Issues: 147

DME

Dynamic Meta-Embeddings for Improved Sentence Representations

Language: Python | License: NOASSERTION | Stargazers: 332 | Issues: 18 | Issues: 7

g2pm

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Language: Python | License: Apache-2.0 | Stargazers: 326 | Issues: 15 | Issues: 18

covost

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

Language: Python | License: NOASSERTION | Stargazers: 320 | Issues: 20 | Issues: 4

DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

Language: Python | License: MIT | Stargazers: 319 | Issues: 21 | Issues: 31

NonAutoregGenProgress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

neurst

Neural end-to-end Speech Translation Toolkit

Language: Python | License: NOASSERTION | Stargazers: 293 | Issues: 15 | Issues: 23

CharsiuG2P

Multilingual G2P in 100 languages

Language: Jupyter Notebook | License: MIT | Stargazers: 245 | Issues: 10 | Issues: 10

SpeechTransProgress

Tracking the progress in end-to-end speech translation

fastwer

A PyPI package for fast word/character error rate (WER/CER) calculation

Language: Python | License: MIT | Stargazers: 66 | Issues: 2 | Issues: 9

eskmeans

Embedded segmental K-means (ES-KMeans) in Python.

Language: Python | License: GPL-3.0 | Stargazers: 13 | Issues: 2 | Issues: 2