Yx Luan (ylkwd)

Company: Auburn University

Location: Auburn, AL

Yx Luan's starred repositories

CTC-based-GOP

This repo accompanies the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024

Language: Python | Stargazers: 4 | Issues: 0

ai-for-grant-writing

A curated list of resources for using LLMs to develop more competitive grant applications.

Language: Python | License: CC-BY-4.0 | Stargazers: 965 | Issues: 0

nkululeko

Machine learning speaker characteristics

Language: Python | License: MIT | Stargazers: 31 | Issues: 0

seed-vc

Zero-shot voice conversion with in-context learning

Language: Python | License: MIT | Stargazers: 80 | Issues: 0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python | License: BSD-2-Clause | Stargazers: 11332 | Issues: 0

speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Language: Python | License: Apache-2.0 | Stargazers: 2958 | Issues: 0

mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Language: Python | License: MIT | Stargazers: 2091 | Issues: 0

Machine-Learning

Machine learning from scratch

Language: Jupyter Notebook | License: MIT | Stargazers: 972 | Issues: 0

WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Language: Python | License: MIT | Stargazers: 583 | Issues: 0

pydub

Manipulate audio with a simple and easy high level interface

Language: Python | License: MIT | Stargazers: 8758 | Issues: 0

python-coding-interview

A middle-to-high level open source algorithm book designed with coding interview at heart!

Language: TeX | License: Apache-2.0 | Stargazers: 2111 | Issues: 0

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language: Python | License: Apache-2.0 | Stargazers: 5801 | Issues: 0

OpenVoice

Instant voice cloning by MIT and MyShell.

Language: Python | License: MIT | Stargazers: 28324 | Issues: 0

SoundStream

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Language: Python | Stargazers: 342 | Issues: 0

audiotoken

Audio tokenization, in the fastest way possible!

Language: Python | License: Apache-2.0 | Stargazers: 42 | Issues: 0

awesomeMLSys

An ML Systems Onboarding list

Stargazers: 484 | Issues: 0

99AI

99AI stable release: a commercially deployable AI web application (no license required, no backdoors, supports rapid deployment) that aims to be ALL-IN-CHAT. It already supports AI chat, image generation, music, and video features, as well as plugins such as web search and mind maps.

Language: JavaScript | License: NOASSERTION | Stargazers: 598 | Issues: 0

west

We Speech Transcript based on LLM, in 300 lines of code.

Language: Python | License: Apache-2.0 | Stargazers: 117 | Issues: 0

transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

Language: JavaScript | License: MIT | Stargazers: 2544 | Issues: 0

grafx

GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 80 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8282 | Issues: 0

Child-ASR-Paper

A list of papers for child ASR

License: MIT | Stargazers: 25 | Issues: 0

audino

Open source audio annotation tool for humans

Language: JavaScript | License: MIT | Stargazers: 1047 | Issues: 0

ai-pronunciation-trainer

This tool uses AI to evaluate your pronunciation.

Language: Python | License: AGPL-3.0 | Stargazers: 127 | Issues: 0

llm-twin-course

🤖 Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices: ~ source code + 12 hands-on lessons

Language: Python | License: MIT | Stargazers: 2401 | Issues: 0

sentencex

A sentence segmentation library with wide language support optimized for speed and utility.

Language: Python | License: MIT | Stargazers: 43 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8545 | Issues: 0

SenseVoice

Multilingual Voice Understanding Model

Language: Python | License: NOASSERTION | Stargazers: 2576 | Issues: 0

pytorch-deep-learning

Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.

Language: Jupyter Notebook | License: MIT | Stargazers: 10167 | Issues: 0

libsoni

libsoni: A Python Toolbox for Sonifying Music Annotations and Feature Representations

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 17 | Issues: 0