kang7367's repositories

whisper-timestamped

Multilingual Automatic Speech Recognition with Word-level Timestamps

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

algorithms-book-py

๐Ÿ“–๐Ÿ ๐—ฝ๐˜‚๐—ฏ๐—น๐—ถ๐˜€๐—ต๐—ฒ๐—ฑ ๐—ฏ๐—ผ๐—ผ๐—ธ ๐—ผ๐—ป ๐—ฝ๐˜†๐˜๐—ต๐—ผ๐—ป, ๐—ฎ๐—น๐—ด๐—ผ๐—ฟ๐—ถ๐˜๐—ต๐—บ๐˜€, ๐—ฎ๐—ป๐—ฑ ๐—ฑ๐—ฎ๐˜๐—ฎ ๐˜€๐˜๐—ฟ๐˜‚๐—ฐ๐˜๐˜‚๐—ฟ๐—ฒ๐˜€

Language:PythonStargazers:0Issues:0Issues:0

AwesomeKorean_Data

ํ•œ๊ตญ์–ด ๋ฐ์ดํ„ฐ ์„ธํŠธ ๋งํฌ

License:NOASSERTIONStargazers:0Issues:0Issues:0

Bard-API

The unofficial python package that returns response of Google Bard through cookie value.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

collaboration

ใ€Ž๋ชจ๋‘์˜ ๊นƒ&๊นƒํ—ˆ๋ธŒใ€ (๊ธธ๋ฒ—) ์‹ค์Šต ์ €์žฅ์†Œ

Stargazers:0Issues:0Issues:0

computing-Korean-STT-error-rates

STT ํ•œ๊ธ€ ๋ฌธ์žฅ ์ธ์‹๊ธฐ ์ถœ๋ ฅ ์Šคํฌ๋ฆฝํŠธ์˜ ์™ธ์ž ์˜ค๋ฅ˜์œจ(CER), ๋‹จ์–ด ์˜ค๋ฅ˜์œจ(WER)์„ ๊ณ„์‚ฐํ•˜๋Š” Python ํ•จ์ˆ˜ ํŒจํ‚ค์ง€

License:MITStargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

License:MITStargazers:0Issues:0Issues:0

flores

Facebook Low Resource (FLoRes) MT Benchmark

License:NOASSERTIONStargazers:0Issues:0Issues:0

hunspell-dict-ko

Korean spellchecking dictionary for Hunspell

License:NOASSERTIONStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

Korean-Streaming-ASR

Korean Streaming ASR(with Denoiser and Conformer CTC)

Stargazers:0Issues:0Issues:0

llama

User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.

Stargazers:0Issues:0Issues:0

open-apis-korea

๐Ÿ‡ฐ๐Ÿ‡ท ํ•œ๊ตญ์–ด ์‚ฌ์šฉ์ž๋ฅผ ์œ„ํ•œ ์„œ๋น„์Šค์— ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•œ ์˜คํ”ˆ API ๋ชจ์Œ

Stargazers:0Issues:0Issues:0

openai-cookbook

Examples and guides for using the OpenAI API

License:MITStargazers:0Issues:0Issues:0

py-hanspell

ํŒŒ์ด์ฌ ํ•œ๊ธ€ ๋งž์ถค๋ฒ• ๊ฒ€์‚ฌ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ. (๋„ค์ด๋ฒ„ ๋งž์ถค๋ฒ• ๊ฒ€์‚ฌ๊ธฐ ์‚ฌ์šฉ)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

License:MITStargazers:0Issues:0Issues:0

RSPapers

A Curated List of Must-read Papers on Recommender System.

License:MITStargazers:0Issues:0Issues:0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

License:MITStargazers:0Issues:0Issues:0

Speech-Emotion-Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

tensorflow

An Open Source Machine Learning Framework for Everyone

License:Apache-2.0Stargazers:0Issues:0Issues:0

test-repo

My first github repository!

Stargazers:0Issues:0Issues:0

UnitSpeech

An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

License:MITStargazers:0Issues:0Issues:0

voxpopuli

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

License:NOASSERTIONStargazers:0Issues:0Issues:0

whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

License:UnlicenseStargazers:0Issues:0Issues:0