dqqcasia's starred repositories

SPTK

A suite of speech signal processing tools

Language: C++ | License: Apache-2.0 | Stargazers: 219 | Issues: 0

audiolm-pytorch

Implementation of AudioLM, a SOTA language modeling approach to audio generation from Google Research, in PyTorch

Language: Python | License: MIT | Stargazers: 2332 | Issues: 0

awesome-chatgpt

Curated list of awesome tools, demos, docs for ChatGPT and GPT-3

Stargazers: 8184 | Issues: 0

awesome-chatgpt-prompts

A curated collection of prompts for getting better results from ChatGPT.

Language: HTML | License: CC0-1.0 | Stargazers: 107498 | Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38400 | Issues: 0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

License: CC0-1.0 | Stargazers: 16344 | Issues: 0

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language: HTML | License: MIT | Stargazers: 10507 | Issues: 0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language: Python | License: MIT | Stargazers: 3326 | Issues: 0

vall-e

PyTorch implementation of VALL-E (zero-shot text-to-speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | Stargazers: 1954 | Issues: 0

Language: Python | License: MIT | Stargazers: 243 | Issues: 0

STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Language: Python | License: MIT | Stargazers: 156 | Issues: 0

ICL_PaperList

Paper List for In-context Learning 🌷

Stargazers: 771 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 66834 | Issues: 0

Prompt-Engineering-Guide

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

Language: MDX | License: MIT | Stargazers: 46537 | Issues: 0

Emotional-Speech-Data

This is the GitHub page for publicly available emotional speech data.

License: MIT | Stargazers: 309 | Issues: 0

Language: HTML | Stargazers: 2 | Issues: 0

spanlp

spanlp: NLP applied to Spanish vulgarity. A fast, robust Python library to check for profanity or offensive language in Spanish strings. It contains all the rude words of Spanish-speaking countries.

Language: Python | License: MIT | Stargazers: 34 | Issues: 0

PyChatGPT

⚡️ Python client for the unofficial ChatGPT API with auto token regeneration, conversation tracking, proxy support and more.

Language: Python | License: MIT | Stargazers: 4224 | Issues: 0

Language: Python | License: MIT | Stargazers: 26 | Issues: 0

charsiu

Charsiu: A neural phonetic aligner.

Language: Jupyter Notebook | License: MIT | Stargazers: 265 | Issues: 0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language: C++ | License: MIT | Stargazers: 29681 | Issues: 0

emoji-cheat-sheet

A Markdown version of the emoji cheat sheet

Language: TypeScript | License: MIT | Stargazers: 12145 | Issues: 0

omegaconf

Flexible Python configuration system. The last one you will ever need.

Language: Python | License: BSD-3-Clause | Stargazers: 1890 | Issues: 0

wav2seq

Official code for Wav2Seq

Language: Python | Stargazers: 93 | Issues: 0

diffgram

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

Language: Python | License: NOASSERTION | Stargazers: 1825 | Issues: 0

Language: Python | Stargazers: 155 | Issues: 0

chinese_speech_pretrain

Chinese speech pretrained models

Language: Shell | Stargazers: 966 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 339 | Issues: 0

Stream-level_Latency_Evaluation_for_Simultaneous_Machine_Translation

This repository contains the code of the paper "Stream-level Latency Evaluation for Simultaneous Machine Translation".

Language: Python | License: Apache-2.0 | Stargazers: 7 | Issues: 0

Speech_Translation_Segmenter

This repository contains the code of the segmentation system proposed in "Direct Segmentation Models for Streaming Speech Translation".

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 0