cafew's starred repositories
lightning-whisper-mlx
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
Ebook-Publisher
A Python tool for converting online stories into portable formats
AWSRDS-ChatGPT-API-Caching
Utilizing AWS RDS with Python PyMySQL for Efficient ChatGPT API Calls Management
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Speech-Emotion-Recognition
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
Audio-Sentiment-Analysis
This repository consists of work done to analyse sentiment of a customer in a conversation with a call center agent using various machine learning algorithms and audio features.
vits-cantonese
Cantonese Text to Speech with VITS implementation
whisper-dictation
Dictation app based on the OpenAI speech-to-text models
llm_response_streaming
Streaming of Fine tuned LLM Response using Fast API
Speaker_diarization
Speech Diarization for scrum automation
AutoAudiobook
Automatically create an audiobook using OpenAI
mimic-recording-studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
sub-to-audio
Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.
PunctuationModel
中文标点符号模型,可以给文本添加标点符号。
punctuator
A small seq2seq punctuator tool based on DistilBERT
deepmultilingualpunctuation
A python package for deep multilingual punctuation prediction.
PITS-44100-Ja
44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。
AutoAudiobook
Automatically create an audiobook using OpenAI
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
chatbot-ui
AI chat for every model.
ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.