Eren Gölge's starred repositories
open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
faster-whisper
Faster Whisper transcription with CTranslate2
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to LLM and generate the answer based on search results. It's all FREE to use.
parler-tts
Inference and training library for high-quality TTS models.
deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
accelerated-scan
Accelerated First Order Parallel Associative Scan
brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
hippogriff
Griffin MQA + Hawk Linear RNN Hybrid
voice-dataset-creation
Tools to create your own voice dataset for TTS training