brdhunga's repositories
django-ai-assistant
Integrate AI Assistants with Django to build intelligent applications
audio-deepfake-detection
Audio deepfake detection system based on a CNN
bark
🔊 Text-Prompted Generative Audio Model
cerebellum
Browser automation system that uses AI-driven planning to navigate web pages and perform goals.
ClonedVoiceDetection
Single- and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features
deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
denser-retriever
An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.
Django-CRM
Open Source CRM based on Django
Django-GenAI-LLM-RAG-bot
A PM tool utilizing LangChain LLM prompting to analyze project data and return RAG status for project tasks.
EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
fraud-adapter
Site for synthetic detector adapter
knn-vc
Voice Conversion With Just Nearest Neighbors
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
mo-sql-parsing
Let's make a SQL parser so we can provide a familiar interface to non-sql datastores!
OnnxStream
Running Stable Diffusion on a RPI Zero 2 (or in 260MB of RAM)
pandas-ai
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
PhaseAntispoofing_INTERSPEECH
Official repository of the Interspeech 2023 paper "Phase perturbation improves channel robustness for speech spoofing countermeasures"
rag-knowledge-chatbot-django
Knowledge chatbot using Agentic Retrieval Augmented Generation (RAG) techniques. Full-stack proof of concept built on langchain, llama-index, django, pgvector, with multiple advanced RAG techniques used.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription. Designed for real-time applications like voice assistants.
reasoning-teacher
Official code for "Large Language Models Are Reasoning Teachers", ACL 2023
SGMM
Audio Source Recording Device Recognition Based on Representation Learning of Sequential Gaussian Mean Matrix
speech2speech
Generating synthetic speech
synthetic-trust.github.io
Content for Black Hat presentation
Voice-Cloning-App
A Python/PyTorch app for easily synthesising human voices