miendinh's repositories
vocode-python
🤖 Build voice-based LLM agents. Modular + open source.
agentcloud
Agent Cloud is like having your own GPT builder with a bunch extra goodies. The GUI features 1) RAG pipeline which can natively embed 260+ datasources 2) Create Conversational apps (like GPTs) 3) Create Multi Agent process automation apps (crewai) 4) Tools 5) Teams+user permissions. Get started fast with Docker and our install.sh
airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
anything-llm
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
auto-dev
🧙AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
BitNet
Official inference framework for 1-bit LLMs
bolna
End-to-end platform for building voice first multimodal agents
crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
dify
Dify is an open-source LLM app development platform. It has the core tech required to build AI-native apps, including RAG, agent capabilities, model management, observability and more, packaged into one intuitive interface.
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
flux
Official inference repo for FLUX.1 models
grok-1
Grok open release
linkedIn_auto_jobs_applier_with_AI
LinkedIn_AIHawk is a tool that automates the jobs application process on LinkedIn. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.
LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
llama3
The official Meta Llama 3 GitHub site
LLM101n
LLM101n: Let's build a Storyteller
lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.
mermaid
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
pipecat
Open Source framework for voice and multimodal conversational AI
Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
RealtimeTTS
Converts text to speech in realtime
speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
vixtts-demo
A Vietnamese Voice Text-to-Speech Model ✨
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation