孙琦's repositories
anomaly-detection-resources
Anomaly detection related books, papers, videos, and toolboxes
api4sensevoice
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.
DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
dify
One API for plugins and datasets, one interface for prompt engineering and visual operation, all for creating powerful AI applications.
EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
fabric
Read-only mirror of https://gerrit.hyperledger.org/r/#/admin/projects/fabric
FastGPT
FastGPT is a knowledge-based QA system built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization!
graphrag-accelerator
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
GraphRAG-Ollama-UI
GraphRAG using Ollama with Gradio UI and Extra Features
IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
kubeai
Private Open AI on Kubernetes
LabelLLM
The Open-Source Data Annotation Platform
labelU
Data annotation toolbox supports image, audio and video data.
Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
llm-graph-builder
Neo4j graph construction from unstructured data using LLMs
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
openspg
OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constrained knowledge modeling, 2) facts and logic fused representation, 3) kNext SDK(python): LLM-enhanced knowledge construction, reasoning and generation
porcupine
On-device wake word detection powered by deep learning
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
safetensors
Simple, safe way to store and distribute tensors
self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程
sherpa-onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation