孙琦 (skytodmoon)

Company: WeiQIao

Location: Zouping

Home Page: http://blog.futuremake.tech/

孙琦's repositories

anomaly-detection-resources

Anomaly detection related books, papers, videos, and toolboxes

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

api4sensevoice

API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.

Stargazers: 0 | Issues: 0

DB-GPT

AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

dify

One API for plugins and datasets, one interface for prompt engineering and visual operation, all for creating powerful AI applications.

Language: TypeScript | License: NOASSERTION | Stargazers: 0 | Issues: 0

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

License: Apache-2.0 | Stargazers: 0 | Issues: 0

fabric

Read-only mirror of https://gerrit.hyperledger.org/r/#/admin/projects/fabric

Language: Go | License: Apache-2.0 | Stargazers: 0 | Issues: 0

FastGPT

FastGPT is a knowledge-based QA system built on LLMs. It offers out-of-the-box data processing and model invocation capabilities, and allows workflow orchestration through Flow visualization.

Language: TypeScript | License: NOASSERTION | Stargazers: 0 | Issues: 0

graphrag-accelerator

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

GraphRAG-Ollama-UI

GraphRAG using Ollama with Gradio UI and Extra Features

License: MIT | Stargazers: 0 | Issues: 0

IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

License: NOASSERTION | Stargazers: 0 | Issues: 0

kubeai

Private Open AI on Kubernetes

License: Apache-2.0 | Stargazers: 0 | Issues: 0

LabelLLM

The Open-Source Data Annotation Platform

License: Apache-2.0 | Stargazers: 0 | Issues: 0

labelU

A data annotation toolbox that supports image, audio, and video data.

Stargazers: 0 | Issues: 0

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

License: MIT | Stargazers: 0 | Issues: 0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

llm-graph-builder

Neo4j graph construction from unstructured data using LLMs

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0

MeloTTS

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.

License: MIT | Stargazers: 0 | Issues: 0

MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

openspg

OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constrained knowledge modeling, 2) facts and logic fused representation, 3) kNext SDK(python): LLM-enhanced knowledge construction, reasoning and generation

License: Apache-2.0 | Stargazers: 0 | Issues: 0

porcupine

On-device wake word detection powered by deep learning

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

safetensors

Simple, safe way to store and distribute tensors

License: Apache-2.0 | Stargazers: 0 | Issues: 0

self-llm

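"A Guide to Using Open-Source Large Models": a tutorial for quickly deploying open-source large models in a Linux environment, better suited to ** beginners.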

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0

sherpa-onnx

Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, and Rust.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

License: MIT | Stargazers: 0 | Issues: 0

speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT-4o

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

License: MIT | Stargazers: 0 | Issues: 0