张昕伟's starred repositories

DKVMN

Dynamic Key-Value Memory Networks for Knowledge Tracing

Language:PythonStargazers:135Issues:0Issues:0

cosyvoice-api

一个用于CosyVoice的api接口项目

Language:PythonLicense:Apache-2.0Stargazers:37Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:295Issues:0Issues:0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:10676Issues:0Issues:0

Memary

The Open Source Memory Layer For Autonomous Agents

Language:Jupyter NotebookLicense:MITStargazers:1347Issues:0Issues:0

MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Language:PythonLicense:Apache-2.0Stargazers:3747Issues:0Issues:0

kubeadmiral

Multi-Cluster Kubernetes Orchestration

Language:GoLicense:Apache-2.0Stargazers:749Issues:0Issues:0

comfyui_segment_anything

Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.

Language:PythonLicense:Apache-2.0Stargazers:622Issues:0Issues:0

olp-en-cefrj

Open Language Profiles — English profile datasets from CEFR-J

Stargazers:88Issues:0Issues:0

llm-graph-builder

Neo4j graph construction from unstructured data

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:149Issues:0Issues:0

StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Language:PythonLicense:MITStargazers:815Issues:0Issues:0

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:8106Issues:0Issues:0

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonStargazers:876Issues:0Issues:0

claude-engineer

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.

Language:PythonStargazers:7258Issues:0Issues:0

AMchat

AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics, and their solutions. AM (Advanced Mathematics) chat 高等数学大模型。一个集成数学知识和高等数学习题及其解答的大语言模型。

Language:PythonLicense:Apache-2.0Stargazers:154Issues:0Issues:0

LivePortrait

Bring portraits to life!

Language:PythonLicense:NOASSERTIONStargazers:10156Issues:0Issues:0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:3580Issues:0Issues:0

ComfyUI_Bxb

SD变现宝:一键把comfyui工作流转换成小程序。

Language:PythonLicense:Apache-2.0Stargazers:790Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:3935Issues:0Issues:0

MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Language:PythonLicense:NOASSERTIONStargazers:1384Issues:0Issues:0

manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/

Language:PythonLicense:GPL-3.0Stargazers:4861Issues:0Issues:0

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language:PythonLicense:MITStargazers:7009Issues:0Issues:0

pipecat

Open Source framework for voice and multimodal conversational AI

Language:PythonLicense:BSD-2-ClauseStargazers:2802Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:7045Issues:0Issues:0

everyone-can-use-english

人人都能用英语

Language:TypeScriptLicense:MPL-2.0Stargazers:23964Issues:0Issues:0

Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️

Language:PythonLicense:Apache-2.0Stargazers:2145Issues:0Issues:0

RTranslator

Open source real-time translation app for Android that runs locally

Language:C++License:Apache-2.0Stargazers:6089Issues:0Issues:0

llama-fs

A self-organizing file system with llama 3

Language:Jupyter NotebookLicense:MITStargazers:4715Issues:0Issues:0

TMSpeech

腾讯会议摸鱼工具

Language:C#License:MITStargazers:364Issues:0Issues:0

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language:C++License:Apache-2.0Stargazers:2944Issues:0Issues:0