Ma-Dan's starred repositories

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:24901Issues:168Issues:791

everyone-can-use-english

人人都能用英语

jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

Language:TypeScriptLicense:AGPL-3.0Stargazers:18397Issues:101Issues:1405

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:10727Issues:96Issues:329

pycorrector

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:5240Issues:85Issues:446

GeminiProChat

Minimal web UI for GeminiPro.

Language:TypeScriptLicense:MITStargazers:4097Issues:31Issues:105
Language:PythonLicense:Apache-2.0Stargazers:3719Issues:50Issues:101

ml-interviews-book

https://huyenchip.com/ml-interviews-book/

iverilog

Icarus Verilog

Language:C++License:GPL-2.0Stargazers:2669Issues:135Issues:670

gowebsocket

golang基于websocket单台机器支持百万连接分布式聊天(IM)系统

Language:GoLicense:NOASSERTIONStargazers:2608Issues:56Issues:57

KuiperInfer

带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

Language:C++License:MITStargazers:2027Issues:20Issues:24

KG-demo-for-movie

从无到有构建一个电影知识图谱,并基于该KG,开发一个简易的KBQA程序。

vocal-separate

an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems models 这是一个极简的人声和背景音乐分离工具,本地化网页操作,无需连接外网

Language:PythonLicense:GPL-3.0Stargazers:977Issues:8Issues:10

KnowledgeGraph

史上最大规模1.4亿知识图谱数据免费下载,知识图谱,通用知识图谱,融合了两千五百多万的实体,拥有亿级别的实体属性关系。

ChatLM-mini-Chinese

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Language:PythonLicense:Apache-2.0Stargazers:859Issues:11Issues:41

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language:PythonLicense:Apache-2.0Stargazers:736Issues:16Issues:73

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language:PythonLicense:Apache-2.0Stargazers:734Issues:8Issues:18

CNSD

中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)

rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language:C++License:Apache-2.0Stargazers:385Issues:11Issues:51

marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Language:PythonLicense:Apache-2.0Stargazers:358Issues:13Issues:19

wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:348Issues:15Issues:51

CAT

A CRF-based ASR Toolkit

Language:PythonLicense:Apache-2.0Stargazers:307Issues:21Issues:48

glake

GLake: optimizing GPU memory management and IO transmission.

Language:C++License:Apache-2.0Stargazers:275Issues:6Issues:17

awesome-drones-zh

无人机资源汇总

xlang

A next-generation dynamic and high-performance language for AI and IOT with natural born distributed computing ability.

Language:CLicense:Apache-2.0Stargazers:47Issues:9Issues:1

SeIF

SeIF: Semantic-constrained Deep Implicit Function for Single-image 3D Head Reconstruction

Language:PythonStargazers:23Issues:0Issues:0

whisper-trtllm

Whisper in TensorRT-LLM

Language:C++Stargazers:12Issues:3Issues:0

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:3Issues:0Issues:0