huxianhe0's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:62937Issues:0Issues:0

ggml

Tensor library for machine learning

Language:C++License:MITStargazers:10484Issues:0Issues:0

cpp-httplib

A C++ header-only HTTP/HTTPS server and client library

Language:C++License:MITStargazers:12449Issues:0Issues:0

Qt-Open-Source-Project

记录Qt开发的开源项目

License:Apache-2.0Stargazers:254Issues:0Issues:0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonStargazers:22225Issues:0Issues:0

Fooocus

Focus on prompting and generating

Language:PythonLicense:GPL-3.0Stargazers:39011Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:2977Issues:0Issues:0

nlp_paper_study

该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记

Language:C++Stargazers:3823Issues:0Issues:0

Awesome-Skeleton-based-Action-Recognition

Skeleton-based Action Recognition

Language:HTMLStargazers:6Issues:0Issues:0

imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Language:PythonLicense:MITStargazers:7923Issues:0Issues:0

faceswap

Deepfakes Software For All

Language:PythonLicense:GPL-3.0Stargazers:49999Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:30159Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8181Issues:0Issues:0

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Language:C++License:Apache-2.0Stargazers:2859Issues:0Issues:0

Paddle-Lite

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)

Language:C++License:Apache-2.0Stargazers:6903Issues:0Issues:0

nsfw_data_scraper

Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier

Language:ShellLicense:MITStargazers:12215Issues:0Issues:0

open_nsfw

Not Suitable for Work (NSFW) classification using deep neural network Caffe models.

Language:PythonLicense:BSD-2-ClauseStargazers:5848Issues:0Issues:0

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Language:PythonStargazers:2360Issues:0Issues:0

FastASR

这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。

Language:CLicense:Apache-2.0Stargazers:475Issues:0Issues:0

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language:C++License:Apache-2.0Stargazers:2660Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5250Issues:0Issues:0

SummerTTS_VS

SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synthesis(TTS) project that has almost no dependency and could be easily used for Chinese TTS with just one key build out

Language:C++Stargazers:9Issues:0Issues:0

SummerTTS

SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synthesis(TTS) project that has almost no dependency and could be easily used for Chinese TTS with just one key build out

Language:C++Stargazers:376Issues:0Issues:0

piper

A fast, local neural text to speech system

Language:C++License:MITStargazers:5319Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonLicense:Apache-2.0Stargazers:3979Issues:0Issues:0

Realtime_Multi-Person_Pose_Estimation

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:5085Issues:0Issues:0

CS-Books

🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~

Stargazers:19875Issues:0Issues:0

examples

Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:418Issues:0Issues:0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonLicense:MITStargazers:52435Issues:0Issues:0

act-plus-plus

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Language:PythonLicense:MITStargazers:2884Issues:0Issues:0