hommmm's repositories
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
spleeter
Deezer source separation library including pretrained models.
EssayKiller_V2
A first-generation creative AI based on open-source GPT-2 | Extensible and evolvable
PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
frankmocap
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
im2markup
Neural model for converting Image-to-Markup (by Yuntian Deng github.com/da03)
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a lightweight, high-performance deep learning framework for mobile inference. TNN is distinguished by its cross-platform capability, high performance, model compression and code pruning. Building on ncnn and Rapidnet, TNN further strengthens support and performance optimization for mobile devices, and also draws on the extensibility and high performance of existing mainstream open-source frameworks. TNN has been deployed in multiple Tencent apps, such as Mobile QQ, Weishi and Pitu. Contributions are welcome: collaborate with us to make TNN a better framework.
EvoSkeleton
Official project website for the CVPR 2020 paper (Oral Presentation) "Cascaded deep monocular 3D human pose estimation with evolutionary training data"
TensorflowASR-1
End-to-end speech recognition models integrated for TensorFlow 2, with an RTF (real-time factor) of around 0.1 / Mandarin State-of-the-art Automatic Speech Recognition in TensorFlow 2
InterHand2.6M
Official PyTorch implementation of "InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image", ECCV 2020
MobileStyleGAN.pytorch
An official implementation of MobileStyleGAN in PyTorch
ParallelTTS
A fast parallel text-to-speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (tested so far).
lyra
A Very Low-Bitrate Codec for Speech Compression
mmediting
OpenMMLab Image and Video Editing Toolbox
mandarin-tts
Chinese Mandarin text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder.
Deepfake-using-Wave2Lip
A deep learning model to lip-sync a given video with any given audio. It uses a GAN architecture to orchestrate loss reconstruction or training.
smplify-x
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
TensorFlowASR
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords
smart-sketch
🖌 photorealistic drawings from simple sketches using NVIDIA's GauGAN
handpose_x
21-keypoint hand detection, 2D hand pose, gesture recognition, PyTorch, handpose
AHANLP
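AHA natural language processing package, providing services including word segmentation, dependency parsing, automatic summarization, semantic similarity computation, LDA topic prediction, and word clouds.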
pytorch-deep-image-matting
PyTorch implementation of deep image matting
KeypointNet
KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations (CVPR2020)
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
jina
An easier way to build neural search on the cloud
t5-pegasus
Chinese generative pre-trained model
speechbrain
A PyTorch-based Speech Toolkit
milvus
An open source embedding vector similarity search engine powered by Faiss, NMSLIB and Annoy