Cano's starred repositories
DeepLearning-500-questions
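500 Questions on Deep Learning: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan, 2018.06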
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
flash-linux0.11-talk
You call this junk operating system source code? Reading the Linux 0.11 core code like a novel
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
consistency_models
Official repo for consistency models.
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
awesome-systematic-trading
A curated list of awesome libraries, packages, strategies, books, blogs, tutorials for systematic trading.
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
vits_chinese
Best-practice TTS based on BERT and VITS, with some NaturalSpeech features from Microsoft; supports ONNX streaming output.
AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
vits2_pytorch
Unofficial VITS2 TTS implementation in PyTorch