wtmarvel's starred repositories
stable-diffusion
A latent text-to-image diffusion model
generative-models
Generative Models by Stability AI
flash-attention
Fast and memory-efficient exact attention
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
consistency_models
Official repo for consistency models.
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
FaceDetection-DSFD
腾讯优图高精度双分支人脸检测器
Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
voxceleb_trainer
In defence of metric learning for speaker recognition
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
AttentionIsOFFByOne
Implementation of "Attention Is Off By One" by Evan Miller
HADAR
This is an LWIR stereo-hyperspectral database to develop HADAR algorithms for thermal navigation. Based on this database, one can develop algorithms for TeX decomposition to generate TeX vision. One can also develop algorithms about object detection, semantic or scene segmentation, optical or scene flow, stereo depth etc. based on TeX vision instead of traditional RGB or thermal vision.
TTS-TextAnalyzer
TTS Text Analyzer