smksyj's repositories
diffusion-forcing
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
parler-tts
Inference and training library for high-quality TTS models.
LightSB-Matching
Light and Optimal Schrödinger Bridge Matching official PyTorch implementation
pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
qa-lora
PyTorch code for the paper "QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models"
ONE-PEACE
A general representation model across vision, audio, and language modalities.
DragGAN
Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold
basic-algo-lecture
Lecture materials for Barkingdog's Practical Algorithms course
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
gpt4free
Decentralising the AI industry, just some language model APIs...
tango
Code and models for the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
StableLM
StableLM: Stability AI Language Models
JARVIS
JARVIS, a system to connect LLMs with ML community
the-algorithm
Source code for Twitter's Recommendation Algorithm
open_flamingo
An open-source framework for training large multimodal models
SpeechDewarping
Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023
Visual_Speech_Recognition_for_Multiple_Languages
Visual Speech Recognition for Multiple Languages
Diffusion-GAN
Official PyTorch implementation for paper: Diffusion-GAN: Training GANs with Diffusion
erasing
Erasing Concepts from Diffusion Models
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily search and find personal or work documents by asking questions in everyday language.
TriAAN-VC
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
langchain
⚡ Building applications with LLMs through composability ⚡
WaveDiff
Official PyTorch implementation of the paper "Wavelet Diffusion Models are fast and scalable Image Generators" (CVPR'23)
pal
PaL: Program-Aided Language Models
LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
line-bot-sdk-java
Java SDK for Messaging API BOT