Juan A. Rodriguez's repositories
figure-diffusion
Generating figures from research papers, using textual captions from the paper.
paper2figure-dataset
Pipeline to create the Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision)
CLIP
Contrastive Language-Image Pretraining
CodeGen
CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
ControlNet
Let us control diffusion models!
cross-modal-retrieval-with-triplet-network
Text-to-Image and Image-to-Text retrieval model
k-diffusion
Karras et al. (2022) diffusion models for PyTorch
tracknet
TrackNet: A Triplet metric-based method for Multi-Target Multi-Camera Vehicle Tracking
gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Megatron-LM
Ongoing research training transformer models at scale
moviepy
Video editing with Python
open_clip
An open source implementation of CLIP.
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
torch-fidelity
High-fidelity performance metrics for generative models in PyTorch
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
v-diffusion-pytorch
v objective diffusion inference code for PyTorch.