joanrod

followers

following

stars

ServiceNow Research

Montreal

joanrod.github.io

Juan A. Rodriguez's repositories

star-vector

ocr-vqgan

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers

Language:Python67 2 10

figure-diffusion

Generating figures from research papers, using textual captions from the paper.

Language:PythonNOASSERTION11 1 3

paper2figure-dataset

Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision)

Language:Python2 20

awesome-tips

MIT100

galai

Model API for GALACTICA

Language:PythonApache-2.01 10

CLIP

Contrastive Language-Image Pretraining

Language:Jupyter NotebookMIT000

CodeGen

CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonBSD-3-Clause000

ControlNet

Let us control diffusion models!

Language:PythonApache-2.0000

cross-modal-retrieval-with-triplet-network

Text-to-Image and Image-to-Text model retrieval

Language:Python000

deforum-stable-diffusion

Language:PythonNOASSERTION000

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language:PythonMIT000

M3-Project

Language:Jupyter Notebook010

M5-Visual-Recognition

Language:Python000

tracknet

TrackNet: A Triplet metric-based method for Multi-Target Multi-Camera Vehicle Tracking

Language:Python000

UPF-Hand-Written-Text-Recognition

Language:Python000

gigagan-pytorch

Implementation of GigaGAN, new SOTA GAN out of Adobe

Language:PythonMIT000

joanrod

000

joanrod.github.io

Language:HTMLMIT010

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:PythonBSD-3-Clause000

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonNOASSERTION000

moviepy

Video editing with Python

MIT000

open_clip

An open source implementation of CLIP.

Language:PythonNOASSERTION000

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonMIT000

torch-fidelity

High-fidelity performance metrics for generative models in PyTorch

Language:PythonNOASSERTION000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0000

v-diffusion-pytorch

v objective diffusion inference code for PyTorch.

Language:PythonMIT000

vdm

Language:Jupyter NotebookApache-2.0000