Juan A. Rodriguez (joanrod)

joanrod

Geek Repo

Company:ServiceNow Research

Location:Montreal

Home Page:joanrod.github.io

Twitter:@joanrod_ai

Github PK Tool:Github PK Tool

Juan A. Rodriguez's repositories

ocr-vqgan

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers

figure-diffusion

Generating figures from research papers, using textual captions from the paper.

Language:PythonLicense:NOASSERTIONStargazers:11Issues:1Issues:3

paper2figure-dataset

Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision)

Language:PythonStargazers:2Issues:2Issues:0
License:MITStargazers:1Issues:0Issues:0

galai

Model API for GALACTICA

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

CLIP

Contrastive Language-Image Pretraining

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

CodeGen

CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cross-modal-retrieval-with-triplet-network

Text-to-Image and Image-to-Text model retrieval

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

tracknet

TrackNet: A Triplet metric-based method for Multi-Target Multi-Camera Vehicle Tracking

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

gigagan-pytorch

Implementation of GigaGAN, new SOTA GAN out of Adobe

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:HTMLLicense:MITStargazers:0Issues:1Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

moviepy

Video editing with Python

License:MITStargazers:0Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

torch-fidelity

High-fidelity performance metrics for generative models in PyTorch

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

v-diffusion-pytorch

v objective diffusion inference code for PyTorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0