Lluis Gomez i Bigorda's repositories
TextProposals
Implementation of the method proposed in the papers " TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild" and "Object Proposals for Text Extraction in the Wild" (Gomez & Karatzas), 2016 and 2015 respectively.
TextTopicNet
Self-supervised learning of visual features through embedding images into text topic spaces
single-shot-str
Single Shot Scene Text Retrieval, ECCV 2018. L. Gomez*, A. Mafla*, M. Rusiñol, D. Karatzas.
ST-VQA_Loc
Multimodal grid features and cell pointers for Scene Text Visual Question Answering
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
cvpr2019
Workshop materials for OpenCV day at CVPR 2019 conference
DEXPERT
A Transformer-based object-centric approach for date estimation of historical photographs
IMGrocery-100K
Towards a large-scale dataset of branded food product images
Lottery-Ticket-Hypothesis-in-Pytorch
This repository contains a Pytorch implementation of the paper "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks" by Jonathan Frankle and Michael Carbin that can be easily adapted to any model/dataset.
mcv-m5
Master in Computer Vision - M5 Visual recognition
mutt-office365
A mutt configuration file ready for Office 365
prompt-to-prompt-with-sdxl
An implementation of the Prompt-to-Prompt paper for the SDXL architecture
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
vimrc
The ultimate Vim configuration: vimrc