Jesus Perez-Martin's repositories
video_captioning_datasets
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
visual_syntactic_embedding_video_captioning
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
video_features_extractor
Python implementation of extraction of several visual features representations from videos
attentive_specialized_network_video_captioning
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
SemanticMemes
This project aims to determine the discourses associated with a meme and how these discourses change according to their local reality. For this, we are creating a large dataset of memes collected from Chile’s tweets. And we developed a site for manual classification of tweet’s images
covid19_cuba
Dashboard with data and comparisons on COVID-19 infection in Cuba
image-captioning
Python implementation for an Image Captioning System
trecvid-vtt
Video to Text Description (VTT) task of the TREC Video Retrieval Evaluation (TRECVID)
ActivityNet
This repository is intended to host tools and demos for ActivityNet
annotationsite
Django project for memes annotation process
Controllable_XGating
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
deep-learning-tutorial
Tutorial for Deep Learning
delving-deeper-into-the-decoder-for-video-captioning
Source code for Delving Deeper into the Decoder for Video Captioning
dual_encoding
[CVPR2019] Dual Encoding for Zero-Example Video Retrieval
ECO-pytorch
PyTorch implementation for "ECO: Efficient Convolutional Network for Online Video Understanding", ECCV 2018
emotion-intensity-classification
Python implementation for classifying tweets into emotion intensities
HerokuFiles
Free file storage options for Heroku hosted applications
httpx
A next generation HTTP client for Python. 🦋
mkdocs
Project documentation with Markdown.
nuwa-pytorch
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
Semantics-AssistedVideoCaptioning
Source code for Semantics-Assisted Video Captioning Model
spanish-word-embeddings
Spanish word embeddings computed with different methods and from different corpora
ssh-key-action
GitHub Action that installs SSH key to .ssh
temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding.
video_description_eval
Python implementation for video description evaluation metrics
video_events_detector
Python implementation of an events detector on long videos