adeljalalyousif's repositories
adeljalalyousif
Config files for my GitHub profile.
Ala_eddin
A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)
attend-and-tell-2015_ipynb
Implementing and summarizing interesting research papers in Machine Learning and Neural Network domain.
BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
delving-deeper-into-the-decoder-for-video-captioning
Source code for Delving Deeper into the Decoder for Video Captioning
GL-RG
The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".
HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
Image-Captioning
Image Captioning using CNN and Transformer.
image_captioning_Attend-and-Tell_DeepRNN
Tensorflow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
Install-OpenCV-Jetson-Nano
OpenCV installation script with CUDA and cuDNN support
Jetson-Nano-image
Jetson Nano image with deep learning frameworks
Jetson-Nano-Ubuntu-20-image
Jetson Nano with Ubuntu 20.04 image
long-short-term-transformer
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
my-hello-word22
my Description is this the first project
PDVC
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
PyTorch-Beam-Search-Decoding
PyTorch implementation of beam search decoding for seq2seq models
pytorch-video-feature-extractor
A repository for extract CNN features from videos using pytorch
self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
TeSTra
Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
VASTA
A Video-to-Text Framework
Video-Captioning-Transformer
这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍者欣赏网络视频、感知周围环境,促进“无障碍视频”的发展。
Video-Captioning_shynie
Video Captioning is an encoder decoder mode based on sequence to sequence learning
xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
yunjey__pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers