video-transformer

There are 3 repositories under video-transformer topic.

MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
self-supervised-learning action-recognition video-understanding masked-autoencoder transformer vision-transformer video-transformer mae pytorch video-representation-learning video-analysis neurips-2022
Language:Python 1570
transfomers-silicon-research
aliemo / transfomers-silicon-research
Research and Materials on Hardware implementation of Transformer Model
accelerator bert fpga-accelerator gpu-acceleration hardware-designs pretrained-models processing-in-memory research-paper systolic-arrays transformer video-transformer fpga natural-language-processing vision-transformer
Language:Jupyter Notebook 253
junchen14 / Multi-Modal-Transformer
The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised learning models. Additionally, it also collects many useful tutorials and tools in these related domains.
image-transformer language efficiency-transformer vision-transformer video-transformer video-language mlp-mixer transformer-readling-list multi-modal multi-modal-cvpr2021
225
fcakyon / video-transformers
Easiest way of fine-tuning HuggingFace video classification models
classification layer machine-learning neptune onnx onnxruntime tensorboard video accelerate evaluate huggingface pytorch transformers video-classification wandb deep-learning python pytorch-video vision video-transformer
Language:Python 145
amazon-science / long-short-term-transformer
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
online-action-detection video-analysis video-transformer
Language:Python 133
MCG-NJU / VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
pytorch mae masked-autoencoder neurips-2022 transformer video-transformer video-understanding videomae action-detection
Language:Python 67
mlvlab / vid-TLDR
Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
cvpr2024 computer-vision efficient-vision-transformers video-transformer token-pruning token-merging
Language:Python 46
mdnuruzzamanKALLOL / VideoMAE_Tensorflow
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
action-recognition mae pytorch self-supervised-learning tensorflow tensorflow2 transformer video-analytics video-representation-learning video-transformer vision-transformer
Language:Python
shotstack / mp4-to-mov-demo
A demo project showing how to convert an MP4 video to MOV format using the Shotstack Ingest API.
converter mp4-converter quicktime-codec transcoding video video-processing video-transcoding video-transformer
Language:JavaScript