cross-modal-learning

There are 2 repositories under cross-modal-learning topic.

KimMeen / Time-LLM
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
cross-modal-learning cross-modality deep-learning language-model large-language-models machine-learning multimodal-deep-learning multimodal-time-series prompt-tuning time-series time-series-analysis time-series-forecast time-series-forecasting
Language:Python 1573
whwu95 / Cap4Video
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
cross-modal-learning video-language-understanding video-text-retrieval video-understanding
Language:Python 248
MohamedAfham / CrossPoint
Official implementation of "CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding" (CVPR, 2022)
3d-point-clouds cross-modal-learning deep-learning few-shot-learning object-classification point-cloud self-supervised-learning transfer-learning unsupervised-learning
Language:Python 243
whwu95 / Text4Vis
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
cross-modal-learning transfer-learning video-recognition video-understanding action-recognition
Language:Python 205
whwu95 / BIKE
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
action-recognition cross-modal-learning video-language-understanding video-recognition video-understanding
Language:Python 162
choyingw / Cross-Modal-Perceptionist
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
3d 3d-models 3dmm biometrics cognitive-science computer-vision cross-modal-learning cvpr cvpr2022 deep-learning machine-learning pytorch speech speech-synthesis speech-to-face
Language:Python 127
Toytiny / CMFlow
[CVPR 2023 Highlight 💡] Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision
4d-radar automotive-radar autonomous-driving cross-modal-learning deep-learning ego-motion-estimation mobile-robotics motion-segmentation optical-flow scene-flow
Language:Python 121
RunpeiDong / ACT
[ICLR 2023] Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
3d-point-clouds cross-modal-learning representation-learning self-supervised-learning
Language:Python 100
mako443 / Text2Pos-CVPR2022
Code, dataset and models for our CVPR 2022 publication "Text2Pos"
pytorch deep-learning localization nlp language-processing cross-modal cross-modal-retrieval cross-modal-learning computer-vision cvpr cvpr2022
Language:Python 40
knightyxp / DGL
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
cross-modal-learning cross-modal-retrieval parameter-efficient-tuning prompt-tuning video-language-understanding video-text-retrieval
Language:Python 32
frank-chris / ImageTextRetrieval
In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Projection Learning model and study their performance. We also propose a modified Deep Cross-Modal Projection Learning model that uses a different image feature extractor. We evaluate the model’s performance on image-text retrieval on a fashion clothing dataset.
cross-modal-learning cross-modal-retrieval flask image-text-retrieval pytorch tensorflow
Language:Jupyter Notebook 11
Markin-Wang / CAMANet
[IJBHI 2023] This is the official implementation of CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation accepted to IEEE Journal of Biomedical and Health Informatics (J-BHI), 2023.
cross-modal-learning medical-report-generation radiology-report-generation
Language:Python 8
verlab / StraightToThePoint_CVPR_2020
Original PyTorch implementation of the code for the paper "Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual Data" at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
computer-vision vision-and-language reinforcement-learning agent video-processing video-analysis video-summarization hyperlapse fast-forward video-fast-forward multimodal-deep-learning multimodal-learning cross-modal-learning text-and-image cvpr
Language:Python 8
IGITUGraz / MemoryDependentComputation
Code for Limbacher, T., Özdenizci, O., & Legenstein, R. (2022). Memory-enriched computation and learning in spiking neural networks through Hebbian plasticity. arXiv preprint arXiv:2205.11276.
associations babi-tasks cross-modal-learning hebbian-learning memory-networks neural-networks one-shot-learning python pythorch question-answering recurrent-neural-networks reinforcement-learning spiking-neural-networks
Language:Python 6
codiceSpaghetti / T4SA-2.0
This project creates the T4SA 2.0 dataset, i.e. a big set of data to train visual models for Sentiment Analysis in the Twitter domain using a cross-modal student-teacher approach.
computer-vision cross-modal-learning dataset-creation nlp student-teacher-learning twitter-sentiment-analysis
Language:Jupyter Notebook 4
PrithivirajDamodaran / WhatTheFood
An intentionally simple Image to Food cross-modal search. Created by Prithiviraj Damodaran.
cross-modal-retrieval cross-modal cross-modal-learning multimodal
4
Qwinpin / DanceBERT-Masked-Motion-Modeling
bert cross-modal-learning dance-generation motion-generation pytorch
Language:Jupyter Notebook 3
kjanjua26 / Do_Cross_Modal_Systems_Leverage_Semantic_Relationships
This is the code for our ICCV'19 paper on cross-modal learning and retrieval.
semantic-similarity cross-modal-learning retrieval iccv tensorflow multi-modal-learning scene-understanding retrieval-systems caption-retreival
1
basiclab / TrajPrompt
[ECCV 2024] Official Implementation of "TrajPrompt: Aligning Color Trajectory with Vision-Language Representations"
birds-eye-view cross-modal-learning prompt-tuning trajectory-prediction vision-language
0
TataMoktari / CrossModal_GAN
We design a cross-modal GAN which learns image-to-image modality transformation across cross-domain. This network is able to synthesize Infrared images from VISIBLE images for VEDAI dataset
cross-modal-learning infrared vedai visible
Language:Python 0

cross-modal-learning

KimMeen / Time-LLM

whwu95 / Cap4Video

MohamedAfham / CrossPoint

whwu95 / Text4Vis

whwu95 / BIKE

choyingw / Cross-Modal-Perceptionist

Toytiny / CMFlow

RunpeiDong / ACT

mako443 / Text2Pos-CVPR2022

knightyxp / DGL

frank-chris / ImageTextRetrieval

Markin-Wang / CAMANet

verlab / StraightToThePoint_CVPR_2020

IGITUGraz / MemoryDependentComputation

codiceSpaghetti / T4SA-2.0

PrithivirajDamodaran / WhatTheFood

Qwinpin / DanceBERT-Masked-Motion-Modeling

kjanjua26 / Do_Cross_Modal_Systems_Leverage_Semantic_Relationships

basiclab / TrajPrompt

TataMoktari / CrossModal_GAN