FengYidan's starred repositories

CML

offical implementation of "Calibrating Multimodal Learning" on ICML 2023

Language:PythonStargazers:18Issues:0Issues:0

CM-VQVAE

Research code for the WACV2024 paper "Complementary-Contradictory Feature Regularization against Multimodal Overfitting"

Language:PythonLicense:MITStargazers:4Issues:0Issues:0

Multimodal-Learning-with-Alternating-Unimodal-Adaptation

Multimodal Learning Method MLA for CVPR 2024

Language:PythonStargazers:21Issues:0Issues:0

FactorCL

[NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy

Language:Jupyter NotebookLicense:MITStargazers:47Issues:0Issues:0

mPLUG-Owl

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Language:PythonLicense:MITStargazers:1973Issues:0Issues:0
Language:PythonLicense:MITStargazers:72Issues:0Issues:0

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language:PythonLicense:Apache-2.0Stargazers:1451Issues:0Issues:0

torchio

Medical imaging toolkit for deep learning

Language:PythonLicense:Apache-2.0Stargazers:1972Issues:0Issues:0

sam

SAM: Sharpness-Aware Minimization (PyTorch)

Language:PythonLicense:MITStargazers:1665Issues:0Issues:0

multi-domain-imbalance

[ECCV 2022] Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization, and Beyond

Language:PythonLicense:MITStargazers:120Issues:0Issues:0

rtdl

Research on Tabular Deep Learning: Papers & Packages

Language:PythonLicense:Apache-2.0Stargazers:820Issues:0Issues:0

VAP_Former

[MICCAI-2023]Visual-Attribute Prompt Learning for Progressive Mild Cognitive Impairment Prediction

Language:PythonStargazers:15Issues:0Issues:0
Language:PythonStargazers:9Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

Med-PaLM

Towards Generalist Biomedical AI

Language:PythonLicense:MITStargazers:248Issues:0Issues:0

kzhang-cs205-l0-smoothing

Parallel Image Smoothing via L0 Gradient Minimization

Language:PythonStargazers:42Issues:0Issues:0

factorized

[ICLR 2019] Learning Factorized Multimodal Representations

Language:PythonLicense:MITStargazers:63Issues:0Issues:0

TLC

PyTorch implementation of the paper "Trustworthy Long-Tailed Classification" (CVPR 2022)

Language:PythonStargazers:54Issues:0Issues:0
Language:PythonStargazers:58Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:16Issues:0Issues:0

ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Language:PythonLicense:Apache-2.0Stargazers:1139Issues:0Issues:0

ONE-PEACE

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Language:PythonLicense:Apache-2.0Stargazers:856Issues:0Issues:0

perceiver-multi-modality-pytorch

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Language:PythonLicense:MITStargazers:36Issues:0Issues:0

MADDi

This repository is for the Multimodal Alzheimer’s Disease Diagnosis framework (MADDi).

Language:Jupyter NotebookLicense:MITStargazers:65Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:70Issues:0Issues:0

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Language:HTMLLicense:MITStargazers:437Issues:0Issues:0

HighMMT

[TMLR 2022] High-Modality Multimodal Transformer

Language:PythonLicense:MITStargazers:97Issues:0Issues:0

ViPT

[CVPR23] Visual Prompt Multi-Modal Tracking

Language:PythonLicense:MITStargazers:230Issues:0Issues:0

IOPaint

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Language:PythonLicense:Apache-2.0Stargazers:17449Issues:0Issues:0

inat_comp

iNaturalist competition details

Language:PythonLicense:MITStargazers:707Issues:0Issues:0