Awj2021's repositories

awesome-labels-learning

The papers and projects with multi-label learning

Stargazers:1Issues:0Issues:0

diffusion-classifier

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

Language:PythonStargazers:1Issues:0Issues:0

psla

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

Language:PythonLicense:BSD-3-ClauseStargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

ProMix

PyTorch Code for ProMix: Combating Label Noise via Maximizing Clean Sample Utility

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ControlNet

Let us control diffusion models!

License:Apache-2.0Stargazers:0Issues:0Issues:0

da-fusion

Effective Data Augmentation With Diffusion Models

License:MITStargazers:0Issues:0Issues:0

DiffMIC

[MICCAI 2023] DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification

Language:PythonStargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FROSTER

The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

Stargazers:0Issues:0Issues:0

ICLR24

Official code for ICLR 2024 paper, "A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation"

Stargazers:0Issues:0Issues:0

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

License:Apache-2.0Stargazers:0Issues:0Issues:0

LRA-diffusion

This is the source code of LRA-diffusion for learning from noisy labels

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MKT

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

open_clip

An open source implementation of CLIP.

License:NOASSERTIONStargazers:0Issues:0Issues:0

PVT

Pyramid Transformer Networks for Our Own dataset.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

quilt1m

[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.

License:MITStargazers:0Issues:0Issues:0

ResourceOfAI

Add some codes and commands which are common in research

Stargazers:0Issues:0Issues:0

SoTTA

This is the official PyTorch Implementation of "SoTTA: Robust Test-Time Adaptation on Noisy Data Streams (NeurIPS '23)" by Taesik Gong*, Yewon Kim*, Taeckyung Lee*, Sorn Chottananurak, and Sung-Ju Lee (* Equal contribution).

License:MITStargazers:0Issues:0Issues:0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

License:MITStargazers:0Issues:0Issues:0

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

License:MITStargazers:0Issues:0Issues:0

TestProjects

Mainly including some test files, like jupyter notebooks

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

tiny-transformers

[ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

U-Mamba

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

License:Apache-2.0Stargazers:0Issues:0Issues:0

unionnet

Replementation of unionnet "Deep Learning from Multiple Noisy Annotators as A Union"

Language:PythonStargazers:0Issues:0Issues:0

Vim

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Stargazers:0Issues:0Issues:0

VMamba

VMamba: Visual State Space Models

Language:PythonStargazers:0Issues:0Issues:0

VPD

[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.

License:MITStargazers:0Issues:0Issues:0