TechMN (techmn)

TechMN's starred repositories

ImageProcessing

Basic Image Processing using C#

Language: C# | License: MIT | Stargazers: 1 | Issues: 0

ImageBlending

Alpha Blend two images in MATLAB

Language: Matlab | License: MIT | Stargazers: 2 | Issues: 0

Face-Detection

Face Detection using EmguCV and C#

Language: C# | License: MIT | Stargazers: 1 | Issues: 0

Computer-Vision-Video-Lectures

A curated list of free, high-quality, university-level courses with video lectures related to the field of Computer Vision.

License: CC0-1.0 | Stargazers: 1 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 0

changebind

A Hybrid Change Encoder for Remote Sensing Change Detection (IGARSS 2024)

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 0

satmae_pp

Official repository for "Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery" (CVPR 2024)

Language: Python | License: Apache-2.0 | Stargazers: 65 | Issues: 0

elgcnet

ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection

Language: Python | License: Apache-2.0 | Stargazers: 17 | Issues: 0

DDAM-PS

DDAM-PS: Diligent Domain Adaptive Mixer for Person Search (WACV 2024)

Language: Python | Stargazers: 10 | Issues: 0

ScratchFormer

ScratchFormer: Remote Sensing Change Detection With Transformers Trained from Scratch

Language: Python | Stargazers: 34 | Issues: 0

Clip2Protect

[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".

Language: Python | Stargazers: 96 | Issues: 0

Multimodality-Representation-Learning

This repository provides a comprehensive collection of research papers on multimodal representation learning, all of which are cited and discussed in the recently accepted survey: https://dl.acm.org/doi/abs/10.1145/3617833

Stargazers: 61 | Issues: 0

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language: Python | License: CC-BY-4.0 | Stargazers: 1015 | Issues: 0

ViFi-CLIP

[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".

Language: Python | License: MIT | Stargazers: 221 | Issues: 0

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language: Python | License: MIT | Stargazers: 548 | Issues: 0

awesome-transformers-in-medical-imaging

A collection of resources on applications of Transformers in Medical Imaging.

Stargazers: 1132 | Issues: 0