Xinyu Huang (xinyu1205)

xinyu1205

Geek Repo

Company:Fudan University

Location:Shanghai, China

Home Page:https://xinyu1205.github.io

Github PK Tool:Github PK Tool

Xinyu Huang's repositories

recognize-anything

Open-source and strong foundation image recognition models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2549Issues:26Issues:146

robust-loss-mlml

Code for paper: Simple and Robust Loss Design for Multi-Label Learning with Missing Labels

Language:PythonLicense:MITStargazers:47Issues:6Issues:2

IDEA-pytorch

Code for paper: IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training [ACM MM2022]

Language:PythonLicense:MITStargazers:8Issues:3Issues:0

ActionCLIP

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

ALBEF

Code for ALBEF: a new vision-language pre-training method

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

CLIP

Contrastive Language-Image Pretraining

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

daily_fudan

一键平安复旦小脚本,自动化快速上报疫情

Language:PythonStargazers:0Issues:2Issues:0

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Tag2Text & Stable Diffusion & BLIP & Whisper - Automatically Recognize, Detect, Segment and Generate Anything with Image, Text, and Speech Inputs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

GroundingDINO

The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

object_detection_metrics

Object Detection Metrics

License:MITStargazers:0Issues:0Issues:0

query2labels

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

ssl-small

Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:2Issues:0
Stargazers:0Issues:2Issues:0