Beast code in Giters

baoyb's repositories

2024-AAAI-HPT

Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)

Language:PythonMIT000

Awesome-Scene-Text-Image-Super-Resolution

A collection of papers and resources on scene text image super-resolution.

000

BiFormer

[CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"

MIT000

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookBSD-3-Clause000

CloFormer

The official code of "Rethinking Local Perception in Lightweight Vision Transformer"

MIT000

Contrastive-Learning-NLP-Papers

Paper List for Contrastive Learning for Natural Language Processing

000

darknet

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

NOASSERTION000

DDP-practice

A demo of image classification with PyTorch DDP (DistributedDataParallel) and amp (Automatic Mixed Precision) modules. TODO: Add English version

000

FashionTex

The official implementation of SIGGRAPH 2023 conference paper, FashionTex: Controllable Virtual Try-on with Text and Texture.

MIT000

Fast-BEV

Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline

NOASSERTION000

GLCNet

Official implementation of "Global-Local Context Network for Person Search" in PyTorch.

MIT000

Graphormer

Do Transformers Really Perform Bad for Graph Representation? [NIPS-2021]

000

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

BSD-3-Clause000

MambaIR

A simple baseline for image restoration with state-space model.

Apache-2.0000

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

NOASSERTION000

MMIF-CDDFuse

[CVPR 2023] Official implementation for "CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion."

000

mobile-vision

Mobile vision models and code

NOASSERTION000

MSINet

[CVPR2023] Twins Contrastive Search of Multi-Scale Interaction for Object Re-Identification

000

OpenGait

A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.

000

personal-paper-code-daily

🎓 Automatically Update Some Fields Papers Daily using Github Actions (Update Every 12th hours)

MIT000

Point-cloud-quality-assessment

Collections of papers, databases, and codes targeted at point cloud quality assessment (PCQA), mesh quality assessment (MQA), 3D model quality assessment (3DQA).

000

Qwen-7B

The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.

NOASSERTION000

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

000

SDT

This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR23).

Language:PythonMIT000

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Apache-2.0000

A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent

Apache-2.0000

VTG-GPT

VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT

MIT000

Zero-shot-RIS

[CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"

000

ahwhbc

baoyb's repositories

2024-AAAI-HPT

Awesome-Scene-Text-Image-Super-Resolution

BiFormer

BLIP

CloFormer

Contrastive-Learning-NLP-Papers

darknet

DDP-practice

FashionTex

Fast-BEV

GLCNet

Graphormer

LAVIS

MambaIR

MIGC

MMIF-CDDFuse

mobile-vision

MSINet

OpenGait

personal-paper-code-daily

PICR-Net_ACMMM23

Point-cloud-quality-assessment

Qwen-7B

qwen-sft

RPG-DiffusionMaster

SDT

sentence-transformers

SOLIDER

VTG-GPT

Zero-shot-RIS