刘恒 (orzlh)

orzlh

Geek Repo

Company:Wuhan University

Location:Wuhan, China

Home Page:www.bilibili.com

Github PK Tool:Github PK Tool

刘恒's starred repositories

exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Language:PythonLicense:GPL-3.0Stargazers:5645Issues:0Issues:0

composed-video-retrieval

Composed Video Retrieval

Language:PythonLicense:Apache-2.0Stargazers:40Issues:0Issues:0

ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Language:PythonStargazers:4596Issues:0Issues:0

SiTH

[CVPR 2024] SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion

Language:PythonLicense:MITStargazers:95Issues:0Issues:0

T-MASS-text-video-retrieval

Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"

Language:PythonStargazers:37Issues:0Issues:0

xpool

https://layer6ai-labs.github.io/xpool/

Language:PythonStargazers:109Issues:0Issues:0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonLicense:MITStargazers:829Issues:0Issues:0
Language:PythonStargazers:100Issues:0Issues:0
Language:PythonLicense:MITStargazers:17Issues:0Issues:0

Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

Language:PythonLicense:Apache-2.0Stargazers:273Issues:0Issues:0

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language:PythonLicense:MITStargazers:4182Issues:0Issues:0

paco

This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts, and visualization notebooks.

Language:PythonLicense:MITStargazers:263Issues:0Issues:0

ghiaseddin

Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)

Language:Jupyter NotebookLicense:MITStargazers:42Issues:0Issues:0

LaBo

CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification

Language:PythonStargazers:66Issues:0Issues:0

LM4CV

The official implementation of the paper **Learning Concise and Descriptive Attributes for Visual Recognition**

Language:PythonStargazers:38Issues:0Issues:0
Language:PythonStargazers:152Issues:0Issues:0

DUET

[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

Language:PythonLicense:MITStargazers:44Issues:0Issues:0

I2DFormer

Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification" and NeurIPS2022 "I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification"

Language:PythonLicense:GPL-3.0Stargazers:18Issues:0Issues:0

flickr_scraper

Simple Flickr Image Scraper

Language:PythonLicense:AGPL-3.0Stargazers:208Issues:0Issues:0