linzhiqiu's repositories
cross_modal_adaptation
Cross-modal few-shot adaptation with CLIP
t2v_metrics
Evaluating text-to-image/video/3D models with VQAScore
digital_chirality
Testing the chirality of digital imaging operations.
visual_gpt_score
VisualGPTScore for visio-linguistic reasoning
CLIP-FlanT5
Training code for CLIP-FlanT5
open_active
Open World Active Learning
modern-resume-theme
A modern static resume template and theme. Powered by Jekyll and GitHub pages.
debiased-pseudo-labeling
[CVPR 2022] Debiased Learning from Naturally Imbalanced Pseudo-Labels
HRNet-Semantic-Segmentation
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
HTML4Vision
A simple HTML visualization tool for computer vision research :hammer_and_wrench:
linzhiqiu.github.io
Zhiqiu Lin's site
nips_policy_learning
NeuralIPS Policy Learning Scripts
PerceptualSimilarity
LPIPS metric. pip install lpips
vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
vl_finetuning
Few-shot Finetuning of CLIP
why-winoground-hard
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022