hzhang57's repositories
2prime.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
behave-dataset
code to access BEHAVE dataset
chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
CLIP
Contrastive Language-Image Pretraining
CogVideo
Text-to-video generation.
coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
GLIP
Grounded Language-Image Pre-training
Group-Contextualization
[CVPR22] Group Contextualization for Video Recognition
GSS
[CVPR 2023] Official repository of Generative Semantic Segmentation
HowToLiveLonger
程序员延寿指南 | A programmer's guide to live longer
LaViLa
Code release for "Learning Video Representations from Large Language Models"
lightning-sam
Fine-tune Segment-Anything Model with Lightning Fabric.
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
mega
Sequence modeling with Mega.
METER
METER: A Multimodal End-to-end TransformER Framework
multimodal-maestro
Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥
Neighborhood-Attention-Transformer
[Preprint] Neighborhood Attention Transformer, 2022
openai-cookbook
Examples and guides for using the OpenAI API
Paper-Implementation-Template
A simple reproducible template to implement AI research papers
Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: MSC, CeCo (CVPR 2023)
pytorch_scatter
PyTorch Extension Library of Optimized Scatter Operations
SimCLR
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.
X-Decoder
Official Implementation of X-Decoder for generalized decoding for pixel, image and language