hzhang57

hzhang57's repositories

hzhang57.github.io

Language:HTML1 20

2prime.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT000

Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

000

awesome-vision-language-pretraining-papers

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

000

behave-dataset

code to access BEHAVE dataset

Language:Python010

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language:Jupyter Notebook000

CLIP

Contrastive Language-Image Pretraining

Language:Jupyter NotebookMIT000

CogVideo

Text-to-video generation.

000

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Language:Python000

GLIP

Grounded Language-Image Pre-training

Language:PythonMIT000

Group-Contextualization

[CVPR22] Group Contextualization for Video Recognition

Language:PythonApache-2.0010

GSS

[CVPR 2023] Official repository of Generative Semantic Segmentation

Language:Python000

HowToCook

程序员在家做饭方法指南。

Unlicense010

HowToLiveLonger

程序员延寿指南 | A programmer's guide to live longer

Unlicense010

LaViLa

Code release for "Learning Video Representations from Large Language Models"

Language:PythonMIT000

lightning-sam

Fine-tune Segment-Anything Model with Lightning Fabric.

Language:PythonApache-2.0000

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonNOASSERTION010

mega

Sequence modeling with Mega.

Language:PythonNOASSERTION000

METER

METER: A Multimodal End-to-end TransformER Framework

Language:PythonMIT000

multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥

Language:PythonMIT000

Neighborhood-Attention-Transformer

[Preprint] Neighborhood Attention Transformer, 2022

Language:PythonMIT010

openai-cookbook

Examples and guides for using the OpenAI API

Language:Python000

Paper-Implementation-Template

A simple reproducible template to implement AI research papers

MIT000

Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: MSC, CeCo (CVPR 2023)

Language:PythonMIT000

pytorch_scatter

PyTorch Extension Library of Optimized Scatter Operations

Language:PythonMIT000

qna

[CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention

Language:PythonMIT010

SimCLR

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.

Language:PythonMIT000

VideoMAE

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonNOASSERTION010

vidt

Language:PythonApache-2.0010

X-Decoder

Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language:PythonMIT000