hzhang57's repositories

2prime.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

Stargazers: 0 · Issues: 0 · Issues: 0

awesome-vision-language-pretraining-papers

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

Stargazers: 0 · Issues: 0 · Issues: 0

behave-dataset

Code to access the BEHAVE dataset

Language: Python · Stargazers: 0 · Issues: 1 · Issues: 0

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language: Jupyter Notebook · Stargazers: 0 · Issues: 0 · Issues: 0

CLIP

Contrastive Language-Image Pretraining

Language: Jupyter Notebook · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

CogVideo

Text-to-video generation.

Stargazers: 0 · Issues: 0 · Issues: 0

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Language: Python · Stargazers: 0 · Issues: 0 · Issues: 0

GLIP

Grounded Language-Image Pre-training

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

Group-Contextualization

[CVPR22] Group Contextualization for Video Recognition

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0

GSS

[CVPR 2023] Official repository of Generative Semantic Segmentation

Language: Python · Stargazers: 0 · Issues: 0 · Issues: 0

HowToCook

A guide to home cooking for programmers.

License: Unlicense · Stargazers: 0 · Issues: 1 · Issues: 0

HowToLiveLonger

A programmer's guide to living longer

License: Unlicense · Stargazers: 0 · Issues: 1 · Issues: 0

LaViLa

Code release for "Learning Video Representations from Large Language Models"

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

lightning-sam

Fine-tune Segment-Anything Model with Lightning Fabric.

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language: Python · License: NOASSERTION · Stargazers: 0 · Issues: 1 · Issues: 0

mega

Sequence modeling with Mega.

Language: Python · License: NOASSERTION · Stargazers: 0 · Issues: 0 · Issues: 0

METER

METER: A Multimodal End-to-end TransformER Framework

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

Neighborhood-Attention-Transformer

[Preprint] Neighborhood Attention Transformer, 2022

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

openai-cookbook

Examples and guides for using the OpenAI API

Language: Python · Stargazers: 0 · Issues: 0 · Issues: 0

Paper-Implementation-Template

A simple reproducible template to implement AI research papers

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: MSC, CeCo (CVPR 2023)

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

pytorch_scatter

PyTorch Extension Library of Optimized Scatter Operations

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

qna

[CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

SimCLR

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

VideoMAE

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python · License: NOASSERTION · Stargazers: 0 · Issues: 1 · Issues: 0
Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0

X-Decoder

Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0