hzhang57's repositories

Language:HTMLStargazers:0Issues:0Issues:0

2prime.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

License:MITStargazers:0Issues:0Issues:0

multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥

License:MITStargazers:0Issues:0Issues:0

Paper-Implementation-Template

A simple reproducible template to implement AI research papers

License:MITStargazers:0Issues:0Issues:0

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Stargazers:0Issues:0Issues:0

lightning-sam

Fine-tune Segment-Anything Model with Lightning Fabric.

License:Apache-2.0Stargazers:0Issues:0Issues:0

GSS

[CVPR 2023] Official repository of Generative Semantic Segmentation

Stargazers:0Issues:0Issues:0

Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: MSC, CeCo (CVPR 2023)

License:MITStargazers:0Issues:0Issues:0

openai-cookbook

Examples and guides for using the OpenAI API

Stargazers:0Issues:0Issues:0

X-Decoder

Official Implementation of X-Decoder for generalized decoding for pixel, image and language

License:MITStargazers:0Issues:0Issues:0

LaViLa

Code release for "Learning Video Representations from Large Language Models"

License:MITStargazers:0Issues:0Issues:0

mega

Sequence modeling with Mega.

License:NOASSERTIONStargazers:0Issues:0Issues:0

GLIP

Grounded Language-Image Pre-training

License:MITStargazers:0Issues:0Issues:0

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Stargazers:0Issues:0Issues:0

awesome-vision-language-pretraining-papers

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

Stargazers:0Issues:0Issues:0

pytorch_scatter

PyTorch Extension Library of Optimized Scatter Operations

License:MITStargazers:0Issues:0Issues:0

SimCLR

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.

License:MITStargazers:0Issues:0Issues:0

METER

METER: A Multimodal End-to-end TransformER Framework

License:MITStargazers:0Issues:0Issues:0

CogVideo

Text-to-video generation.

Stargazers:0Issues:0Issues:0

Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

Stargazers:0Issues:0Issues:0

CLIP

Contrastive Language-Image Pretraining

License:MITStargazers:0Issues:0Issues:0

Neighborhood-Attention-Transformer

[Preprint] Neighborhood Attention Transformer, 2022

License:MITStargazers:0Issues:0Issues:0

VideoMAE

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

License:NOASSERTIONStargazers:0Issues:0Issues:0

HowToLiveLonger

程序员延寿指南 | A programmer's guide to live longer

License:UnlicenseStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

qna

[CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention

License:MITStargazers:0Issues:0Issues:0

behave-dataset

code to access BEHAVE dataset

Stargazers:0Issues:0Issues:0

Group-Contextualization

[CVPR22] Group Contextualization for Video Recognition

License:Apache-2.0Stargazers:0Issues:0Issues:0

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

License:NOASSERTIONStargazers:0Issues:0Issues:0

HowToCook

程序员在家做饭方法指南。

License:UnlicenseStargazers:0Issues:0Issues:0