Zi-Yuan Hu's repositories
Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
GlaDOS-auto-checkin
Automatic daily check-in for GLADOS, with multi-account support
annotated_deep_learning_paper_implementations
🧑🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Ask-Anything
[CVPR2024][VideoChatGPT] ChatGPT with video understanding! Also supports many more LMs, such as MiniGPT-4, StableLM, and MOSS.
bolei_awesome_posters
CVPR and NeurIPS poster examples and templates. May we have in-person poster sessions soon!
cheatsheets-ai
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
CLEVA
[EMNLP 2023 Demo] CLEVA: Chinese Language Models EVAluation Platform
Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
CVPR2020_Poster
Speech2Action CVPR Poster Source Code
helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
latex_paper_writing_tips
Tips for Writing a Research Paper using LaTeX
LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
ST-LLM
Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
TempCompass
A benchmark to evaluate the temporal perception ability of Video LLMs
Video-ChatGPT
"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Visual-Table
Stay tuned!