vhzy - Giters

Aaron Han's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonApache-2.040316 393 1292

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language:HTMLMIT10679 266 46

annotated-transformer

An annotated implementation of the Transformer paper.

Language:Jupyter NotebookMIT5520 64 85

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3053 127 18

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonMIT1708 17 79

Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

1644 50 13

awesome-stable-diffusion

Curated list of awesome resources for the Stable Diffusion AI Model.

MPL-2.01468 40 12

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1178 33 4

atlas

Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03299)

Language:PythonNOASSERTION506 13 18

ChineseNMT

ChineseNMT: Translate English to Chinese with PyTorch Implementation of Transformer

Language:Python433 6 16

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

354 5 25

awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

Language:TeXMIT309 120

LongVA

Long Context Transfer from Language to Vision

Language:PythonApache-2.0283 8 12

MA-LMM

(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Language:PythonMIT208 4 32

hands-on-research-tutorial

《动手做科研》面向科研初学者，一步一步地展示如何入门人工智能科研

Language:Jupyter Notebook15700

diffusion-models-class-CN

Materials for the Hugging Face Diffusion Models Course

Language:Jupyter NotebookApache-2.015300

Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

144 90

IG-VLM

Language:PythonBSD-3-Clause99 4 7

VideoAgent

This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)

Language:PythonApache-2.074 3 5

VideoTree

Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"

Language:PythonMIT60 2 4

Flash-VStream

Please refer to our official repo at https://github.com/IVGSZ/Flash-VStream.

Language:PythonApache-2.043 2 4

Soda

Search, organize, discover anything!

Language:Jupyter NotebookApache-2.04300

LongVLM

Language:Python41 4 2

LangRepo

Language Repository for Long Video Understanding

Language:PythonMIT27 2 1

Koala-video-llm

Language:PythonBSD-3-Clause27 1 5

mvu

Multimodal Video Understanding Framework (MVU)

Language:PythonMIT2200

explore-eqa

Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"

Language:Python21 7 4

Sealing

[NAACL 2024] Official Implementation of paper "Self-Adaptive Sampling for Efficient Video Question Answering on Image--Text Models"

Language:PythonMIT9 40

Paper-Writing-Tips

该仓库是MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips

300

CVPR24Track-LongVideo

Language:PythonBSD-3-Clause100