LI Minghan (MinghanLi)

MinghanLi

Geek Repo

Company:Hong Kong Polytechnic University

Location:Hong Kong

Home Page:https://sites.google.com/view/minghanli-homepage/academic

Github PK Tool:Github PK Tool

LI Minghan's starred repositories

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9535Issues:0Issues:0

moment_detr

[NeurIPS 2021] Moment-DETR code and QVHighlights dataset

Language:PythonLicense:MITStargazers:257Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1977Issues:0Issues:0

deep-residual-networks

Deep Residual Learning for Image Recognition

License:MITStargazers:6395Issues:0Issues:0
Language:PythonStargazers:190Issues:0Issues:0

open-images-dataset

Open Images is a dataset of ~9 million images that have been annotated with image-level labels and bounding boxes spanning thousands of classes.

Stargazers:982Issues:0Issues:0

conceptual-12m

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

License:NOASSERTIONStargazers:351Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:774Issues:0Issues:0

localized-narratives

Localized Narratives

Language:HTMLLicense:Apache-2.0Stargazers:79Issues:0Issues:0

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:4430Issues:0Issues:0

LongVA

Long Context Transfer from Language to Vision

Language:PythonLicense:Apache-2.0Stargazers:276Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2123Issues:0Issues:0

TokenPacker

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".

Language:PythonStargazers:139Issues:0Issues:0

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Stargazers:353Issues:0Issues:0

Video-Bench

A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!

Language:PythonStargazers:111Issues:0Issues:0

MMDU

Official repository of MMDU dataset

Language:PythonLicense:Apache-2.0Stargazers:58Issues:0Issues:0

ShareGPT4Video

An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Language:PythonStargazers:1208Issues:0Issues:0

Valley

The official repository of "Video assistant towards large language model makes everything easy"

Language:PythonStargazers:196Issues:0Issues:0

PLLaVA

Official repository for the paper PLLaVA

Language:PythonStargazers:532Issues:0Issues:0

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonLicense:Apache-2.0Stargazers:2838Issues:0Issues:0

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:18096Issues:0Issues:0
Language:PythonStargazers:35Issues:0Issues:0

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language:PythonLicense:MITStargazers:145Issues:0Issues:0

Video-guided-Machine-Translation

Starter code for the VMT task and challenge

Language:PythonStargazers:50Issues:0Issues:0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:11109Issues:0Issues:0

pycocoevalcap

Python 3 support for the MS COCO caption evaluation tools

Language:PythonLicense:NOASSERTIONStargazers:297Issues:0Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1113Issues:0Issues:0

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language:PythonLicense:MITStargazers:696Issues:0Issues:0

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonLicense:Apache-2.0Stargazers:1941Issues:0Issues:0

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

Language:PythonLicense:UnlicenseStargazers:130999Issues:0Issues:0