mengcaopku

followers

following

stars

Peking University

MungTso's starred repositories

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonMIT79200

llm-paper-daily

Daily updated LLM papers. 每日更新 LLM 相关的论文，欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

Bunny

A family of lightweight multimodal models.

Language:PythonApache-2.083200

Shot2Story

A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

Language:Python7700

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language:PythonApache-2.0123700

ImageNet-1K

ImageNet-1K data download, processing for using as a dataset

Language:Python4800

Awesome-Incremental-Learning

Awesome Incremental Learning

Awesome-Continual-Learning

A curated list of Continual Learning papers and BibTeX entries

Language:TeX12400

TCL

code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Language:PythonMIT25700

DenseSSM

A repository for DenseSSMs

Language:Python8500

TriDet

[CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling

Language:PythonMIT15700

mega.pytorch

Memory Enhanced Global-Local Aggregation for Video Object Detection, CVPR2020

Language:PythonNOASSERTION56600

PTSEFormer

[ECCV 2022] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection

Language:PythonMIT2800

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.01532800

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language:PythonApache-2.081500

SoM

Set-of-Mark Prompting for LMMs

Language:PythonMIT105900

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

kinetics-dataset

Language:Shell72600

Awesome-Embodied-Agent-with-LLMs

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

NavGPT

[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Language:PythonMIT11800

QD-DETR

Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

Language:PythonNOASSERTION18700

MedLLMsPracticalGuide

A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

MIT81900

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language:PythonApache-2.0158500

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

CC0-1.01652900

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Mesh_Segmentation

some materials about mesh processing, including papers, videos, codes, and so on. Updating every day!

Awesome-3D-Object-Detection-for-Autonomous-Driving

3D Object Detection for Autonomous Driving: A Comprehensive Survey (IJCV 2023)

DeepViewAgg

[CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"

Language:PythonNOASSERTION21900

AdaMPI

[SIGGRAPH 2022] Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images

Language:Python21000

3D-aware-Gen

[CSUR 2023] A Survey on Deep Generative 3D-aware Image Synthesis

Language:HTMLMIT15400