MungTso (mengcaopku)

Company: Peking University

MungTso's starred repositories

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 792 | Issues: 0

llm-paper-daily

Daily updated LLM papers. Subscriptions welcome 👏 If you like it, please give it a star 🌟

Stargazers: 814 | Issues: 0

Bunny

A family of lightweight multimodal models.

Language: Python | License: Apache-2.0 | Stargazers: 832 | Issues: 0

Shot2Story

A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

Language: Python | Stargazers: 77 | Issues: 0

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language: Python | License: Apache-2.0 | Stargazers: 1237 | Issues: 0

ImageNet-1K

ImageNet-1K data download and processing for use as a dataset

Language: Python | Stargazers: 48 | Issues: 0

Awesome-Incremental-Learning

Awesome Incremental Learning

Stargazers: 3634 | Issues: 0

Awesome-Continual-Learning

A curated list of Continual Learning papers and BibTeX entries

Language: TeX | Stargazers: 124 | Issues: 0

TCL

Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Language: Python | License: MIT | Stargazers: 257 | Issues: 0

DenseSSM

A repository for DenseSSMs

Language: Python | Stargazers: 85 | Issues: 0

TriDet

[CVPR 2023] Code for the paper "TriDet: Temporal Action Detection with Relative Boundary Modeling"

Language: Python | License: MIT | Stargazers: 157 | Issues: 0

mega.pytorch

Memory Enhanced Global-Local Aggregation for Video Object Detection, CVPR 2020

Language: Python | License: NOASSERTION | Stargazers: 566 | Issues: 0

PTSEFormer

[ECCV 2022] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection

Language: Python | License: MIT | Stargazers: 28 | Issues: 0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | Stargazers: 15328 | Issues: 0

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 815 | Issues: 0

SoM

Set-of-Mark Prompting for LMMs

Language: Python | License: MIT | Stargazers: 1059 | Issues: 0

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Stargazers: 809 | Issues: 0

Language: Shell | Stargazers: 726 | Issues: 0

Awesome-Embodied-Agent-with-LLMs

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

Stargazers: 772 | Issues: 0

NavGPT

[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Language: Python | License: MIT | Stargazers: 118 | Issues: 0

QD-DETR

Official PyTorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

Language: Python | License: NOASSERTION | Stargazers: 187 | Issues: 0

MedLLMsPracticalGuide

A curated list of practical guide resources for Medical LLMs (Medical LLMs Tree, Tables, and Papers)

License: MIT | Stargazers: 819 | Issues: 0

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language: Python | License: Apache-2.0 | Stargazers: 1585 | Issues: 0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

License: CC0-1.0 | Stargazers: 16529 | Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

Stargazers: 10992 | Issues: 0

Mesh_Segmentation

Materials on mesh processing, including papers, videos, and code. Updated daily!

Stargazers: 241 | Issues: 0

Awesome-3D-Object-Detection-for-Autonomous-Driving

3D Object Detection for Autonomous Driving: A Comprehensive Survey (IJCV 2023)

Stargazers: 515 | Issues: 0

DeepViewAgg

[CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"

Language: Python | License: NOASSERTION | Stargazers: 219 | Issues: 0

AdaMPI

[SIGGRAPH 2022] Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images

Language: Python | Stargazers: 210 | Issues: 0

3D-aware-Gen

[CSUR 2023] A Survey on Deep Generative 3D-aware Image Synthesis

Language: HTML | License: MIT | Stargazers: 154 | Issues: 0