MingJian.L (matrixgame2018)

matrixgame2018

Geek Repo

Company:Monash University / AIRS

Location:shenzhen China

Home Page:https://mingjian.liang2000@gmail.com

Twitter:@matrixMingzai

Github PK Tool:Github PK Tool

MingJian.L's starred repositories

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonLicense:GPL-3.0Stargazers:8845Issues:56Issues:513

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:6286Issues:40Issues:296

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonLicense:MITStargazers:4864Issues:64Issues:82

lagent

A lightweight framework for building LLM-based agents

Language:PythonLicense:Apache-2.0Stargazers:1749Issues:17Issues:62

OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Language:PythonLicense:NOASSERTIONStargazers:1226Issues:23Issues:44

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonLicense:Apache-2.0Stargazers:1163Issues:18Issues:63

Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

SAM-Med2D

Official implementation of SAM-Med2D

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:834Issues:13Issues:66

DriveLM

[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering

Language:HTMLLicense:Apache-2.0Stargazers:798Issues:19Issues:80

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonLicense:MITStargazers:683Issues:15Issues:57

Vary-toy

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

Language:PythonLicense:Apache-2.0Stargazers:537Issues:27Issues:7

GPT4RoI

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Language:PythonLicense:NOASSERTIONStargazers:496Issues:8Issues:47

Awesome-Text-to-3D

A growing curation of Text-to-3D, Diffusion-to-3D works.

CaFo

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

Language:PythonLicense:MITStargazers:343Issues:12Issues:12

allenact

An open source framework for research in Embodied-AI from AI2.

Language:PythonLicense:NOASSERTIONStargazers:314Issues:10Issues:91

Drive-WM

[CVPR 2024] A world model for autonomous driving.

Language:PythonLicense:Apache-2.0Stargazers:282Issues:22Issues:5

DriveDreamer

[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Language:PythonLicense:Apache-2.0Stargazers:277Issues:8Issues:78

MedFM

Official Repository of NeurIPS 2023 - MedFM Challenge

MMIF-DDFM

[ICCV 2023 Oral] Official implementation for "DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion."

SegMiF

ICCV2023 | Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation

OmniScient-Model

This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:88Issues:10Issues:4

distill-bev

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)

Language:PythonLicense:MITStargazers:47Issues:1Issues:9

Lifelong-MonoDepth

About official Pytorch implementation of "Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric Depth Estimation

Language:PythonLicense:MITStargazers:10Issues:0Issues:0

Active_room_segmentation

Code for Human cognition-inspired active room segmentation

Language:PythonLicense:MITStargazers:8Issues:1Issues:1
Language:HTMLLicense:MITStargazers:2Issues:0Issues:0

MedFCMEA

NeurIPS 2023 - Challenge / NeurIPS 2024 Dataset Track

Language:PythonStargazers:1Issues:3Issues:0