idejie

idejie

Geek Repo

Company:Peking University

Location:Beijing

Home Page:https://blog.idejie.com

Github PK Tool:Github PK Tool


Organizations
sdunlp

idejie's starred repositories

llama.cpp

LLM inference in C/C++

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:11270Issues:75Issues:13

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:7985Issues:75Issues:295

instructor

structured outputs for llms

Language:PythonLicense:MITStargazers:6710Issues:50Issues:248

Time-Series-Library

A Library for Advanced Deep Time Series Models.

Language:PythonLicense:MITStargazers:5222Issues:58Issues:406

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language:PythonLicense:NOASSERTIONStargazers:2129Issues:33Issues:96

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonLicense:NOASSERTIONStargazers:1938Issues:39Issues:135

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:1073Issues:14Issues:107

Time-LLM

[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:1056Issues:13Issues:113

HumanML3D

HumanML3D: A large and diverse 3d human motion-language dataset.

Language:PythonLicense:MITStargazers:675Issues:8Issues:128

puppet-padlocal

Puppet PadLocal is a Pad Protocol for WeChat

Language:TypeScriptLicense:Apache-2.0Stargazers:608Issues:7Issues:306

LangSplat

Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]

Language:PythonLicense:NOASSERTIONStargazers:569Issues:21Issues:50

PointLLM

[ECCV 2024] PointLLM: Empowering Large Language Models to Understand Point Clouds

PLA

(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

Language:PythonLicense:Apache-2.0Stargazers:240Issues:13Issues:49

3D-VLA

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

DiffuScene

[CVPR 2024] DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis

Language:PythonLicense:NOASSERTIONStargazers:188Issues:13Issues:33

Partial2Complete

[ICCV 2023] P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds

Language:PythonLicense:MITStargazers:138Issues:2Issues:22

referit3d

Code accompanying our ECCV-2020 paper on 3D Neural Listeners.

Language:C++License:MITStargazers:102Issues:2Issues:9

Dream2Real

[ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models

Language:PythonStargazers:45Issues:5Issues:0

CALF

An official implementation of "CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning"

Language:PythonLicense:Apache-2.0Stargazers:45Issues:2Issues:3

CCoT

[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"

Language:PythonLicense:MITStargazers:34Issues:1Issues:3

ego4d-goalstep

Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)

Language:PythonLicense:MITStargazers:27Issues:10Issues:5

3D-VLP

This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).

Language:PythonLicense:MITStargazers:23Issues:5Issues:3

N-EPIC-Kitchens

N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.

Language:PythonStargazers:18Issues:1Issues:0

3dhoi

[3DV 2022] Articulated 3D Human-Object Interactions from RGB Videos: An Empirical Analysis of Approaches and Challenges

ganov2

Winner of CVPR23 EGO4D STA challenge

Language:PythonLicense:Apache-2.0Stargazers:7Issues:1Issues:1
Language:PythonLicense:MITStargazers:5Issues:2Issues:0
Language:PythonStargazers:3Issues:1Issues:0