idejie's starred repositories
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
instructor
structured outputs for llms
Time-Series-Library
A Library for Advanced Deep Time Series Models.
Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
puppet-padlocal
Puppet PadLocal is a Pad Protocol for WeChat
DiffuScene
[CVPR 2024] DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis
Partial2Complete
[ICCV 2023] P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds
Dream2Real
[ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models
ego4d-goalstep
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
N-EPIC-Kitchens
N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.