LuckyLiYcon's starred repositories

VIBE

Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"

Language:PythonLicense:NOASSERTIONStargazers:2850Issues:0Issues:0
Language:PythonStargazers:19Issues:0Issues:0

MotionBERT

[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"

Language:PythonLicense:Apache-2.0Stargazers:934Issues:0Issues:0

MotionLLM

Official repo of "MotionLLM: Multimodal Motion-Language Learning with Large Language Models"

Stargazers:20Issues:0Issues:0

MotionLLM

[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos

Language:PythonLicense:NOASSERTIONStargazers:177Issues:0Issues:0

MQT-LLaVA

Matryoshka Query Transformer for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:73Issues:0Issues:0

ttskit

text to speech toolkit. 好用的中文语音合成工具箱,包含语音编码器、语音合成器、声码器和可视化模块。

Language:PythonLicense:MITStargazers:1034Issues:0Issues:0

vid2densepose

Convert your videos to densepose and use it on MagicAnimate

Language:PythonLicense:MITStargazers:954Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:81Issues:0Issues:0

navsim

NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking

Language:PythonLicense:Apache-2.0Stargazers:168Issues:0Issues:0

EmbodiedScan

[CVPR 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Language:PythonLicense:Apache-2.0Stargazers:361Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:2618Issues:0Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:12958Issues:0Issues:0

Algorithm-Practice-in-Industry

搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)

Language:PythonLicense:BSD-2-ClauseStargazers:1785Issues:0Issues:0

MiniGPT4Qwen

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.

Language:Jupyter NotebookStargazers:291Issues:0Issues:0

so-large-lm

大模型基础: 一文了解大模型基础知识

Stargazers:1733Issues:0Issues:0

tiny-universe

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Language:PythonStargazers:487Issues:0Issues:0

self-llm

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5999Issues:0Issues:0

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10096Issues:0Issues:0

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonLicense:MITStargazers:609Issues:0Issues:0

PLLaVA

Official repository for the paper PLLaVA

Language:PythonStargazers:458Issues:0Issues:0

NanoGPT-Pytorch2.0-Implementation

This is a repo for my NanoGPT Pytorch2.0 Implementation when torch2.0 released soon, faster and simpler, a good tutorial learning GPT.

Language:PythonLicense:Apache-2.0Stargazers:59Issues:0Issues:0

EEG-To-Text

code for AAAI2022 paper "Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification"

Language:PythonStargazers:144Issues:0Issues:0

DeWave

Exploration on introducing discrete codex and raw wave decoding to realize Brain-to-Text translation.

Stargazers:136Issues:0Issues:0

swift

ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 40+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:2178Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:7777Issues:0Issues:0

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Language:PythonStargazers:1034Issues:0Issues:0

RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Language:PythonLicense:Apache-2.0Stargazers:211Issues:0Issues:0

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:5499Issues:0Issues:0