Y445n9's starred repositories
Awesome-Vision-Mamba-Models
[Official Repo] A Survey on Vision Mamba: Models, Applications and Challenges
Tool-Planner
Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering
DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
tuning_playbook_zh_cn
一本系统地教你将深度学习模型的性能最大化的战术手册。
LLMAgentPapers
Must-read Papers on LLM Agents.
LLM-Agents-Papers
A repo lists papers related to LLM based agent
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
facexformer
Official implementation of FaceXFormer: A Unified Transformer for Facial Analysis
vfe.pytorch
Video Feature Enhancement with PyTorch
TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
AmadeusGPT
[NeurIPS 2023] We turn natural language descriptions of behaviors into machine-executable code
multimodal-across-domains-gaze-target-detection
Official repo of "Multimodal Across Domains Gaze Target Detection" @ ICMI 2022
object-aware-gaze-target-detection
Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)
human-gaze-target-detection-transformer
An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers"
End-to-End-Human-Gaze-Target-Detection-with-Transformers
Unofficial Realization of paper