Neu-Robin1993's starred repositories

d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Language: Python | License: NOASSERTION | Stargazers: 22055 | Issues: 401 | Issues: 287

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 17194 | Issues: 154 | Issues: 1329

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Language: Python | License: BSD-3-Clause | Stargazers: 8114 | Issues: 101 | Issues: 1154

Time-Series-Library

A Library for Advanced Deep Time Series Models.

Language: Python | License: MIT | Stargazers: 4714 | Issues: 57 | Issues: 381

pytorch-template

PyTorch deep learning projects made easy.

Language: Python | License: MIT | Stargazers: 4593 | Issues: 55 | Issues: 63

mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 3980 | Issues: 43 | Issues: 1342

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 2960 | Issues: 60 | Issues: 86

latex_paper_writing_tips

Tips for Writing a Research Paper using LaTeX

Language: TeX | Stargazers: 2147 | Issues: 22 | Issues: 0

HigherHRNet-Human-Pose-Estimation

This is an official implementation of our CVPR 2020 paper "HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation" (https://arxiv.org/abs/1908.10357)

Language: Python | License: MIT | Stargazers: 1299 | Issues: 47 | Issues: 113

ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"

Language: Python | License: Apache-2.0 | Stargazers: 1218 | Issues: 22 | Issues: 130

plug-and-play

Official PyTorch implementation of “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

HGNN

Hypergraph Neural Networks (AAAI 2019)

Language: Python | License: MIT | Stargazers: 628 | Issues: 6 | Issues: 21

representation-engineering

Representation Engineering: A Top-Down Approach to AI Transparency

Language: Jupyter Notebook | License: MIT | Stargazers: 592 | Issues: 29 | Issues: 35

Awesome-Papers-Autonomous-Agent

A collection of recent papers on building autonomous agents. Two topics are included: RL-based and LLM-based agents.

SAM-Med3D

SAM-Med3D: An Efficient General-purpose Promptable Segmentation Model for 3D Volumetric Medical Image

Language: Python | License: Apache-2.0 | Stargazers: 392 | Issues: 4 | Issues: 60

bigdetection

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 381 | Issues: 8 | Issues: 9

Awesome-Skeleton-based-Action-Recognition

A curated paper list of awesome skeleton-based action recognition.

adapt-image-models

[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition

Language: Python | License: Apache-2.0 | Stargazers: 248 | Issues: 1 | Issues: 47

awesome-SOTA-FER

A curated list of facial expression recognition in both 7-emotion classification and affect estimation.

Count-Anything

This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.

infogcn

Official implementation for "InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition"

MMTracking_Tutorials

Jupyter notebook tutorials for MMTracking

Language: Jupyter Notebook | Stargazers: 96 | Issues: 4 | Issues: 4

MMPD_rPPG_dataset

MMPD: Multi-Domain Mobile Video Physiology Dataset (EMBC 2023 Oral)

Language: Python | License: MIT | Stargazers: 95 | Issues: 1 | Issues: 3

PhysBench

Simple, fast, and fair evaluation of remote physiological sensing models

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 69 | Issues: 2 | Issues: 11

Hyperformer

This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition."

Language: Python | License: Apache-2.0 | Stargazers: 65 | Issues: 4 | Issues: 16

TAILOR

PyTorch implementation of "Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition"

Language: Python | License: MIT | Stargazers: 49 | Issues: 0 | Issues: 0

SMG

SMG source code and dataset

Language: Python | Stargazers: 12 | Issues: 0 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 11 | Issues: 1 | Issues: 1

LXD_Build

This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among different users.

Language: Shell | Stargazers: 5 | Issues: 2 | Issues: 0

Multimodal-Driver-State-Modeling-Through-Unsupervised-Learning

Multimodal Driver State Modeling Through Unsupervised Learning

Language: Jupyter Notebook | Stargazers: 2 | Issues: 0 | Issues: 0