Beast code in Giters

Neu-Robin1993's starred repositories

d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Language:PythonNOASSERTION22055 401 287

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.017194 154 1329

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonBSD-3-Clause8114 101 1154

Time-Series-Library

A Library for Advanced Deep Time Series Models.

Language:PythonMIT4714 57 381

pytorch-template

PyTorch deep learning projects made easy.

Language:PythonMIT4593 55 63

mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language:PythonApache-2.03980 43 1342

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonBSD-3-Clause2960 60 86

latex_paper_writing_tips

Tips for Writing a Research Paper using LaTeX

Language:TeX2147 220

HigherHRNet-Human-Pose-Estimation

This is an official implementation of our CVPR 2020 paper "HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation" (https://arxiv.org/abs/1908.10357)

Language:PythonMIT1299 47 113

ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"

Language:PythonApache-2.01218 22 130

plug-and-play

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Language:Python849 10 16

HGNN

Hypergraph Neural Networks (AAAI 2019)

Language:PythonMIT628 6 21

representation-engineering

Representation Engineering: A Top-Down Approach to AI Transparency

Language:Jupyter NotebookMIT592 29 35

Awesome-Papers-Autonomous-Agent

A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.

457 8 1

SAM-Med3D

SAM-Med3D: An Efficient General-purpose Promptable Segmentation Model for 3D Volumetric Medical Image

Language:PythonApache-2.0392 4 60

bigdetection

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Language:PythonApache-2.0381 8 9

Awesome-Skeleton-based-Action-Recognition

A curated paper list of awesome skeleton-based action recognition.

MIT339 13 2

adapt-image-models

[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition

Language:PythonApache-2.0248 1 47

awesome-SOTA-FER

A curated list of facial expression recognition in both 7-emotion classification and affect estimation.

170 5 2

Count-Anything

This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.

Language:Python117 3 5