guojietian's repositories
aitviewer
A set of tools to visualize and interact with sequences of 3D data.
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
ChatGPT-Shortcut
让生产力加倍的 ChatGPT 快捷指令,按照领域和功能分区,可对提示词进行标签筛选、关键词搜索和一键复制。
ClothWild_RELEASE
This repo is official PyTorch implementation of 3D Clothed Human Reconstruction in the Wild (ECCV 2022).
CodeProject.AI-Server
CodeProject SenseAI is a self contained service that software developers can include in, and distribute with, their applications in order to augment their apps with the power of AI.
CogVideo
Text-to-video generation.
esp-csi
Applications based on Wi-Fi CSI (Channel state information), such as indoor positioning, human detection
EVA
EVA Series: Vision Foundation Model Fanatics from BAAI
gjtjx
Config files for my GitHub profile.
GLAMR
[CVPR 2022 Oral] Official PyTorch Implementation of "GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras”.
Hand4Whole_RELEASE
Official PyTorch implementation of "Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation", CVPRW 2022 (Oral.)
kubric
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
L2CS-Net
The official PyTorch implementation of L2CS-Net: Fine-Grained Gaze Estimation in Unconstrained Environments
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
RegAD
[ECCV2022 Oral] Registration based Few-Shot Anomaly Detection
tinyengine
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
videoCC-data
VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automatic pipeline starting from the Conceptual Captions Image-Captioning Dataset.
videodl
Videodl: A lightweight video downloader written by pure python.
ViTPose
PyTorch implementation of ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
WuDaoMM
WuDaoMM this is a data project
yolact
A simple, fully convolutional model for real-time instance segmentation.
yoloair
🔥🔥🔥YOLOv5, YOLOv6, YOLOv7, PPYOLOE, YOLOX, YOLOR, YOLOv4, YOLOv3, PPYOLO, PPYOLOv2, Transformer, Attention, TOOD and Improved-YOLOv5-YOLOv7... Support to improve backbone, neck, head, loss, IoU, NMS and other modules🚀
yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
yolov7
🔥🔥🔥🔥 YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥
yolov7-pose
Deploy yolov7-pose TensorRT for Windows