deep_shf's repositories
FacePose_pytorch
🔥🔥 A PyTorch implementation of head pose estimation (yaw, roll, pitch) and emotion detection with SOTA performance in real time. Easy to deploy, easy to use, and highly accurate. Solves all face detection problems at once. (Minimal, fast, and efficient: that is our motto.)
HairMapper
HairMapper: Removing Hair from Portraits Using GANs
TextLogoLayout
[CVPR 2022] Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
APTM
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
ARKitTrack
PyTorch implementation of ARKitTrack for the CVPR 2023 paper "ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data", by Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu. Code will be released here.
BiSeNet
My implementation of BiSeNet, with BiSeNetV2 added.
CPlusPlusThings
Things about C++
Fatigue-Driven-Detection-Based-on-CNN
Undergraduate graduation project: fatigue driving detection based on convolutional neural networks.
chat2KnowL
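Knowledge document Q&A: chat with your documents using a large language model. Provides AI analysis, reading, and Q&A tools to help you quickly understand document content.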
Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
insightface
State-of-the-art 2D and 3D Face Analysis Project
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
lite.ai.toolkit
🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOv5, YOLOX, YOLOP, YOLOv6, YOLOR, MODNet, YOLOv7, YOLOv8.
MaskFaceTool
This project adds masks to facial datasets. It is based on FMA-3D and builds an effective, easy-to-operate, and efficient pipeline for face detection, alignment, and mask wearing.
mPLUG-Owl
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
nniefacelib
nniefacelib is a face algorithm library that runs on HiSilicon 35xx series chips.
onnx2tflite
A tool for converting ONNX to Keras or ONNX to TFLite. If the tool is useful to you, please star it.
OPERA
[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
revTongYi
Reverse-engineered Python API for Alibaba Cloud's Tongyi Qianwen and Tongyi Wanxiang
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
SHIKE
Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation (CVPR 2023)
Simple-TensorRT
A wrapper around the NVIDIA TensorRT interface that simplifies the calling process
tensorRT_Pro
C++ library based on TensorRT integration
ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
VisualGLM-6B
Chinese and English multimodal conversational language model
YOLOv6-NCNN
Deploy YOLOv6 with NCNN
YoloV7-ncnn-Raspberry-Pi-4
YoloV7 for a bare Raspberry Pi using ncnn.