deep_shf's repositories

FacePose_pytorch

🔥🔥The pytorch implement of the head pose estimation(yaw,roll,pitch) and emotion detection with SOTA performance in real time.Easy to deploy, easy to use, and high accuracy.Solve all problems of face detection at one time.(极简,极快,高效是我们的宗旨)

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

HairMapper

HairMapper: Removing Hair from Portraits Using GANs

Language:PythonStargazers:1Issues:0Issues:0

TextLogoLayout

[CVPR 2022] Aesthetic Text Logo Synthesis via Content-aware Layout Inferring

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Language:PythonLicense:GPL-3.0Stargazers:1Issues:0Issues:0

3D-Box-Segment-Anything

We extend Segment Anything to 3D perception by combining it with VoxelNeXt.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

APTM

The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"

Language:PythonStargazers:0Issues:0Issues:0

ARKitTrack

PyTorch implementation of ARKitTrack for CVPR'2023 paper "ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data", by Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu. Code will be released here.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

BiSeNet

Add bisenetv2. My implementation of BiSeNet

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CPlusPlusThings

C++那些事

Language:C++Stargazers:0Issues:0Issues:0

Fatigue-Driven-Detection-Based-on-CNN

本科毕设内容:基于卷积神经网络的疲劳驾驶检测。

Language:PythonStargazers:0Issues:0Issues:0

chat2KnowL

知识文档问答,用大模型与文档对话,提供Al分析、阅读、问答工具,助你快速了解文档内容。

License:MITStargazers:0Issues:0Issues:0

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

Stargazers:0Issues:0Issues:0

lite.ai.toolkit

🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOv5, YOLOX, YOLOP, YOLOv6, YOLOR, MODNet, YOLOX, YOLOv7, YOLOv8. MNN, NCNN, TNN, ONNXRuntime.

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

MaskFaceTool

This project aims to add masks to the facial dataset, which is based on FMA-3D and constructs a effective, easy to operate, and efficient pipeline for facial detection, alignment, and mask wearing.

License:Apache-2.0Stargazers:0Issues:0Issues:0

mPLUG-Owl

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

License:MITStargazers:0Issues:0Issues:0

nniefacelib

nniefacelib是一个在海思35xx系列芯片上运行的人脸算法库

Language:CLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

onnx2tflite

Tool for onnx->keras or onnx->tflite. If tool is useful for you, please star it.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

OPERA

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

License:MITStargazers:0Issues:0Issues:0

revTongYi

阿里云 通义千问、通义万相 逆向工程 Python API

License:AGPL-3.0Stargazers:0Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SHIKE

Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation (CVPR 2023)

Language:PythonStargazers:0Issues:0Issues:0

Simple-TensorRT

Secondary encapsulation of NVIDIA TensorRT interface to simplify the calling process

Stargazers:0Issues:0Issues:0

tensorRT_Pro

C++ library based on tensorrt integration

License:MITStargazers:0Issues:0Issues:0

ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

License:Apache-2.0Stargazers:0Issues:0Issues:0

VisualGLM-6B

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

YOLOv6-NCNN

Deploy YOLOv6 by NCNN

Language:C++Stargazers:0Issues:0Issues:0

YoloV7-ncnn-Raspberry-Pi-4

YoloV7 for a bare Raspberry Pi using ncnn.

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0