Yipeng Jiang's starred repositories

EasySpider

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

Language:JavaScriptLicense:NOASSERTIONStargazers:34638Issues:224Issues:502

sing-box

The universal proxy platform

Language:GoLicense:NOASSERTIONStargazers:18849Issues:143Issues:1672

hiddify-next

Multi-platform auto-proxy client, supporting Sing-box, X-ray, TUIC, Hysteria, Reality, Trojan, SSH etc. It’s an open-source, secure and ad-free.

Language:DartLicense:NOASSERTIONStargazers:16049Issues:134Issues:1069

yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Language:PythonLicense:AGPL-3.0Stargazers:9567Issues:50Issues:398

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:6148Issues:58Issues:1106

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language:PythonLicense:GPL-3.0Stargazers:3805Issues:31Issues:618

Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:3391Issues:29Issues:154

stt

Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式

Language:PythonLicense:GPL-3.0Stargazers:2196Issues:11Issues:79

splat

WebGL 3D Gaussian Splat Viewer

Language:JavaScriptLicense:MITStargazers:1885Issues:27Issues:48

OpenPano

Automatic Panorama Stitching From Scratch

Language:C++License:MITStargazers:1869Issues:98Issues:130

DocRes

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Language:PythonLicense:MITStargazers:295Issues:6Issues:16

DocDiff

ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.

Language:PythonLicense:MITStargazers:223Issues:4Issues:34

LW-DETR

This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".

Language:PythonLicense:Apache-2.0Stargazers:220Issues:12Issues:23

video-mamba-suite

The suite of modeling video with Mamba

Language:PythonLicense:MITStargazers:218Issues:3Issues:18

wild-gaussian-splatting

DUSt3R + Gaussian Splatting

Language:Jupyter NotebookStargazers:185Issues:3Issues:4

UDIS2

ICCV2023 - Parallax-Tolerant Unsupervised Deep Image Stitching (UDIS++)

Language:PythonLicense:Apache-2.0Stargazers:160Issues:5Issues:33

TrOCR-Seal-Recognition

基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用

nd-Mamba2-torch

Only implemented through torch: "bi - mamba2" , "vision- mamba2 -torch". support 1d/2d/3d/nd and support export by jit.script/onnx;

MooER

MooER: Open-sourced LLM for audio understanding trained on 80,000 hours of data

Language:PythonLicense:NOASSERTIONStargazers:118Issues:4Issues:11

VSSD

Introduce Mamba2 to Vision.

DepictQA

DepictQA: Depicted Image Quality Assessment with Vision Language Models

Language:PythonLicense:Apache-2.0Stargazers:68Issues:0Issues:14

ImageAnalysisService

轻量模型的图像分析web服务,包括倾斜矫正OCR,公章(印章)检测+识别,车牌识别。api方案使用FastAPI+Gunicorn,提供gradio展示。

SC-4DGS

The offical code repository for SC-4DGS.

YOLO-MIF

YOLO-MIF is an improved version of YOLOv8 for object detection in gray-scale images, incorporating multi-information fusion to enhance detection accuracy. The detection of RGBT mode is also added. YOLO-MIF是在灰度图像中进行目标检测的改进型YOLOv8模型,引入了多信息融合策略,提高了检测准确性。 并添加了RGBT模式的检测。

Language:PythonLicense:MITStargazers:35Issues:2Issues:3
Language:PythonLicense:MITStargazers:26Issues:0Issues:0

baseboostdepth

[BMVC'24] BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation

Language:PythonLicense:NOASSERTIONStargazers:24Issues:0Issues:0

msvm-unet

The official codes for the work "MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation".

SlowFast-Meet-ViT

We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches 26.62%, and if we directly use officially provided chaos_test_1fps.csv as the results of object detection, the mAP reaches 42.28%.

Language:PythonLicense:Apache-2.0Stargazers:7Issues:0Issues:0

seal_project

印章检测和印章文字识别

Language:PythonStargazers:3Issues:0Issues:0

TrOCR-Seal-Recognition

基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用

Language:PythonStargazers:1Issues:0Issues:0