KevenLee's repositories
benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
ChatSim
[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn推理 , dbnet(1.7M) + crnn(6.3M) + anglenet(1.5M) 总模型仅10M
FudanOCR
A toolbox of scene text super-resolution and recognition
image-comparer
image comparer - powered by Electron
kohya-trainer
Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
model-compression
model compression based on pytorch (1、quantization: 16/8/4/2 bits(dorefa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、ternary/binary value(twn/bnn/xnor-net);2、 pruning: normal、regular and group convolutional channel pruning;3、 group convolution structure;4、batch-normalization folding for quantization)
munkres-cpp
Kuhn-Munkres (Hungarian) Algorithm in C++
MVPbev
[ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability"
ngx_healthcheck_module
nginx module for upstream servers health check. support stream and http upstream. 该模块可以为Nginx提供主动式后端服务器健康检查的功能(同时支持四层和七层后端服务器的健康检测)
OpenLane-V2
[NeurIPS 2023 Track Datasets and Benchmarks] OpenLane-V2: The First Perception and Reasoning Benchmark for Road Driving
panacea
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
PerlDiff
PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
QualityScaler
Image/video deeplearning upscaler app for Windows - BRSGAN & RealSR_JPEG
TopoMLP
[ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning
trocr-chinese
transformers ocr for chinese
XVFI
Official repository of XVFI
yolo-tensorrt
darknet -> tensorrt. YoloV4 YoloV3 use raw darknet *.weights and *.cfg fils. If the wrapper is useful to you,please Star it.
YOLOv5-Multibackbone-Compression
YOLOv5 Series Multi-backbone(TPH-YOLOv5, Ghostnet, ShuffleNetv2, Mobilenetv3Small, EfficientNetLite, PP-LCNet, SwinTransformer YOLO), Module(CBAM, DCN), Pruning (EagleEye, Network Slimming) and Quantization (MQBench) Compression Tool Box.
yolov5-tensorrt
A tensorrt implementation of yolov5: https://github.com/ultralytics/yolov5
Yolox_augment
Add some features to yolox