Alexander Lekomtsev's repositories
fuzzy-doc-search
OCR scanned pdf and fuzzy string search in xlsx/pdf
copy-paste-aug
Copy-paste augmentation for segmentation and detection tasks
gbr-yolov5-metric
add f2-score to yolov5 model
R-CenterNet
detector for rotated-object based on CenterNet/基于CenterNet的旋转目标检测
reef-solution
reef-solution for upload
retinanet-examples
Fast and accurate object detection with end-to-end GPU optimization
rotation-yolov5
rotation detection based on yolov5
sahi
A lightweight vision library for performing large scale object detection/ instance segmentation.
speech_analytics
Speech analytics package for call-center
svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
todo.md
TODO.md file format - todomd.org
yolact
A simple, fully convolutional model for real-time instance segmentation.
yolo-tiling
Tile (Slice) YOLO Dataset for Small Objects Detection
yolor
implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks (https://arxiv.org/abs/2105.04206)
yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
yolov7-face
yolov7 face detection with landmark