Alexander Lekomtsev (JI411)

Company: Novosibirsk State University

Location: Novosibirsk

Home Page: t.me/JI_411

Alexander Lekomtsev's repositories

fuzzy-doc-search

OCR for scanned PDFs and fuzzy string search in XLSX/PDF files

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

copy-paste-aug

Copy-paste augmentation for segmentation and detection tasks

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

gbr-yolov5-metric

Adds an F2-score metric to the YOLOv5 model

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

R-CenterNet

Detector for rotated objects based on CenterNet

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

reef-solution

reef-solution for upload

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

retinanet-examples

Fast and accurate object detection with end-to-end GPU optimization

License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

rotation-yolov5

Rotation detection based on YOLOv5

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

sahi

A lightweight vision library for performing large-scale object detection and instance segmentation.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

speech_analytics

Speech analytics package for call centers

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

svoice

A PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", which presents a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The method employs gated neural networks that are trained to separate the voices at multiple processing steps while keeping the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is used to select the actual number of speakers in a given sample. The authors report that the method greatly outperforms the current state of the art, which, as they show, is not competitive for more than two speakers.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

todo.md

TODO.md file format - todomd.org

Stargazers: 0 | Issues: 0 | Issues: 0

yolact

A simple, fully convolutional model for real-time instance segmentation.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

yolo-tiling

Tile (slice) a YOLO dataset for small-object detection

Stargazers: 0 | Issues: 0 | Issues: 0

yolor

Implementation of the paper "You Only Learn One Representation: Unified Network for Multiple Tasks" (https://arxiv.org/abs/2105.04206)

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

yolov7-face

YOLOv7 face detection with landmarks

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0