Roman Solovyev's repositories
Weighted-Boxes-Fusion
Set of methods to ensemble boxes from different object detection models, including implementation of "Weighted boxes fusion (WBF)" method.
MVSEP-MDX23-music-separation-model
Model for MDX23 music separation contest
Music-Source-Separation-Training
Repository for training models for music source separation.
Verilog-Generator-of-Neural-Net-Digit-Detector-for-FPGA
Verilog Generator of Neural Net Digit Detector for FPGA
Keras-RetinaNet-for-Open-Images-Challenge-2018
Code for 15th place in Kaggle Google AI Open Images - Object Detection Track
volumentations
Library for 3D augmentations
classification_models_3D
Set of models for classifcation of 3D volumes
MobileNet-in-FPGA
Generator of verilog description for FPGA MobileNet implementation
Mean-Average-Precision-for-Boxes
Function to calculate mAP for set of detected boxes and annotated boxes.
segmentation_models_3D
Set of models for segmentation of 3D volumes
MVSEP-CDX23-Cinematic-Sound-Demixing
Model for CDX23 (Cinematic Sound Demixing) contest
segmentation_models_pytorch_3d
Segmentation models for 3D data with different backbones. PyTorch.
classification_models_1D
Classification models 1D Zoo - Keras and TF.Keras
VOTS2023-Challenge-Tracker
Code for VOTS2023 Challenge tracker
DrivenData-Open-AI-Caribbean-Challenge-2nd-place-solution
Code for DrivenData Open AI Caribbean Challenge. 2nd place solution.
albumentations
fast image augmentation library and easy to use wrapper around other libraries
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Post-Training-Integer-Quantization
Some examples of quantization process
DTTNet-Pytorch
Dual-Path TFC-TDF UNet for Music Source Separation
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Zero_Shot_Audio_Source_Separation
The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022
ai-audio-startups
Community list of startups working with AI in audio and music technology
DATASET_RISCV
Датасет реализаций процессорной архитектуры RISC-V.
OpenLane
OpenLane+DREAMPlace (PL_DREAMPLACE_GLB_PLACEMENT in config.json) option for global placement. OpenLane is an automated RTL to GDSII flow based on several components including OpenROAD, Yosys, Magic, Netgen and custom methodology scripts for design exploration and optimization.
pytorch-spice-cnn
CNN for prediction IR-drop of integrated circuits.
sdx-submissions
Sound Demixing Challenge Submission Repo