Beast code in Giters

Okan Köpüklü's starred repositories

nni

An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning.

Language: Python; License: MIT; 13806 284 2067

ConvNeXt

Code release for ConvNeXt model

Language: Python; License: MIT; 5573 32 130

ffcv

FFCV: Fast Forward Computer Vision (and other ML workloads!)

Language: Python; License: Apache-2.0; 2759 20 269

lazypredict

Lazy Predict helps build many basic models with little code and helps you understand which models work better without any parameter tuning.

Language: Python; License: MIT; 2731 29 115

Resemblyzer

A Python package to analyze and compare voices with deep learning

Language: Python; License: Apache-2.0; 2617 71 79

bolt

10x faster matrix and vector operations

Language: C++; License: MPL-2.0; 2467 47 34

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python; License: Apache-2.0; 2115 45 384

DIG

A library for graph deep learning research

Language: Python; License: GPL-3.0; 1792 31 203

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language: Python; License: MIT; 1340 44 211

Deep-Learning-In-Production

Build, train, deploy, scale and maintain deep learning models. Understand ML infrastructure and MLOps using hands-on examples.

Language: Jupyter Notebook; 1083 32 5

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language: Python; License: MIT; 890 11 104

4D-Facial-Avatars

Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction

Language: Python; 666 28 61

MaskGIT-pytorch

PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Language: Python; License: MIT; 387 14 18

VGG-Speaker-Recognition

Utterance-level Aggregation For Speaker Recognition In The Wild

Language: Python; 364 17 63

RawNet

Official repository for RawNet, RawNet2, and RawNet3

Language: Python; License: MIT; 335 14 32

Text-to-sound-Synthesis

The source code of our paper "Diffsound: Discrete Diffusion Model for Text-to-Sound Generation"

Language: Python; 334 17 27

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features, which enables a more efficient way of separating sources from mixtures.

Language: Jupyter Notebook; License: MIT; 299 8 21

pyaec

Simple and efficient Python implementation of a series of adaptive filters for acoustic echo cancellation, including time-domain adaptive filters (LMS, NLMS, RLS, AP, Kalman), nonlinear adaptive filters (Volterra filter, functional link adaptive filters), and frequency-domain adaptive filters (frequency-domain adaptive filter, frequency-domain Kalman filter).

Language: Python; License: Apache-2.0; 285 5 4

okankop

sudo_rm_rf

dytox

IRM-based-Speech-Enhancement-using-LSTM

DARCN

DNN-based-Speech-Enhancement-in-the-frequency-domain

octuplet-loss

GWA

synthehicle

GaitGraph2

Object-Detection-Confidence-Bias

x-face-verification

driver-gaze-yolov5

german-corpus-aligned