Okan Köpüklü (okankop)

okankop

Geek Repo

Company:Technical University of Munich

Home Page:okankop.github.io

Github PK Tool:Github PK Tool

Okan Köpüklü's starred repositories

nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Language:PythonLicense:MITStargazers:13806Issues:284Issues:2067

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:5573Issues:32Issues:130

ffcv

FFCV: Fast Forward Computer Vision (and other ML workloads!)

Language:PythonLicense:Apache-2.0Stargazers:2759Issues:20Issues:269

lazypredict

Lazy Predict help build a lot of basic models without much code and helps understand which models works better without any parameter tuning

Language:PythonLicense:MITStargazers:2731Issues:29Issues:115

Resemblyzer

A python package to analyze and compare voices with deep learning

Language:PythonLicense:Apache-2.0Stargazers:2617Issues:71Issues:79

bolt

10x faster matrix and vector operations

Language:C++License:MPL-2.0Stargazers:2467Issues:47Issues:34

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2115Issues:45Issues:384

DIG

A library for graph deep learning research

Language:PythonLicense:GPL-3.0Stargazers:1792Issues:31Issues:203

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonLicense:MITStargazers:1340Issues:44Issues:211

Deep-Learning-In-Production

Build, train, deploy, scale and maintain deep learning models. Understand ML infrastructure and MLOps using hands-on examples.

Language:Jupyter NotebookStargazers:1083Issues:32Issues:5

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonLicense:MITStargazers:890Issues:11Issues:104

4D-Facial-Avatars

Dynamic Neural Radiance Fields for Monocular 4D Facial Avater Reconstruction

MaskGIT-pytorch

Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Language:PythonLicense:MITStargazers:387Issues:14Issues:18

VGG-Speaker-Recognition

Utterance-level Aggregation For Speaker Recognition In The Wild

RawNet

Official repository for RawNet, RawNet2, and RawNet3

Language:PythonLicense:MITStargazers:335Issues:14Issues:32

Text-to-sound-Synthesis

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

Language:Jupyter NotebookLicense:MITStargazers:299Issues:8Issues:21

pyaec

simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link adaptive filters)、frequency domain adaptive filters(frequency domain adaptive filter、frequency domain kalman filter) for acoustic echo cancellation.

Language:PythonLicense:Apache-2.0Stargazers:285Issues:5Issues:4

dytox

Dynamic Token Expansion with Continual Transformers, accepted at CVPR 2022

Language:PythonLicense:Apache-2.0Stargazers:134Issues:3Issues:25

IRM-based-Speech-Enhancement-using-LSTM

Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM

Language:PythonLicense:MITStargazers:112Issues:3Issues:5

DARCN

The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"

DNN-based-Speech-Enhancement-in-the-frequency-domain

DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping method.

Language:PythonLicense:MITStargazers:48Issues:2Issues:10

octuplet-loss

Repo for our Paper: Octuplet Loss: Make Your Face Recognition Model Robust to Image Resolution

Language:PythonLicense:MITStargazers:43Issues:1Issues:8

GWA

Geometric-Wave Acoustic dataset

Language:PythonLicense:CC-BY-4.0Stargazers:42Issues:2Issues:2

synthehicle

[WACVW 2023] A massive synthetic dataset for 3D multi-target multi-camera tracking and segmentation.

GaitGraph2

Official code for "Towards a Deeper Understanding of Skeleton-based Gait Recognition" (CVPRW'22)

Object-Detection-Confidence-Bias

Code for "The Box Size Confidence Bias Harms Your Object Detector" (https://arxiv.org/abs/2112.01901)

Language:PythonLicense:MIT-0Stargazers:27Issues:0Issues:0

x-face-verification

Repo for our Paper: Explainable Model-Agnostic Similarity and Confidence in Face Verification

driver-gaze-yolov5

This is the repo for the work "Where and What: Driver Attention-based Object Detection".

Language:PythonLicense:MITStargazers:10Issues:0Issues:0

german-corpus-aligned

Alignments from CTC segmentation on Librispeech and Spoken Wikipedia Corpus

Language:PythonStargazers:7Issues:3Issues:0