Okan Köpüklü's starred repositories
TrackTacular
Official Code for "Lifting Multi-View Detection and Tracking to the Bird’s Eye View"
acoustic-simulator
Implementation of audio degradation processes
Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
IoU-AwareCalibration
Code to reproduce the experiments described in "Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration" (https://arxiv.org/pdf/2309.03110.pdf)
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
tum-traffic-dataset-dev-kit
TUM Traffic Dataset Development Kit
livefaceidapp
Simple Live Face Recognition Streamlit App
Audio-Super-Resolution-ViT
This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.
Context-Cluster
[ICLR 2023 Oral] Image as Set of Points