wjchen's starred repositories
C-Plus-Plus
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
tensorRT_Pro
C++ library based on tensorrt integration
ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
IQA-PyTorch
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
ai-audio-startups
Community list of startups working with AI in audio and music technology
Shift-AI-models-to-real-world-products
Share some useful guides and references about how to shift AI models to real world products or projects.
leaf-audio
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a very small number of parameters.
Awesome-Denoise
One-paper-one-short-contribution-summary of all latest image/burst/video Denoising papers with code & citation published in top conference and journal.
bisenetv2-tensorflow
Unofficial tensorflow implementation of real-time scene image segmentation model "BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation"
TensorFlow_Engineering_Implementation
The source code and dataset about <Deep Learning - Best Practices on TensorFlow Engineering Implementation>
CoDA_NeurIPS2023
Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
SimpNet-Deep-Learning-in-a-Shader
A trainable convolutional neural network inside a fragment shader
Awesome-Bandwidth-Extension
This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purpose of this repo is to organize the world’s resources for speech bandwidth extension, and make them universally accessible and useful.
ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
fsrcnn-mpv
FSRCNN, implemented as an mpv hook
ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications