wgansir's starred repositories
AI-Job-Notes
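A job-hunting guide for AI algorithm positions (covering preparation strategies, coding-practice guides, internal referrals, a list of AI companies, and other resources)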
SenseVoice
Multilingual Voice Understanding Model
speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
CNNDetection
Code for the paper: CNN-generated images are surprisingly easy to spot... for now https://peterwang512.github.io/CNNDetection/
AIGCDetectBaseline
AIGCDetectBaseline
detectree2
Python package for automatic tree crown delineation based on the Detectron2 implementation of Mask R-CNN
supervoice-separate
Supervoice Speaker Separation Network
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Bert-VITS2
vits2 backbone with multilingual-bert
Variations-of-SFANet-for-Crowd-Counting
The official implementation of "Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting"
Rethinking-Counting
[CVPR 2022] Rethinking Spatial Invariance of Convolutional Networks for Object Counting
neural-style-pytorch
Neural Style implementation in PyTorch! 🎨
neural-style-pytorch
A fast PyTorch implementation of "A Neural Algorithm of Artistic Style"
datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
google-images-download
Python script to download hundreds of images from 'Google Images'. It is ready-to-run code!
stable-diffusion-webui
Stable Diffusion web UI