fanOfJava's repositories
MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
rawnet2-antispoofing
This repository includes the code to reproduce our paper "End-to-end anti-spoofing with RawNet2" (https://arxiv.org/abs/2011.01108) published in ICASSP '21.
aasist
Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"
benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
RawGAT-ST-antispoofing
This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection" (https://arxiv.org/abs/2107.12710) published in the ASVspoof 2021 workshop.
AIR-ASVspoof
Official implementation of the paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
PaddleClas
A treasure chest for visual recognition powered by PaddlePaddle
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
asv-subtools
An Open Source Tools for Speaker Recognition
Audio-Classification
Pytorch code for "Rethinking CNN Models for Audio Classification"
pytorch-distributed
A quickstart and benchmark for pytorch distributed training.
fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
plda
Probabilistic Linear Discriminant Analysis & classification, written in Python.
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
awesome-speech-recognition-speech-synthesis-papers
Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
SSDG-CVPR2020
Single-Side Domain Generalization for Face Anti-Spoofing, CVPR2020
NeuralPlda
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
faiss
A library for efficient similarity search and clustering of dense vectors.
leetcode
Python & JAVA Solutions for Leetcode
mmdetection
Open MMLab Detection Toolbox and Benchmark
detectron2
Detectron2 is FAIR's next-generation research platform for object detection and segmentation.
CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
torch-stft
An STFT/iSTFT for PyTorch.
netron
Visualizer for neural network, deep learning and machine learning models
PAN.pytorch
A unofficial pytorch implementation of PAN(PSENet2): Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network