liu shuanghong's repositories
DSCNet
Pytorch Implement of Dynamic Snake Convolution (ICCV2023)
UniRepLKNet
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Agent-Attention
Official repository of Agent Attention
objectdetection_script
一些关于目标检测的脚本的改进思路代码,详细请看readme.md
EMA-attention-module
Implementation Code for the ICCASSP 2023 paper " Efficient Multi-Scale Attention Module with Cross-Spatial Learning" and is available at: https://arxiv.org/abs/2305.13563v2
wandb
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
lightning
Deep learning framework to train, deploy, and ship AI products Lightning fast.
3D-Speaker
A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
ACA-Net
Pytorch Implementation of ACA-Net for Speaker Verification
ODConv
The official project website of "Omni-Dimensional Dynamic Convolution" (ODConv for short, spotlight in ICLR 2022).
DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
DS-TDNN
Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
NeMo
NeMo: a toolkit for conversational AI
speechbrain
A PyTorch-based Speech Toolkit
PEL4VAD
Official code for "Learning Prompt-Enhanced Context features for Weakly-Supervised Video Anomlay Detection"
cluster-analysis
K-Means++(HCM), Fuzzy C-Means(FCM), Hierarchical Clustering, DBscan
FunASR
A Fundamental End-to-End Speech Recognition Toolkit
CBAM.PyTorch
Non-official implement of Paper:CBAM: Convolutional Block Attention Module
wespeaker
Research and Production Oriented Speaker Recognition Toolkit
s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
RepRFN
Reparameterized Residual Feature Network For Lightweight Image Super-Resolution