88aggressive

0

followers

following

stars

@Xinjiang University

liu shuanghong's repositories

DSCNet

Pytorch Implement of Dynamic Snake Convolution (ICCV2023)

000

UniRepLKNet

UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Apache-2.0000

Agent-Attention

Official repository of Agent Attention

000

objectdetection_script

一些关于目标检测的脚本的改进思路代码，详细请看readme.md

000

EMA-attention-module

Implementation Code for the ICCASSP 2023 paper " Efficient Multi-Scale Attention Module with Cross-Spatial Learning" and is available at: https://arxiv.org/abs/2305.13563v2

000

wandb

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

MIT000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT000

External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

MIT100

pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

MIT000

lightning

Deep learning framework to train, deploy, and ship AI products Lightning fast.

Apache-2.0000

3D-Speaker

A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.

Apache-2.0000

ACA-Net

Pytorch Implementation of ACA-Net for Speaker Verification

MIT000

ODConv

The official project website of "Omni-Dimensional Dynamic Convolution" (ODConv for short, spotlight in ICLR 2022).

Apache-2.0000

DAT

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Apache-2.0000

DS-TDNN

Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch

000

NeMo

NeMo: a toolkit for conversational AI

Apache-2.0000

speechbrain

A PyTorch-based Speech Toolkit

Apache-2.0000

PEL4VAD

Official code for "Learning Prompt-Enhanced Context features for Weakly-Supervised Video Anomlay Detection"

MIT000

cluster-analysis

K-Means++(HCM), Fuzzy C-Means(FCM), Hierarchical Clustering, DBscan

000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit

NOASSERTION000

CBAM.PyTorch

Non-official implement of Paper：CBAM: Convolutional Block Attention Module

000

wespeaker

Research and Production Oriented Speaker Recognition Toolkit

Apache-2.0000

s3prl

Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)

Apache-2.0000

SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Apache-2.0000

Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

MIT000

AV-Sepformer

000

EfficientConformer

[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Apache-2.0000

EEND_PyTorch

A PyTorch implementation of End-to-End Neural Diarization

MIT000

RepRFN

Reparameterized Residual Feature Network For Lightweight Image Super-Resolution

MIT000

EEND-1

000