Steve Tang/Yuwu Tang (SteveTanggithub)



Company: Hangzhou Huacheng Network Technology

Location: Hangzhou, China

Twitter: @SteveTa57657898


Steve Tang/Yuwu Tang's starred repositories

Stargazers: 2 | Issues: 0

LaMI-DETR

[ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"

Language: Python | License: Apache-2.0 | Stargazers: 37 | Issues: 0

ALBEF

Code for ALBEF: a new vision-language pre-training method

Language: Python | License: BSD-3-Clause | Stargazers: 1543 | Issues: 0

NormKD

The official implementation of NormKD: Normalized Logits for Knowledge Distillation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 8 | Issues: 0

audioFlux

A library for audio and music analysis and feature extraction.

Language: C | License: MIT | Stargazers: 2786 | Issues: 0

PaSST

Efficient Training of Audio Transformers with Patchout

Language: Python | License: Apache-2.0 | Stargazers: 299 | Issues: 0

Language: Python | License: BSD-3-Clause-Clear | Stargazers: 46 | Issues: 0

cav-mae

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Language: Python | License: BSD-2-Clause | Stargazers: 227 | Issues: 0

PSL

Source code for the ICASSP 2022 paper "Pseudo Strong labels for large scale weakly supervised audio tagging"

Language: Python | License: GPL-3.0 | Stargazers: 30 | Issues: 0

SAT

Streaming Audio Transformers for online audio tagging

Language: Python | License: GPL-3.0 | Stargazers: 41 | Issues: 0

convit

Code for the Convolutional Vision Transformer (ConViT)

Language: Python | License: Apache-2.0 | Stargazers: 461 | Issues: 0

NATTEN

Neighborhood Attention Extension. Bringing attention to a neighborhood near you!

Language: Cuda | License: NOASSERTION | Stargazers: 351 | Issues: 0

AudioTaggingDoneRight

Experiments on AudioSet

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 43 | Issues: 0

Language: Python | Stargazers: 2 | Issues: 0

UIT_Mobile

Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"

Language: Python | License: GPL-3.0 | Stargazers: 23 | Issues: 0

EfficientAT

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Language: Python | License: MIT | Stargazers: 224 | Issues: 0

Language: C++ | Stargazers: 17 | Issues: 0

resampler

A Simple and Efficient Audio Resampler Implementation in C

Language: C | License: MIT | Stargazers: 139 | Issues: 0

zita-resampler

Libzita-resampler is a C++ library for resampling audio signals. It is designed to be used within a real-time processing context, to be fast, and to provide high-quality sample rate conversion.

Language: C++ | License: GPL-3.0 | Stargazers: 22 | Issues: 0

libfar

C/C++ fast audio resampling library

Language: C | License: MIT | Stargazers: 40 | Issues: 0

r8brain-free-src

High-quality pro audio resampler / sample rate conversion C++ library. Very fast, for both audio resampling and time-series interpolation.

Language: C++ | License: MIT | Stargazers: 572 | Issues: 0

LibrosaCpp

LibrosaCpp is a C++ implementation of librosa that computes short-time Fourier transform coefficients, mel spectrograms, or MFCCs

Language: C++ | License: Apache-2.0 | Stargazers: 185 | Issues: 0

Spoken_language_identification

A TensorFlow-based spoken language identification system

Language: Python | License: Apache-2.0 | Stargazers: 81 | Issues: 0

TCN

Sequence modeling benchmarks and temporal convolutional networks

Language: Python | License: MIT | Stargazers: 4154 | Issues: 0

melspectrogram_c

A C++ implementation of the melspectrogram function

Language: C++ | License: GPL-3.0 | Stargazers: 4 | Issues: 0

librosapp

A C++ implementation of STFT, melspectrogram and mel_to_stft

Language: C++ | License: Unlicense | Stargazers: 8 | Issues: 0

MFCC

MFCC, mel, PCEN (librosa)

Language: C++ | Stargazers: 35 | Issues: 0

ODConv

The official project website of "Omni-Dimensional Dynamic Convolution" (ODConv for short, spotlight in ICLR 2022).

Language: Python | License: Apache-2.0 | Stargazers: 287 | Issues: 0

RaDur

The source code of RaDur

Language: Python | Stargazers: 3 | Issues: 0

Language: Python | License: MIT | Stargazers: 24 | Issues: 0