Austin Xiao (swagshaw)

Company: Nanyang Technological University

Location: Singapore

Home Page: https://swagshaw.github.io/

Austin Xiao's starred repositories

External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Language: Python | License: MIT | Stargazers: 11036 | Issues: 104 | Issues: 79

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language: TypeScript | License: Apache-2.0 | Stargazers: 7205 | Issues: 49 | Issues: 58

awesome-chatgpt

🤖 Awesome list for ChatGPT — an artificial intelligence chatbot developed by OpenAI

License: CC0-1.0 | Stargazers: 4753 | Issues: 54 | Issues: 0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

License: CC0-1.0 | Stargazers: 1065 | Issues: 21 | Issues: 0

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language: Jupyter Notebook | License: MIT | Stargazers: 1062 | Issues: 14 | Issues: 32

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Language: Cuda | License: AGPL-3.0 | Stargazers: 458 | Issues: 11 | Issues: 51

voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

Language: Python | License: Apache-2.0 | Stargazers: 371 | Issues: 25 | Issues: 25

ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Pengi

An Audio Language model for Audio Tasks

Language: Python | License: MIT | Stargazers: 255 | Issues: 14 | Issues: 11

doatools.py

A simple library for theoretical research on direction-of-arrival (DOA) estimation in array signal processing.

Language: Python | License: MIT | Stargazers: 150 | Issues: 6 | Issues: 1

allie

🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.

Language: Python | License: Apache-2.0 | Stargazers: 139 | Issues: 5 | Issues: 38

aasist

Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"

Language: Python | License: MIT | Stargazers: 136 | Issues: 7 | Issues: 7

download_audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Language: Python | License: NOASSERTION | Stargazers: 96 | Issues: 2 | Issues: 8

Multi-Source-Sound-Localization

This repo aims to perform sound localization in complex audiovisual scenes, where there are multiple objects making sounds.

doa-release

A Direction-of-Arrival estimation code repo accompanying our research paper.

FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

Language: Python | License: BSD-3-Clause | Stargazers: 57 | Issues: 3 | Issues: 5

seld-dcase2020

Baseline method for sound event localization task of DCASE 2020 challenge

Language: Python | License: NOASSERTION | Stargazers: 51 | Issues: 11 | Issues: 11

pulse

Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)

Language: Python | License: MIT | Stargazers: 38 | Issues: 2 | Issues: 3

seld-dcase2023

Baseline method for sound event localization task of DCASE 2023 challenge

audioset-download

This package aims at simplifying the download of the AudioSet dataset.

Language: Python | License: NOASSERTION | Stargazers: 34 | Issues: 2 | Issues: 4

hungarian-net

Deep-learning-based implementation of the popular Hungarian algorithm that helps solve the assignment problem.

Language: Python | License: NOASSERTION | Stargazers: 23 | Issues: 2 | Issues: 2

NeSsi

Keras/Pytorch neural network size, operations and parameters counter

DataCI

A platform for tracking data-centric AI pipelines in dynamic streaming data

Language: Python | License: MIT | Stargazers: 8 | Issues: 2 | Issues: 6

awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

License: CC0-1.0 | Stargazers: 4 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0