Dongkeon Park (DongKeon)

DongKeon

Geek Repo

Company:GIST (Gwangju Institute of Science and Technology)

Location:Gwangju, Republic of Korea

Home Page:https://velog.io/@dongkeon

Github PK Tool:Github PK Tool

Dongkeon Park's repositories

Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

LoCoNet-ASD

LoCoNet: Long-Short Context Network for Active Speaker Detection (2023 CVPR)

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

EENDasP

Implementation of "End-to-End Speaker Diarization as Post-Processing"

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

Awesome-DeepLearning-Study

Summary of DeepLearning (Korean and English are included)

Language:PythonLicense:MITStargazers:1Issues:1Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

2021_5th_MWP_Generator

Problem Generator for Math Word Prediction

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

babelspeech

바벨스피치 (캐글뽀개기X바벨피쉬 콜라보 스터디 자료보관용)

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

crnn-audio-classification

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:SCSSLicense:NOASSERTIONStargazers:0Issues:1Issues:0

EEND-vector-clustering

This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

FISVDD

Fast Incremental Support Vector Data Description implemented in Python

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:2Issues:0

GC_track3_DB_GIST

3rd Grand Challenge track 3 DB developed by GIST

Stargazers:0Issues:1Issues:0

GIST_ASD_DETECTION

Deep learning based autism spectral disorder detection from children voice

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

models

Models and examples built with TensorFlow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

SPELL

Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

voxceleb_trainer

In defence of metric learning for speaker recognition

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

rasta_py

RASTA-PLP and MFCC tool based rasta-mat

Language:PythonStargazers:0Issues:1Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Stargazers:0Issues:0Issues:0

theorydb.github.io

theorydb's blog

Language:HTMLLicense:NOASSERTIONStargazers:0Issues:1Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

License:Apache-2.0Stargazers:0Issues:0Issues:0

TS-TalkNet

INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues

Language:PythonStargazers:0Issues:0Issues:0

YOLOX_AUDIO

Audio event detection model based on YOLOX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0