Sanghwa Ham's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130578Issues:1118Issues:15479

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32712Issues:276Issues:1086

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:22137Issues:165Issues:1557

petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Language:PythonLicense:MITStargazers:9022Issues:90Issues:199

mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Language:PythonLicense:Apache-2.0Stargazers:7861Issues:52Issues:2352

lama

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7694Issues:84Issues:250

auto-sklearn

Automated Machine Learning with scikit-learn

Language:PythonLicense:BSD-3-ClauseStargazers:7518Issues:214Issues:1018

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:6370Issues:44Issues:82

anomalib

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Language:PythonLicense:Apache-2.0Stargazers:3559Issues:38Issues:859
Language:PythonLicense:NOASSERTIONStargazers:3193Issues:159Issues:112

onnx-tensorrt

ONNX-TensorRT: TensorRT backend for ONNX

Language:C++License:Apache-2.0Stargazers:2879Issues:68Issues:660

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1706Issues:11Issues:135

Restormer

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Language:PythonLicense:NOASSERTIONStargazers:1679Issues:18Issues:97

open-unmix-pytorch

Open-Unmix - Music Source Separation for PyTorch

Language:PythonLicense:MITStargazers:1218Issues:33Issues:78

flite

A small fast portable speech synthesis system

Language:CLicense:NOASSERTIONStargazers:850Issues:34Issues:71

how-do-vits-work

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

Language:PythonLicense:Apache-2.0Stargazers:802Issues:7Issues:42

Neural-Voice-Cloning-With-Few-Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

Language:PythonLicense:MITStargazers:428Issues:31Issues:22

DnCNN-PyTorch

PyTorch implementation of the TIP2017 paper "Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising"

Language:PythonLicense:GPL-3.0Stargazers:392Issues:6Issues:12

DL_Compiler

Study Group of Deep Learning Compiler

w2v2-speaker

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053

Language:PythonLicense:MITStargazers:142Issues:4Issues:14

EDCNN

EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising

Language:PythonLicense:Apache-2.0Stargazers:141Issues:2Issues:5

DBSN

Unpaired Learning of Deep Image Denoising

SpeechSynthesis

음성합성 관련 자료 모음

MelSpecVAE

Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis

Language:Jupyter NotebookLicense:MITStargazers:125Issues:4Issues:5

AP-BSN

Official PyTorch implementation of "AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network" in CVPR 2022.

Language:PythonLicense:MITStargazers:99Issues:2Issues:19

chafon-rfid

Read RFID data from Chafon UHF readers

Language:PythonLicense:MITStargazers:68Issues:10Issues:12

video_autoencoder

Video lstm auto encoder built with pytorch. https://arxiv.org/pdf/1502.04681.pdf

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:39Issues:3Issues:4

UIDNet

End-to-End Unpaired Image Denoising with Conditional Adversarial Networks (AAAI-20)

mimic-my-voice

[WIP] Create a Text to Speech Engine using Your Own Voice with Mycroft's Mimic Recording Studio & Coqui Text to Speech.

Language:ShellLicense:MITStargazers:21Issues:3Issues:0
Language:JavaScriptLicense:Apache-2.0Stargazers:12Issues:0Issues:0