Jeong-Sik Lee's starred repositories

Language:PythonStargazers:166Issues:0Issues:0

Faster-Diffusion

[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"

Language:PythonLicense:Apache-2.0Stargazers:285Issues:0Issues:0

big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2241Issues:0Issues:0

fastapi-best-practices

FastAPI Best Practices and Conventions we used at our startup

Stargazers:8745Issues:0Issues:0

Misc-Cheatsheet

대학원 생활을 하며 사용하는 작고 소중한 코딩팁 (linux 명령어 등)

Language:Vim scriptStargazers:380Issues:0Issues:0

DenseDiffusion

Official Pytorch Implementation of DenseDiffusion (ICCV 2023)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:475Issues:0Issues:0

machine-learning-interview

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

Stargazers:9094Issues:0Issues:0

MLQuestions

Machine Learning and Computer Vision Engineer - Technical Interview Questions

Stargazers:2920Issues:0Issues:0

Ready-For-Tech-Interview

💻 신입 개발자로서 지식을 쌓기 위해 공부하는 공간 👨‍💻

License:MITStargazers:4535Issues:0Issues:0

tech-interview-for-developer

👶🏻 신입 개발자 전공 지식 & 기술 면접 백과사전 📖

Language:JavaLicense:MITStargazers:14446Issues:0Issues:0

coding-interview

취업 준비를 위해 공부한 내용을 정리하는 레포

Language:JavaScriptStargazers:375Issues:0Issues:0

ai-tech-interview

👩‍💻👨‍💻 AI 엔지니어 기술 면접 스터디 (⭐️ 1k+)

License:MITStargazers:1814Issues:0Issues:0

Monodepth

PyTorch implementation of Unsupervised Monocular Depth Estimation with Left-Right Consistency

Language:PythonStargazers:24Issues:0Issues:0

Min-SNR-Diffusion-Training

[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy

Language:PythonStargazers:216Issues:0Issues:0

RetNet

An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"

Language:PythonLicense:MITStargazers:1160Issues:0Issues:0

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonLicense:MITStargazers:1148Issues:0Issues:0

StereoAlgorithms

Stereo Algorithms (Include:CREStereo,RAFT-Stereo,Hitnet,FastACVNet_plus,Stereo Transformers,RealtimeStereo,DistDepth) with TensorRT,ORT,OpenVINO

Language:C++License:MITStargazers:177Issues:0Issues:0

ONNX-FastACVNet-Depth-Estimation

Python scripts performing stereo depth estimation using the Fast-ACVNet model in ONNX.

Language:PythonLicense:MITStargazers:39Issues:0Issues:0

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1525Issues:0Issues:0

cav-mae

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Language:PythonLicense:BSD-2-ClauseStargazers:223Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Language:PythonLicense:MITStargazers:7577Issues:0Issues:0

Resemblyzer

A python package to analyze and compare voices with deep learning

Language:PythonLicense:Apache-2.0Stargazers:2746Issues:0Issues:0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:52264Issues:0Issues:0

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Language:PythonLicense:MITStargazers:2098Issues:0Issues:0

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonLicense:NOASSERTIONStargazers:2247Issues:0Issues:0

TriAAN-VC

TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

Language:PythonLicense:MITStargazers:143Issues:0Issues:0

ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)

Language:PythonLicense:NOASSERTIONStargazers:868Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11928Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:68140Issues:0Issues:0
Language:PythonStargazers:115Issues:0Issues:0