Beast code in Giters

Jeong-Sik Lee's starred repositories

weight-selection

Language:Python16600

Faster-Diffusion

[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"

Language:PythonApache-2.028500

big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Language:Jupyter NotebookApache-2.0224100

fastapi-best-practices

FastAPI Best Practices and Conventions we used at our startup

874500

Misc-Cheatsheet

대학원 생활을 하며 사용하는 작고 소중한 코딩팁 (linux 명령어 등)

Language:Vim script38000

DenseDiffusion

Official Pytorch Implementation of DenseDiffusion (ICCV 2023)

Language:Jupyter NotebookApache-2.047500

machine-learning-interview

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

909400

MLQuestions

Machine Learning and Computer Vision Engineer - Technical Interview Questions

292000

Ready-For-Tech-Interview

💻 신입 개발자로서 지식을 쌓기 위해 공부하는 공간 👨‍💻

MIT453500

tech-interview-for-developer

👶🏻 신입 개발자 전공 지식 & 기술 면접 백과사전 📖

Language:JavaMIT1444600

coding-interview

취업 준비를 위해 공부한 내용을 정리하는 레포

Language:JavaScript37500

ai-tech-interview

👩‍💻👨‍💻 AI 엔지니어 기술 면접 스터디 (⭐️ 1k+)

MIT181400

Monodepth

PyTorch implementation of Unsupervised Monocular Depth Estimation with Left-Right Consistency

Language:Python2400

Min-SNR-Diffusion-Training

[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy

Language:Python21600

RetNet

An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"

Language:PythonMIT116000

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonMIT114800

StereoAlgorithms

Stereo Algorithms (Include:CREStereo,RAFT-Stereo,Hitnet,FastACVNet_plus,Stereo Transformers,RealtimeStereo,DistDepth) with TensorRT,ORT,OpenVINO

Language:C++MIT17700

ONNX-FastACVNet-Depth-Estimation

Python scripts performing stereo depth estimation using the Fast-ACVNet model in ONNX.

Language:PythonMIT3900

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookApache-2.0152500

cav-mae

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Language:PythonBSD-2-Clause22300

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Language:PythonMIT757700

Resemblyzer

A python package to analyze and compare voices with deep learning

Language:PythonApache-2.0274600

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION5226400

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Language:PythonMIT209800

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonNOASSERTION224700

TriAAN-VC

TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

Language:PythonMIT14300

ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)

Language:PythonNOASSERTION86800

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonMIT1192800

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6814000

InjectFusion_official

Language:Python11500