Hyoung-Kyu Song's starred repositories
500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
500 AI Machine learning Deep learning Computer vision NLP Projects with code
VAR
[NeurIPS 2024 Oral][GPT beats diffusionš„] [scaling laws in visual generationš] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Video-LLaVA
ćEMNLP 2024š„ćVideo-LLaVA: Learning United Visual Representation by Alignment Before Projection
clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
LanguageBind
ćICLR 2024š„ć Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
DiffusionVideoEditing
Official project repo for paper "Speech Driven Video Editing via an Audio-Conditioned Diffusion Model"
model-stock
Model Stock: All we need is just a few fine-tuned models
netspresso-trainer
A library for training, compressing and deploying computer vision models (including ViT) with edge devices
shortened-llm
Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]
PyNetsPresso
The official NetsPresso Python package.
Typescript-ReactJS-WebRTC-1-1-P2P
1:1 P2P WebRTC with ReactJS, Typescript, Node.js