Hyoung-Kyu Song (deepkyu)

deepkyu

Geek Repo

Company:Captions

Location:New York

Home Page:linktr.ee/deepkyu

Twitter:@deepkyu_song

Github PK Tool:Github PK Tool


Organizations
Hugging-Face-Helping-Hand

Hyoung-Kyu Song's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26575Issues:220Issues:250

500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

500 AI Machine learning Deep learning Computer vision NLP Projects with code

corenet

CoreNet: A library for training deep neural networks

Language:PythonLicense:NOASSERTIONStargazers:6945Issues:65Issues:21

VAR

[NeurIPS 2024 Oral][GPT beats diffusionšŸ”„] [scaling laws in visual generationšŸ“ˆ] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:4055Issues:115Issues:81

Video-LLaVA

怐EMNLP 2024šŸ”„怑Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2892Issues:28Issues:179

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language:Jupyter NotebookLicense:MITStargazers:2375Issues:23Issues:231

OneTrainer

OneTrainer is a one-stop solution for all your stable diffusion training needs.

Language:PythonLicense:AGPL-3.0Stargazers:1687Issues:23Issues:281

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

Language:JavaScriptLicense:MITStargazers:1302Issues:12Issues:103

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language:PythonLicense:NOASSERTIONStargazers:1266Issues:62Issues:224

lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Language:PythonLicense:Apache-2.0Stargazers:1155Issues:34Issues:496

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:882Issues:19Issues:68

FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:716Issues:12Issues:41

LanguageBind

怐ICLR 2024šŸ”„怑 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonLicense:MITStargazers:698Issues:15Issues:59

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language:PythonLicense:MITStargazers:693Issues:15Issues:82

sdxs

Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"

Language:PythonLicense:Apache-2.0Stargazers:599Issues:26Issues:18

PLLaVA

Official repository for the paper PLLaVA

LLaVA-UHD

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

DiffusionVideoEditing

Official project repo for paper "Speech Driven Video Editing via an Audio-Conditioned Diffusion Model"

Language:PythonLicense:MITStargazers:227Issues:14Issues:11

smirk

Official Pytorch Implementation of SMIRK: 3D Facial Expressions through Analysis-by-Neural-Synthesis (CVPR 2024)

Language:PythonLicense:MITStargazers:164Issues:9Issues:28

MyVLM

Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)

Language:PythonLicense:NOASSERTIONStargazers:144Issues:14Issues:5
Language:PythonLicense:Apache-2.0Stargazers:101Issues:3Issues:2
Language:PythonLicense:NOASSERTIONStargazers:85Issues:8Issues:0

model-stock

Model Stock: All we need is just a few fine-tuned models

Language:Jupyter NotebookStargazers:80Issues:12Issues:0

netspresso-trainer

A library for training, compressing and deploying computer vision models (including ViT) with edge devices

Language:PythonLicense:Apache-2.0Stargazers:61Issues:5Issues:190

shortened-llm

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

PyNetsPresso

The official NetsPresso Python package.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:40Issues:3Issues:174

Typescript-ReactJS-WebRTC-1-1-P2P

1:1 P2P WebRTC with ReactJS, Typescript, Node.js

Language:TypeScriptLicense:MITStargazers:21Issues:2Issues:1

cap2qa

Official implementation of "Visually Dehallucinative Instruction Generation" (ICASSP 2024)

License:BSD-3-ClauseStargazers:5Issues:4Issues:0