김진원's repositories
laplacian-pyramid-blend
Implementation of image blending alogrithm
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
axolotl
Go ahead and axolotl questions
CelebV-HQ
[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
community-events
Place where folks can contribute to 🤗 community events
computer-science
:mortar_board: Path to a free self-taught education in Computer Science!
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
GFPGAN-1024
GFPGAN 1024
jinwonkim93
My personal repository
ml-engineering
Machine Learning Engineering Open Book
pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
talking_face_preprocessing
Preprocessing Scipts for Talking Face Generation
TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
translatotron-v
Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024)
unitable
UniTable: Towards a Unified Table Foundation Model
visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models