Daniel Y.T. Kim's repositories

photoswap

Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"

License:MITStargazers:0Issues:0Issues:0

PVA-CelebAHQ-IDI

Parallel Visual Attention (WACV 2024) and CelebAHQ Identity-Preserving Inpainting dataset repository.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

HeadStudio

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.

License:MITStargazers:0Issues:0Issues:0

langchain-kr

LangChain ๊ณต์‹ Document, Cookbook, ๊ทธ ๋ฐ–์˜ ์‹ค์šฉ ์˜ˆ์ œ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ์ž‘์„ฑํ•œ ํ•œ๊ตญ์–ด ํŠœํ† ๋ฆฌ์–ผ์ž…๋‹ˆ๋‹ค. ๋ณธ ํŠœํ† ๋ฆฌ์–ผ์„ ํ†ตํ•ด LangChain์„ ๋” ์‰ฝ๊ณ  ํšจ๊ณผ์ ์œผ๋กœ ์‚ฌ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ๋ฐฐ์šธ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Stargazers:0Issues:0Issues:0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

License:Apache-2.0Stargazers:0Issues:0Issues:0

CompAgent_code

Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".

Stargazers:0Issues:0Issues:0

clip-interrogator

Image to prompt with BLIP and CLIP

License:MITStargazers:0Issues:0Issues:0

FreeNoise-LaVie

[ICLR 2024] Code for FreeNoise based on LaVie

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

RepoForLLMs

Repository featuring fine-tuning code for various LLMs, complemented by occasional explanations, deep dives.

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

OMG-Seg

One Model For Image/Video/Instractive/Open-Vocabulary Segmentation

License:NOASSERTIONStargazers:0Issues:0Issues:0

privy

Your private coding assistant

License:MITStargazers:0Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers:0Issues:0Issues:0

KoLLaVA

KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

License:Apache-2.0Stargazers:0Issues:0Issues:0

a-person-mask-generator

Extension for Automatic1111 and ComfyUI to automatically create masks for Background/Hair/Body/Face/Clothes in Img2Img

License:MITStargazers:0Issues:0Issues:0

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

License:MITStargazers:0Issues:0Issues:0

text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

License:AGPL-3.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

honeybee

Official implementation of Honeybee

License:NOASSERTIONStargazers:0Issues:0Issues:0

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

LucidDreamer

Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".

License:NOASSERTIONStargazers:0Issues:0Issues:0