kimx3966

Daniel Y.T. Kim's repositories

photoswap

Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"

MIT000

PVA-CelebAHQ-IDI

Parallel Visual Attention (WACV 2024) and CelebAHQ Identity-Preserving Inpainting dataset repository.

000

continuous_3d_words_code

000

HeadStudio

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.

MIT000

langchain-kr

LangChain 공식 Document, Cookbook, 그 밖의 실용 예제를 바탕으로 작성한 한국어 튜토리얼입니다. 본 튜토리얼을 통해 LangChain을 더 쉽고 효과적으로 사용하는 방법을 배울 수 있습니다.

000

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Apache-2.0000

CompAgent_code

Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".

000

clip-interrogator

Image to prompt with BLIP and CLIP

MIT000

FreeNoise-LaVie

[ICLR 2024] Code for FreeNoise based on LaVie

Apache-2.0000

TilingZoeDepth

000

RepoForLLMs

Repository featuring fine-tuning code for various LLMs, complemented by occasional explanations, deep dives.

MIT000

Style-Embeddings

MIT000

OMG-Seg

One Model For Image/Video/Instractive/Open-Vocabulary Segmentation

NOASSERTION000

privy

Your private coding assistant

MIT000

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

BSD-3-Clause000

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

000

KoLLaVA

KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)

Apache-2.0000

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

Apache-2.0000

arxiv-translator

000

GPTEval3D

000

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Apache-2.0000

a-person-mask-generator

Extension for Automatic1111 and ComfyUI to automatically create masks for Background/Hair/Body/Face/Clothes in Img2Img

MIT000

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language:PythonApache-2.0000

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

MIT000

text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

AGPL-3.0000

ZeCon

MIT000

honeybee

Official implementation of Honeybee

NOASSERTION000

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Apache-2.0000

MotionCtrl

Apache-2.0000

LucidDreamer

Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".

NOASSERTION000