diffusion-models dreambooth ffhq ffhq-dataset generative-adversarial-network generative-ai generative-model latent-diffusion latent-diffusion-models unconditional-generation autoencoder diffusers huggingface unet-image-segmentation unet-segmentation artificial-intelligence research research-project generative-adversarial-networks generative-models

latent-diffusion-FFHQ256-dreambooth

An unconditional generative model trained on the FFHQ face dataset at 256×256 resolution and then fine-tuned using the Dreambooth method. The latent diffusion model is implemented with the Diffusers library from HuggingFace.

Project Structure

FaceGenerativeModel/
|-- data/
|   |-- ffhq256/   # FFHQ dataset (downloaded separately)
|   |-- subject/   # Small face dataset of a single subject
|-- diffusers/   # Cloned Diffusers library from HuggingFace
|-- src/
|   |-- data_loading.py   # Script for loading and preparing data
|   |-- models.py   # Implementation of FaceAutoencoder and LatentDiffusionModel
|   |-- train_latent_diffusion.py   # Script for Task 1 - Training Latent Diffusion Model
|   |-- train_finetune_model.py   # Script for Task 2 - Finetuning the Model
|   |-- inference.py   # Inference script for generating samples
|-- utils/
|   |-- dreambooth_utils.py   # Utilities for Dreambooth method
|   |-- inference_utils.py   # Utilities for inference script
|-- README.md   # Detailed README with instructions and project overview
|-- report.pdf   # 2-3 page report with sampled images and loss curves

Prerequisites

Python 3.7+
GPU (recommended for faster training)
Google Colab or Google Cloud (optional, for free GPU usage)
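
The prerequisites above can be checked before launching a long training run. The snippet below is a minimal sketch, not part of the project: the helper names are hypothetical, and it assumes the training scripts run on PyTorch, which this README does not state explicitly.

```python
import sys


def python_version_ok(minimum=(3, 7)):
    """True when the running interpreter meets the minimum version listed above."""
    return sys.version_info >= minimum


def gpu_available():
    """True if PyTorch is installed and reports a CUDA device, False otherwise.

    Assumption: the project uses PyTorch; if it is not installed, this
    simply reports that no GPU was detected.
    """
    try:
        import torch
    except ImportError:
        return False
    return torch.cuda.is_available()


if __name__ == "__main__":
    print("Python 3.7+:", python_version_ok())
    print("CUDA GPU detected:", gpu_available())
```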

Data Preparation

Download the FFHQ dataset and save it in the data/ffhq256/ folder.
Prepare a small face dataset of a single subject in the data/subject/ folder using 10-15 photos.
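
Before fine-tuning, it may help to confirm that the subject folder holds the suggested 10-15 photos. The following is a small sketch under stated assumptions: the function names and the set of accepted file extensions are illustrative and do not come from src/data_loading.py.

```python
from pathlib import Path

# Extensions treated as images; this set is an assumption, not taken from the project.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg"}


def list_images(folder):
    """Return the image files directly inside `folder`, sorted by name."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )


def check_subject_folder(folder, min_count=10, max_count=15):
    """Return (ok, count); ok is True when the photo count is in the suggested range."""
    count = len(list_images(folder))
    return min_count <= count <= max_count, count


if __name__ == "__main__":
    subject_dir = Path("data/subject")
    if subject_dir.is_dir():
        ok, count = check_subject_folder(subject_dir)
        print(f"{count} subject photos found; within 10-15: {ok}")
    else:
        print("data/subject/ not found")
```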

Training Latent Diffusion Model (Task 1)

Run the following command to train the latent diffusion model on the FFHQ dataset:
```
python src/train_latent_diffusion.py --data_path data/ffhq256
```
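
For orientation, diffusion models are commonly trained with a noise-prediction objective: a clean sample is mixed with Gaussian noise at a random timestep, and the network learns to predict that noise. In a latent diffusion model the sample is an autoencoder latent rather than an image. The sketch below illustrates that forward noising step in plain Python; it is a generic illustration, not the code in src/train_latent_diffusion.py, and the schedule values (1000 steps, betas from 1e-4 to 0.02) are common DDPM defaults assumed here rather than settings confirmed for this project.

```python
import math
import random


def linear_beta_schedule(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances beta_1..beta_T (assumed defaults)."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def add_noise(latent, noise, alpha_bar_t):
    """Forward process: z_t = sqrt(alpha_bar_t) * z_0 + sqrt(1 - alpha_bar_t) * eps."""
    a = math.sqrt(alpha_bar_t)
    b = math.sqrt(1.0 - alpha_bar_t)
    return [a * z + b * e for z, e in zip(latent, noise)]


def mse(pred, target):
    """Mean squared error between predicted and true noise."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


if __name__ == "__main__":
    rng = random.Random(0)
    alpha_bars = cumulative_alphas(linear_beta_schedule())
    latent = [rng.gauss(0.0, 1.0) for _ in range(8)]  # stand-in for an autoencoder latent
    noise = [rng.gauss(0.0, 1.0) for _ in range(8)]
    t = rng.randrange(len(alpha_bars))  # random timestep, as drawn during training
    noisy = add_noise(latent, noise, alpha_bars[t])
    # A real model would predict `noise` from (noisy, t); zeros stand in for that prediction.
    print("timestep:", t, "loss of a zero prediction:", mse([0.0] * len(noise), noise))
```

At small timesteps the noisy latent stays close to the original, while at the final timestep it is almost pure noise, which is why sampling can start from random noise and denoise step by step.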

After training, generate new face samples using:

```
python src/inference.py --model_path checkpoints/latent_diffusion.pth
```

Fine-tuning the Model (Task 2)

Run the following command to fine-tune the model on the subject dataset:

```
python src/train_finetune_model.py --data_path data/subject --pretrained_model_path checkpoints/latent_diffusion.pth
```
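
As background, the Dreambooth method is usually described as combining two terms: a noise-prediction loss on the subject photos and a prior-preservation loss on samples drawn from the original model, so that fine-tuning on a handful of images does not erase what the model already knows. The sketch below shows how the two terms combine. It is an illustration only: whether and how utils/dreambooth_utils.py applies prior preservation is not stated in this README, and the names `dreambooth_loss` and `prior_weight` are hypothetical.

```python
def mse(pred, target):
    """Mean squared error between two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def dreambooth_loss(subject_pred, subject_noise, prior_pred, prior_noise, prior_weight=1.0):
    """Subject noise-prediction loss plus a weighted prior-preservation term.

    `prior_weight` (hypothetical name) trades off fitting the subject
    against staying close to the pretrained model's outputs.
    """
    subject_term = mse(subject_pred, subject_noise)
    prior_term = mse(prior_pred, prior_noise)
    return subject_term + prior_weight * prior_term


if __name__ == "__main__":
    # Dummy predictions and targets, chosen only to make the arithmetic easy to follow.
    loss = dreambooth_loss(
        subject_pred=[1.0, 1.0], subject_noise=[0.0, 0.0],
        prior_pred=[2.0, 2.0], prior_noise=[0.0, 0.0],
        prior_weight=0.5,
    )
    print(loss)
```

Setting `prior_weight` to 0 reduces this to plain fine-tuning on the subject photos, which tends to overfit when only 10-15 images are available.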

After fine-tuning, generate new face samples of the subject using:
```
python src/inference.py --model_path checkpoints/finetune.pth
```

Author

👤 Aras Güngöre

LinkedIn: @arasgungore
GitHub: @arasgungore

MIT License

Languages

Python 99.9%, Dockerfile 0.1%, Makefile 0.0%