PLM_Segformer: Automatic Damage Segmentation of Sanskrit Palm-leaf Manuscripts

PLM-Segformer framework is developed to provide an automated damage segmentation method for Sanskrit PLMs, which builds on the original Segformer architecture. The hyperparameters for pre-processing, training, inference, and post-processing phases are fully optimized to make the original model more suitable for the PLM segmentation task. The model has been used for automated PLM damage detection in Potala Palace, and it can complete 10064 pages of PLM damage Segmentation within 12 hours.

Flowchart of the PLM damage segmentation method. (a) The PLM dataset is established by digital camera acquisition and manual annotation. It has been subsequently divided into the training set, validation set, and test set. Then, various pre-processing methods (b) and loss functions (c) are compared to find the best way to build the damage segmentation models. Finally, inference enhancement methods (d) and post-processing methods (e) were used to optimize the prediction results.

Development version

Install Anaconda or Miniconda
Install Git
Open commond line, create environment and enter with the following commands:
```
 conda create -n PLM_Segformer python=3.8
 conda activate PLM_Segformer
```

Clone the repository and enter:

 git clone https://github.com/Ryan21wy/PLM_Segformer.git
 cd PLM_Segformer

Install dependency with the following commands:
```
 pip install -r requirements.txt
```

Model training

Train the model based on your own training dataset with model_train function.

model_train(train_path, val_path, model_name, save_path, arg*)

Optionnal args

train_path : file path of training data
val_path: file path of vaildation data
model_name: file name of model parameters
save_path: file path for saving training history and model parameters

Prediction

The segmentation mask of each damage is predicted using optional inference phase augmentation methods with model_prediction function.

model_prediction(img_path, model_dir, label_path=None, save_path=None, n_class=2, crop_size=None, TTA=False, TLC=False, post=False)

Optionnal args

img_path: file path of PLM images
model_dir: file path of saved model parameters
label_path: file path of damage annotation of PLM images, which uesd to calculate the evaluation metrics
save_path: file path for saving the segmentation mask of each damage
n_class: num of classes, background counts as well
crop_size: the size of image patches when using the resizing and cropping method.If none, using resized images for prediction
TTA: If true, using Test Time Augmentation
TLC: If true, using Test-time Local Converter method
post: If true, using image post-processing methods

Usage

A video demo of PLM-Segformer

Demo.mov

The PLM-Segformer models are provided in release.

The example codes for usage is included in demo.ipynb.

Contact

Wang Yue
E-mail: ryanwy@csu.edu.cn

Ryan21wy / PLM_Segformer