lstm pytorch pytorch-implementation remaining-useful-life remaining-useful-life-prediction rul-prediction

Turbofan engine RUL prediction

A reproduction of this paper by using PyTorch: Machine Remaining Useful Life Prediction via an Attention-Based Deep Learning Approach
for TensorFlow implementation, please visit the original author's repository

Requirements

The package version listed above is the version I used during my development, but you can still try other versions.

Usage

Download

git clone https://github.com/zhmou/Turbofan-engine-RUL-prediction.git
cd ./Turbofan-engine-RUL-prediction

Open project with your IDE

Click main.py and modify the load path of the dataset to suit your needs.

If you load a specific dataset, you need to change the max_rul value in turbofandataset.py:
(for FD001, the value of max_rul is 130 and 150 for FD004. Please refer to another paper)

This paper only tested two sub-datasets of CMAPSS(FD001 and FD004). if you are interested in other datasets, don't forget to normalizing the raw data by trying preprocess.py

Run main.py

It will take up about 2~3 minutes to load the whole dataset, don't worry.
When the output from the console looks like this, congratulations, the network is training now:

The program will save the model parameters at ./checkpoints/ automaticlly during every iteration when it found a better result.

Based on the feedback, you will need to manually create this folder (./checkpoints/) in the current path to avoid reporting an error.

After 10 iterations(32 epochs per iteration), best result of each iteration under eval metrics would save to a txt like this:

Network Architecture

The network can mainly spilt into two parts:
The left one takes a 30(windows size, or time step, default by 30) * 17(sensory nums) sequential data as the inputs of one sample. Firstly, it would be sent into LSTM to output a 30 * 50 feature map. Then, a very simplified attention mechanism would be performed, It will caculate the weights of each particular feature and get an attention matrix:

The attention matrix will make a dot product with the feature map and flatten to be a 1D vector of length 1500. After 2 linear layers(with ReLU, dropout, etc.), we finally get a 1D vector of length 10.

Take a look at the right-side part. for every column of each sample, there are two handcrafted features can be extracted: mean value and trend coefficient(or you can say the slope of the line fitted to these 30 points in one column). Since each sample has 17 columns, we can obtain a 1D vector of length 34. As before, after a linear layer, we get a vector of length 10.

We concatenate these two vectors and get a 1D vector of length 20. The finally thing is to pass through a output layer and get our predicted RUL value. (Note that the label of the dataset is normalized by dividing by the max_rul. )

Eval Metrics

RMSE: root of MSE
Score:

This scoring function penalizes late predictions (too late to perform maintenance) more than early predictions (no big harms although it could waste maintenance resources). This is in line with the risk adverse attitude in aerospace industries. However, there are several drawbacks with this function. The most significant drawback being a single outlier (with a much late prediction) would dominate the overall performance score, thus masking the true overall accuracy of the algorithm. Another drawback is the lack of consideration of the prognostic horizon of the algorithm. The prognostic horizon assesses the time before failure which the algorithm is able to accurately estimate the RUL value within a certain con- fidence level. Finally, this scoring function favors algorithms which artificially lowers the score by underestimating RUL. Despite all these shortcomings, the scoring function is still used in this paper to provide comparison results with other methods in literature.

~Deep Convolutional Neural Network Based Regression Approach for Estimation of Remaining Useful Life

Result

The original paper results:

Guess it's the average or median value of the best results of 10 iterations.

My results(10 best result of every iteration):
FD001:

FD004:

On FD004, the results reproduced using PyTorch differ slightly from those of the original paper, which can be seen is that in my results. The 10 best results are more spread out compared to original authors' paper:

I am trying to figure out the cause of this problem, if you have a good idea please post a issue and let me know, thanks gratefully!

About

RUL prediction for C-MAPSS dataset, reproduction of this paper: https://personal.ntu.edu.sg/xlli/publication/RULAtt.pdf