Text-Summarization Report

Data Collection

Selected t5-small model for abstractive summarization.
Prefine tuning results stored at Summarization_Model/model.ipynb .
Fine tuned it on my dataset , evaluated results at Summarization_Model/evaluation.ipynb
Model Size after fine tuning n ~ 800 MB .
Final rogue scores after fine tuning are as follows: Rouge-1: 0.42 Rouge-2: 0.22 Rouge-L: 0.34

Performed Extractive Summarization pre-processing on the dataset by removing stopwords, punctuation, and special characters and white spaces.
Selected Text Rank Algorithm for extractive summarization .
Implemented through py summa library.
Rouge scores for extractive summarization are as follows: (for 20% ratio of input text vs summary length) Rouge-1: 0.25 Rouge-2: 0.12 Rouge-L: 0.19
Optimized ratio Length of summary to input text to get better results.

Language:Jupyter Notebook 99.3%Language:Python 0.6%Language:Dockerfile 0.1%