GiacomoPracucci / detect-asd

Image classification to detect autism spectrum disorder and analysis of the related scientific literature using NLP and data visualization techniques. Developed during 2nd year of study in Data Science (June 2023), together with my colleague Claudio Pezzoni.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Detect Autism Spectrum Disorder

Authors:

  • Claudio Pezzoni
  • Giacomo Pracucci

The repo contains the implementation of a project for the Data Science Lab in Bioscences exam, completed with a rating of 30/30 (June 2023). It is an image recognition work in which, Using deep learning techniques, patients are classified as having or not having autism spectrum disorder.

Given the excellent results obtained in terms of accuracy, the second part of the project consisted of investigating how much these techniques are used as support of doctors in diagnosing asd, as well as investigating more generally, how research is moving in this area. The necessary data were acquired from pubmed and topic modeling and data visualization techniques were used to achieve this goal.

Link to the original project that inspired this work: https://github.com/Mahmoud-Elbattah/Predicting_ASD

And reference publications:

CNN from scratch results

  • 92% of accuracy on test set achieved in the last training epoch
  • 95% of accuracy on test set achieved in the 19th of 20 training epochs Immagine 2023-09-23 101806

VGG16 results

  • 93% of accuracy on test set achieved in the last epoch of training
  • 95% of accuracy on test set achieved in the 19th of 20 training epochs Immagine 2023-09-23 101905

berTopic results

The original idea was to perform a very general query on PubMed, to obtain a large number of documents on which to do TopicModeling using the BERTopic algorithm. The hope was that topics would be identified that represented various lines of research relating to the diagnosis on which we could then carry out our analysis. However, the attempt failed, as the 11 topics identified by BERT did not group together potential research fields of ASD diagnosis Immagine 2023-09-23 102404

Research trends

Using as a reference what was cited in the reference papers, which lists alternative or complementary support methodologies to eye tracking analysis, we counted how many publications each year contained the keywords associated with the methodologies and their frequency out of the total articles

  • Time series of the number of publications per year (3-year moving average) Immagine 2023-09-23 102834
  • Percentage of publications on topics of interest compared to the total Immagine 2023-09-23 102924
  • Number of publications per keyword before and after 2015 Immagine 2023-09-23 103044

Conclusions

The results obtained from the classification models indicate that scan-path analysis can be a very useful method to support a doctor in the diagnosis of ASD. Scientific research has begun to explore these new possibilities in recent years, but their use still remains limited compared to that of more "traditional" methodologies. The results obtained suggest a significant potential in the diagnostic technique in question. It is therefore essential that research continues along this path, with the aim of further refining existing methodologies and developing practical solutions that can be used effectively by medical professionals.

About

Image classification to detect autism spectrum disorder and analysis of the related scientific literature using NLP and data visualization techniques. Developed during 2nd year of study in Data Science (June 2023), together with my colleague Claudio Pezzoni.

License:Apache License 2.0


Languages

Language:Python 100.0%