Search and IR on COVID-19 Research Papers
Apply NLP to COVID-19 research papers to explore the corpus, find similar papers, match papers to queries, and extract keywords and entity relationships.
-
Exploratory Data Analysis (EDA)
- Find the most common words and bigrams in paper titles
- Topic modelling with Latent Dirichlet Allocation (LDA) in gensim, visualized with pyLDAvis
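
The word and bigram counts can be sketched with the standard library alone. This is a minimal illustration, not the project's code: the `titles` list is made-up sample data, and the tokenizer and stopword list are simplifying assumptions.

```python
import re
from collections import Counter

# Hypothetical sample titles; the real input would be the titles of the papers.
titles = [
    "Clinical features of COVID-19 patients",
    "Transmission dynamics of COVID-19",
    "Clinical features and transmission of SARS-CoV-2",
]

# Tiny illustrative stopword list (an assumption; a real run would use a fuller one).
STOPWORDS = {"of", "and", "the", "in", "a"}

def tokenize(text):
    """Lowercase, keep hyphenated terms such as 'covid-19' whole, drop stopwords."""
    tokens = re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", text.lower())
    return [t for t in tokens if t not in STOPWORDS]

word_counts = Counter()
bigram_counts = Counter()
for title in titles:
    tokens = tokenize(title)
    word_counts.update(tokens)
    # Pair each token with its successor; bigrams never cross title boundaries,
    # but they do bridge over removed stopwords.
    bigram_counts.update(zip(tokens, tokens[1:]))

print(word_counts.most_common(3))
print(bigram_counts.most_common(3))
```

Removing stopwords before pairing is a design choice: it surfaces content bigrams such as ("clinical", "features") at the cost of joining words that were not strictly adjacent in the title.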
-
Find similar papers
- Embed paper titles with the Universal Sentence Encoder (USE) and find similar titles by cosine similarity
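
As a rough sketch of the similarity step, the snippet below ranks titles by cosine similarity. The three-dimensional vectors are toy stand-ins for real USE embeddings, and the names `embeddings` and `most_similar` are hypothetical, chosen only for illustration.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)

# Toy vectors standing in for USE title embeddings (assumption: real ones
# would come from the encoder and have far more dimensions).
embeddings = {
    "Paper A": [1.0, 0.0, 0.0],
    "Paper B": [0.8, 0.6, 0.0],
    "Paper C": [0.0, 0.0, 1.0],
}

def most_similar(title, embeddings, top_k=2):
    """Rank every other title by cosine similarity to `title`."""
    target = embeddings[title]
    scores = [
        (other, cosine_similarity(target, vec))
        for other, vec in embeddings.items()
        if other != title
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_k]

print(most_similar("Paper A", embeddings))
```

Cosine similarity depends only on the direction of the vectors, not their length, which is why it is a common choice for comparing sentence embeddings.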
-
Find papers matching query
- Embed the query and rank papers by cosine similarity, using a similarity matrix over the embeddings
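
One way to picture query matching is shown below. To keep the sketch runnable without external models, `embed` is a plain word-count vector, which is explicitly NOT the Universal Sentence Encoder; the sample `papers` and the `search` helper are likewise hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical sample titles standing in for the paper collection.
papers = [
    "Vaccine development for coronavirus",
    "Mental health during lockdown",
    "Coronavirus transmission in hospitals",
]

def tokenize(text):
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

# Vocabulary built from the papers; each word gets one vector dimension.
vocab = sorted({tok for title in papers for tok in tokenize(title)})

def embed(text):
    """Stand-in embedding (word counts over `vocab`), not a learned encoder."""
    counts = Counter(tokenize(text))
    return [float(counts[word]) for word in vocab]

def normalize(vec):
    """Scale to unit length; zero vectors are returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec

# Unit-length rows, so a dot product with a unit query equals cosine similarity.
paper_matrix = [normalize(embed(title)) for title in papers]

def search(query, top_k=2):
    """Return the top_k (title, score) pairs for the query."""
    q = normalize(embed(query))
    scores = [sum(a * b for a, b in zip(row, q)) for row in paper_matrix]
    ranked = sorted(zip(papers, scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:top_k]

print(search("coronavirus vaccine"))
```

Normalizing the paper vectors once up front means each query costs only one dot product per paper.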
-
Keyword extraction
- Extract keywords from abstracts using RAKE
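
To show the idea behind RAKE (Rapid Automatic Keyword Extraction), here is a simplified reimplementation in plain Python. It is a sketch of the scoring scheme, not the API of any RAKE package; the stopword list and the sample abstract are assumptions made for illustration.

```python
import re
from collections import defaultdict

# Small illustrative stopword list (an assumption; RAKE normally uses a longer one).
STOPWORDS = {"a", "an", "and", "are", "for", "in", "is", "of", "on", "the", "to", "with"}

def candidate_phrases(text):
    """Split text into runs of non-stopword words, breaking at punctuation and stopwords."""
    phrases = []
    # Punctuation ends a phrase, so handle each punctuation-delimited fragment separately.
    for fragment in re.split(r"[.,;:!?()]", text.lower()):
        current = []
        for word in re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", fragment):
            if word in STOPWORDS:
                if current:
                    phrases.append(tuple(current))
                current = []
            else:
                current.append(word)
        if current:
            phrases.append(tuple(current))
    return phrases

def rake(text):
    """Score each candidate phrase as the sum of its words' degree/frequency ratios."""
    phrases = candidate_phrases(text)
    freq = defaultdict(int)
    degree = defaultdict(int)
    for phrase in phrases:
        for word in phrase:
            freq[word] += 1
            degree[word] += len(phrase)  # co-occurrences in the phrase, itself included
    scores = {
        " ".join(phrase): sum(degree[w] / freq[w] for w in phrase)
        for phrase in set(phrases)
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Made-up abstract text for demonstration only.
abstract = (
    "Respiratory droplets are the main route of viral transmission. "
    "Face masks reduce viral transmission in hospitals."
)
keywords = rake(abstract)
print(keywords[:3])
```

Because a word's degree grows with the length of the phrases it appears in, this scoring tends to favor longer phrases.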
-
Knowledge graphs
- Detect entities, parse dependencies, and build knowledge graphs from paper abstracts
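
The graph-building step can be sketched as follows. The snippet assumes that entity detection and dependency parsing have already produced (subject, relation, object) triples; the triples below are hand-written for illustration, and `build_graph` and `reachable` are hypothetical helper names.

```python
from collections import defaultdict

# Hand-written example triples. In a real pipeline these would be extracted
# from the abstracts by entity detection and dependency parsing.
triples = [
    ("SARS-CoV-2", "causes", "COVID-19"),
    ("COVID-19", "affects", "lungs"),
    ("SARS-CoV-2", "binds", "ACE2"),
]

def build_graph(triples):
    """Directed graph as adjacency lists: subject -> [(relation, object), ...]."""
    graph = defaultdict(list)
    for subject, relation, obj in triples:
        graph[subject].append((relation, obj))
    return graph

def reachable(graph, start):
    """Entities reachable from `start` by following edges (iterative depth-first search)."""
    seen, stack = set(), [start]
    while stack:
        node = stack.pop()
        # .get avoids creating empty entries for nodes with no outgoing edges.
        for _, neighbor in graph.get(node, []):
            if neighbor not in seen:
                seen.add(neighbor)
                stack.append(neighbor)
    return seen

graph = build_graph(triples)
print(graph["SARS-CoV-2"])
print(reachable(graph, "SARS-CoV-2"))
```

Storing the relation on each edge keeps the graph queryable by relation type, for example to list everything a given entity "causes".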