amritbulusu / wikidrugs

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

wikidrugs

Extraction of medical drugs side effects from Wikipedia

Project description • Developed and automated the process to extract all possible side effects for medical drugs articles from Wikipedia by using Google NL API to retrieve all possible entities based on the sailence score. • Preprocessed and extracted all possible patterns of side effects from 7 million lines of text using regular expressions, xml, json parsers of python. (Python, Scrapy, Google NL API, HTML, Excel).

About


Languages

Language:Python 73.6%Language:Jupyter Notebook 26.4%