![logo](https://github.com/incubated-geek-cc/WebScraper/raw/main/img/logo.png?raw=true)
๐ Web Scraper
๐ ๏ธ Retrieves HTML text content from https://dexur.com/icd/search/ and returns a JSON formatted response
Runs on Node.js Express framework. ๐ Request proxy setup.
๐ Try it yourself (where query = "panic"
)
Live Demo :: Link
Live Demo :: Backup Link
๐งฐ Run on localhost
- Run
npm install
to install all node dependencies - Double-click file
startup.sh
- Navigate to localhost:5000 and test API
โ Read related post here
๐ Features
- Parses HTML content with
jsdom
- Minifies retrieved HTML text with
html-minifier
(optional) - Traverse the HTML node(s) for raw data extraction
- Formats extracted data into structured JSON formatted data called via a GET API
๐ Preview (e.g. query
= "mood")
โ Join me on ๐ Medium at ~ ฮพ(