Tara-Morovatdar / html_parser

includes readability and mercury parser

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

html_parser

Download and install npm : https://www.npmjs.com/get-npm

Readability:

1-Install the node part of this package by running: npm install.

2- Install the requirements for the Python part of this package by running: pip install -r requirements.txt

open Readability_parser.ipynb change the path and run it

Mercury:

1- npm install @postlight/mercury-parser

2- pip install pixiedust

3- pip install pixiedust_node

open mercury_parser.ipynb change the path

About

includes readability and mercury parser

License:MIT License


Languages

Language:HTML 92.5%Language:Python 4.0%Language:JavaScript 2.6%Language:Jupyter Notebook 0.9%Language:Dockerfile 0.0%