scttnlsn / readability

Port of arc90labs-readability with rust

Home Page:http://readability-rs.herokuapp.com/web/index.html

Repository from Github https://github.comscttnlsn/readabilityRepository from Github https://github.comscttnlsn/readability

readability-rs

readability-rs is a library for extracting the primary readable content of a webpage. This is a rust port of arc90's readability project. inspired by kingwkb/readability.

Hot to use

  • Add readability to dependencies in Cargo.toml
[dependencies]
readability = "^0"
  • Then, use it as below
extern crate readability;
use readability::extractor;

fn main() {
  match extractor::scrape("https://spincoaster.com/chromeo-juice") {
      Ok(product) => {
          println!("------- html ------");
          println!("{}", product.content);
          println!("---- plain text ---");
          println!("{}", product.content);
      },
      Err(_) => println!("error occured"),
  }
}

Demo

Visit demo page.

Related Projects

License

MIT

About

Port of arc90labs-readability with rust

http://readability-rs.herokuapp.com/web/index.html

License:MIT License


Languages

Language:Rust 100.0%