bratao / hero

The web browser that’s nearly impossible for bot blockers to block

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Ulixee Hero

Hero is a web browser built for scraping.

  • Built for scraping - it's the first modern headless browsers designed specifically for scraping instead of just automated testing.
  • Designed for web developers - We've recreated a fully compliant DOM directly in NodeJS allowing you bypass the headaches of previous scraper tools.
  • Powered by Chrome - The powerful Chrome engine sits under the hood, allowing for lightning fast rendering.
  • Emulates any modern browser - BrowserEmulators make it easy to disguise your script as practically any browser.
  • Avoids detection along the entire stack - Don't be blocked because of TLS fingerprints in your networking stack.

Check out our website for more details.

Installation

npm i --save ulixee

or

yarn add ulixee

Usage

Hero provides access to the W3C DOM specification without the need for Puppeteer's complicated evaluate callbacks and multi-context switching:

const Hero = require('@ulixee/hero');

(async () => {
  const hero = new Hero();
  await hero.goto('https://example.org');
  const title = await hero.document.title;
  const intro = await hero.document.querySelector('p').textContent;
  await hero.close();
})();

Browse the full API docs.

Contributing

We'd love your help in making Hero a better tool. Please don't hesitate to send a pull request.

License

MIT

About

The web browser that’s nearly impossible for bot blockers to block

License:MIT License


Languages

Language:TypeScript 94.7%Language:JavaScript 3.0%Language:Go 1.4%Language:HTML 0.7%Language:Dockerfile 0.1%Language:CSS 0.1%Language:Shell 0.0%