Jan Čurn (jancurn)



Company:@apify

Location:Prague, Czech Republic

Home Page:apify.com/jancurn

Twitter:@jancurn



Organizations
apify

Jan Čurn's repositories

actor-find-broken-links

Source code of an Apify actor that finds and reports broken links on a website. Unlike other SEO analysis tools, it also reports broken URL #fragments.

Language:JavaScript | License:Apache-2.0 | Stargazers:6 | Issues:3 | Issues:10

actor-analyze-domains

An Apify actor that crawls web pages from a list of provided domains and analyzes them. For example, it checks whether pages have an HTTPS version and saves their HTML content, screenshot, HTTP response headers, SSL certificate information, text body, outgoing links, emails, phone numbers, social handles and more.

Language:JavaScript | License:Apache-2.0 | Stargazers:4 | Issues:3 | Issues:1

act-pdf-to-html

Converts PDF to HTML using the pdf2htmlEX tool.

Language:JavaScript | License:Apache-2.0 | Stargazers:2 | Issues:3 | Issues:0

actor-residential-proxy-probe

Probes Apify residential proxies and maintains a pool of proxies from specific ZIP codes or DMAs

Language:JavaScript | Stargazers:2 | Issues:4 | Issues:0

awesome-puppeteer

A curated list of awesome Puppeteer resources.

Stargazers:2 | Issues:0 | Issues:0

actor-amazon-crawler

Amazon crawler. This configuration extracts items for the keywords you specify in the input and automatically crawls all result pages for each keyword. You can specify multiple keywords in the input for a single run.

Language:JavaScript | Stargazers:1 | Issues:0 | Issues:0

actor-metadata-extractor

An Apify actor that crawls a list of web pages and extracts various metadata from them.

Language:JavaScript | Stargazers:1 | Issues:2 | Issues:0

act-probe-page-resources

An Apify act that loads web pages and analyzes the HTTP resources they request.

Language:JavaScript | License:Apache-2.0 | Stargazers:0 | Issues:0 | Issues:0

actor-selenium-custom-firefox

An Apify actor with a custom build of Firefox, instrumented using Selenium.

Language:JavaScript | Stargazers:0 | Issues:2 | Issues:0

awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

Language:Makefile | License:NOASSERTION | Stargazers:0 | Issues:2 | Issues:0

bson-ext

The C++ BSON parser for the Node.js MongoDB driver.

Language:JavaScript | License:Apache-2.0 | Stargazers:0 | Issues:1 | Issues:0

iron-router

A client and server side router designed specifically for Meteor.

Language:JavaScript | License:MIT | Stargazers:0 | Issues:2 | Issues:0

llama_index

LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLMs with external data.

Language:Python | License:MIT | Stargazers:0 | Issues:0 | Issues:0
Language:HTML | Stargazers:0 | Issues:2 | Issues:0

puppeteer

Headless Chrome Node API

Language:JavaScript | License:Apache-2.0 | Stargazers:0 | Issues:2 | Issues:0

stayinghomeclub

A list of all the companies working from home and events changed because of COVID-19.

Language:Ruby | License:CC0-1.0 | Stargazers:0 | Issues:0 | Issues:0

www

The mitmproxy website, https://mitmproxy.org/.

Language:CSS | Stargazers:0 | Issues:1 | Issues:0

yclist

List and descriptions of Y Combinator companies.

Language:Ruby | Stargazers:0 | Issues:2 | Issues:0