apify

There are 8 repositories under apify topic.

crawlee
apify / crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify automation crawler crawling headless headless-chrome javascript nodejs npm playwright puppeteer scraper scraping typescript web-crawler web-crawling web-scraping
Language:TypeScript 20505
apify / crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify automation beautifulsoup crawler crawling headless headless-chrome pip playwright python scraper scraping web-crawler web-crawling web-scraping hacktoberfest
Language:Python 7146
apify / apify-cli
Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
apify command-line hacktoberfest headless-chrome puppeteer serveless
Language:TypeScript 163
apify / apify-sdk-js
Apify SDK monorepo
actor apify javascript nodejs sdk typescript
Language:TypeScript 163
apify-sdk-python
apify / apify-sdk-python
The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
apify automation python scraping sdk
Language:Python 151
apify / actor-scraper
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
web-scraping apify
Language:JavaScript 128
apify / apify-client-python
Apify API client for Python
api apify client python scraping
Language:Python 82
superryeti / Hands-on-WebScraping
This repo is a part of blog series on several web scraping projects where we will explore scraping techniques to crawl data from simple websites to websites using advanced protection.
apify crawler nodejs puppeteer python requests scrapy
Language:Python 82
VaclavRut / actor-amazon-crawler
Amazon crawler - this configuration will extract items for a keywords that you will specify in the input, and it will automatically extract all pages for the given keyword. You can specify more keywords on the input for one run.
amazon-com amazon-crawler amazon-de amazon-extractor apify apify-cli apify-proxy apify-sdk extract-items
Language:JavaScript 77
maxCopell / tripadvisor-scraper
Scrape Tripadvisor restaurant, hotels, and places.
tripadvisor-scraper apify scraper tripadvisor
Language:JavaScript 50
MrXujiang / crawel
基于Apify+node+react搭建的有点意思的爬虫平台
node puppeteer crawler apify react react-hooks umi umi3
Language:JavaScript 41
JuroOravec / crawlee-one
Professional scrapers that provide full control to the users. Crawlee One builds on top of Crawlee and Apify and extends them with features for robust and highly configurable web scrapers.
actor apify crawlee crawler framework scraper scraping web
Language:TypeScript 33
apify / super-scraper
Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!
api apify cheerio javascript nodejs playwright scraping typescript web-scraping
Language:TypeScript 32
actor-youtube-scraper
bernardro / actor-youtube-scraper
Apify actor to scrape Youtube search results. You can set the maximum videos to scrape per page as well as the date from which to start scraping.
apify apifier crawler search youtube pupetteer
Language:JavaScript 26
apify / actor-content-checker
You can use this act to monitor any page's content and get a notification when content changes.
apify content-selector web-scraping
Language:JavaScript 22
sauermar / web-browser-recorder
Web application for recording, management and editing of inteligent RPA workflows using Playwright technology
playwright react browser automation apify material-ui typescript user-friendly
Language:TypeScript 19
metalwarrior665 / actor-google-sheets
No more dealing with Google API. Simple Node.js program to automate access to Google Sheets.
spreadsheet apify-google apify google-sheets google-sheets-api
Language:JavaScript 18
lhotanok / actor-ticketmaster-scraper
Apify actor for scraping events from Ticketmaster based on their categories
apify apify-proxy concert-api concert-tickets scraper ticketmaster ticketmaster-api
Language:JavaScript 15
pocesar / actor-shopify-scraper
Automate monitoring prices on the most popular solution for building online stores and selling products online. Crawl arbitrary Shopify-powered online stores and extract a list of all products in a structured form, including product title, price, description, etc
apify apify-sdk javascript scraper scraping shopify
Language:JavaScript 15
apify / apify-zapier-integration
Apify integration for Zapier
zapier apify web-scraping api
Language:JavaScript 14
devblack / curlx
CurlX a basic Curl syntax
apify curl get-php http-tunnel luminati php-curl php-library php7 proxy-checker scraper scraping socks tarball
Language:PHP 14
metalwarrior665 / actor-rust-scraper
Experimental scraper in Rust suited for running locally or on the Apify platform. Inspired by Apify SDK.
apify rust web-scraper
Language:Rust 13
apify / actor-scrapy-executor
Apify actor to run web spiders written in Python in the Scrapy library
scrapy apify scrapy-spiders
Language:Python 12
metalwarrior665 / actor-article-extractor-smart
Combines Apify's crawling system and article parsing with unfluff library.
apify article-extractor actor scraper web-scraper
Language:JavaScript 12
pocesar / actor-twitter-scraper
Scrape any Twitter user profile. Extract tweets, retweets, replies, favorites, and conversation threads with no Twitter API limits
apify apify-sdk scraper twitter twitter-scraper
Language:TypeScript 12
lhotanok / zalando-scraper
Apify actor extracting data from Zalando
apify crawlee crawler data-extraction ecommerce fashion fashion-dataset scraper web-scraping zalando zalando-dataset zalando-scaper
Language:TypeScript 11
petrpatek / airbnb-scraper
Apify public actor for scraping Airbnb homes.
crawler scrape apify airbnb airbnb-api data-extraction
Language:JavaScript 11
pocesar / apify-login-session
Grab a session for any website for usage on your own actor
actors apify apify-sdk automation cookies form localstorage login puppeteer sessionstorage
Language:TypeScript 11
apify-projects / store-website-checker
Analyzes target website for anti-scraping protections and performance. Saves screenshots/HTML snapshots.
scraper apify
Language:TypeScript 10
apify / actor-legacy-phantomjs-crawler
The actor implements the legacy Apify Crawler product. It uses PhantomJS headless browser to recursively crawl websites and extract data from them using a piece of JavaScript code.
phantomjs web-scraping apify web-crawler headless-browsers
Language:JavaScript 9
cermak-petr / act-anti-captcha-recaptcha
Apify act for solving google recaptcha using the anti-captcha.com service.
anti-captcha apify node-js recaptcha
Language:JavaScript 8
Nikolay-Lysenko / servifier
An easy-to-use tool for making web service with API from your own Python functions.
api-maker web-service model-to-production ml-engineering apify
Language:Python 8
orgupdate / Apify-Linkedin-Jobs-Scraper
The latest and most advanced LinkedIn Job Scraper. Our LinkedIn Jobs Scraper extracts real-time job postings at scale from all over the world. A new research tool built for recruitment, insights and HR.
job-board jobs linkedin-jobs linkedin-jobs-scraper linkedin-scraper jobsdata apify apify-actor linkedin market-intelligence market-research analytics api hr recruitment google-jobs-posting google google-jobs-listings
Language:JavaScript 7
ScaleLeap / zine-not-amazon-scraper
How to Scrape Amazon Search Results
amazon scraping apify apify-api
Language:JavaScript 7
orgupdate / Apify-Google-Jobs-Scraper
The latest and most advanced Google Job Scraper. Our Indeed, Linkedin, and Google Jobs Scraper rolled into one. This scraper extracts real-time job postings at scale from any active Google Jobs search results from all over the world. A new research tool built for recruitment, insights and HR.
google google-api google-jobs google-jobs-listings job-board jobdata jobs jobsearch market-intelligence google-jobs-posting apify-actor apify market-research analytics api hr recruitment googlesearch insights google-jobs-scraper
Language:JavaScript 6
orgupdate / Apify-Indeed-Jobs-Scraper
The latest and most advanced Indeed Job Scraper. Our Indeed Jobs Scraper extracts real-time job postings at scale from all over the world. A new research tool built for recruitment, insights and HR.
indeed-scraper indeed-scraping jobs-search indeed-jobs apify-actor indeed apify market-intelligence market-research analytics api hr recruitment google google-jobs google-jobs-listings google-jobs-posting jobdata google-job-posting jobs
Language:JavaScript 6

apify

apify / crawlee

apify / crawlee-python

apify / apify-cli

apify / apify-sdk-js

apify / apify-sdk-python

apify / actor-scraper

apify / apify-client-python

superryeti / Hands-on-WebScraping

VaclavRut / actor-amazon-crawler

maxCopell / tripadvisor-scraper

MrXujiang / crawel

JuroOravec / crawlee-one

apify / super-scraper

bernardro / actor-youtube-scraper

apify / actor-content-checker

sauermar / web-browser-recorder

metalwarrior665 / actor-google-sheets

lhotanok / actor-ticketmaster-scraper

pocesar / actor-shopify-scraper

apify / apify-zapier-integration

devblack / curlx

metalwarrior665 / actor-rust-scraper

apify / actor-scrapy-executor

metalwarrior665 / actor-article-extractor-smart

pocesar / actor-twitter-scraper

lhotanok / zalando-scraper

petrpatek / airbnb-scraper

pocesar / apify-login-session

apify-projects / store-website-checker

apify / actor-legacy-phantomjs-crawler

cermak-petr / act-anti-captcha-recaptcha

Nikolay-Lysenko / servifier

orgupdate / Apify-Linkedin-Jobs-Scraper

ScaleLeap / zine-not-amazon-scraper

orgupdate / Apify-Google-Jobs-Scraper

orgupdate / Apify-Indeed-Jobs-Scraper