datascraping

There are 19 repositories under the datascraping topic.

UltimaHoarder / UltimaScraper
Scrape all the media from an OnlyFans account - Updated regularly
archive datascraping onlyfans scraper
Language:Python 4184
Tanu-N-Prabhu / Python
This repository helps you learn Python and Machine Learning from scratch.
python jupyter-notebook pandas-dataframe numpy python3 python-3 numpy-arrays data datascraping dataanalysis google-colab google-colab-notebook machine-learning prediction data-analysis data-visualization machine-learning-algorithms
Language:Jupyter Notebook 1791
Avnsx / fansly-downloader
Easy to use fansly.com content downloading tool. Written in python, but ships as a standalone Executable App for Windows too. Enjoy your Fansly content offline anytime, anywhere in the highest possible content resolution! Fully customizable to download in bulk or single: photos, videos & audio from timeline, messages, collection & specific posts 👍
cross-platform database datascraping downloader fansly fansly-download fansly-downloader fansly-scraper gui image-download linux macos open-source portable python reddit scraper video video-download windows
Language:Python 1358
datawhores / OF-Scraper
A completely revamped and redesigned fork, reimagined from scratch based on the original onlyfans-scraper
datascraping downloader fansite onlyfans scraping
Language:Python 899
benibela / xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
xquery xml html json xpath cli command-line http web rest css-selector wget curl httpie xmlstarlet webscraper webscraping scraper datascraping data-processing
Language:Pascal 818
scrapfly / scrapfly-scrapers
Scalable Python web scraping scripts for 40+ popular domains
crawling python crawler scraping web-scraping web-scraping-python antibot automation captcha-bypass crawling-python datascraping proxies python-scraper scraper scraping-python spider twitter-scraper web-crawler webscraper webscraping
Language:Python 743
sim0n00ps / OF-DRM
C# console app to download DRM protected videos from Onlyfans accounts
datascraping of-dl of-drm onlyfans
Language:C# 208
DwarfThief / Raspagem-de-dados-para-iniciantes
Data scraping for beginners using Scrapy and other basic libraries
estudo python scrapy jupyter-notebook opensource web-crawler spyder webcrawling raspagem-de-dados datascraping hacktoberfest
Language:Python 136
Gertje823 / Vinted-Scraper
This is a tool that scrapes/downloads images and data from Vinted & Depop using the API and stores the data in a SQLite database.
database datascraping depop downloader python python3 scraper sqlite sqlite3 vinted
Language:Python 113
jordon31 / OnlySnap
Scrape content from OnlyFans #onlyfans -- #of-scr -- #onlyfans scrape -- #onlyfans-dl -- OnlyFans content downloader -- #of scrap -- #onlysnap
onlyfans python scraper of-scr onlyfans-scr onlyfans-scrape scrape-onlyfans archive datascraping onlyfans-dl onlysnap
Language:Python 96
castlelemongrab / parlance
A minimum-dependency ECMAScript client library and CLI tool for Parler – a "free speech" social network that accepts real money to buy "influence" points to boost organic non-advertising content
data-science datascience datascraping disinformation es7 hatespeech javascript law-enforcement misinformation node nodejs osint parlance parler social-media social-networks speech twitter
Language:JavaScript 70
kennymkchan / funko-pop-data
Open-source database of all Funko Pop data.
funko-pop database datascraping opensource csv json-data contributions-welcome scraper puppeteer funko-pop-scrapper open-source funko
Language:JavaScript 58
arbuzovv / rusquant
Official version of rusquant package for R
r datasource finance trading investing-api finam investing cryptocurrency dividends ipo earnings splits data-science quant quantitative-finance dataset datascience datascraping stocks package
Language:R 45
jwillmer / web-scraper-chrome-extension
Web data extraction tool implemented as chrome extension
webscraper webscraping datascraping chrome-extension
Language:JavaScript 28
Reljod / Python-Data-Scraping-IMDb-Movie-site-using-BeautifulSoup-Series-1-
Data Scraping using Python BeautifulSoup
beautifulsoup datascience datascraping python webscraping
Language:Jupyter Notebook 25
yuis-ice / jseval
Evaluate JavaScript on a URL through headless Chrome browser.
command-line headless-browser web-browser browser-automation pupeteer headless-browsers cmdline commandline-interface cli-utilities eval evaluator scrapers datascraping scrapping data-scraping webscrapping web-crawling scrapper website-scraper web-scrapping
Language:JavaScript 25
Agenty / scrapingai
Build web scraping agents using AI to auto-extract data from websites, capture screenshots, generate PDFs from URLs, and crawl the web with Agenty
crawler crawling datascraping extract-data scraping webscraper webscraping
Language:TypeScript 21
agnosto / fansly-recorder
Record fansly streams live and upload to remote using rclone
datahoarding datascraping rclone rclone-backup fansly fansly-downloader
Language:Python 21
dimitryzub / hotels-scraper-js
Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.
webscraping airbnb booking data datascraping hotels hotels-api playwright puppeteer puppeteer-extra
Language:JavaScript 18
kanishkan91 / Python-DataUpdate-DataProcessor-kbn
This Python module scrapes and processes data from different sources. It can output data either as a dataframe in country-year format or as Excel files. The module was created primarily to process data for the International Futures (IFs) Project, but it can be used to process data in general. It supports the following sources: 1) World Bank World Development Indicators (WDI) 2) UNESCO Education indicators (UIS) 3) FAO Food Balance Sheets (FAO) 4) IMF Global Finance Statistics (IMF GFS) 5) Health data from the Institute for Health Metrics and Evaluation (IHME) 6) Water data from FAO AQUASTAT 7) Energy data from EIA. Currently the module runs as is on Windows. On Macs, the user may have to change the lines of code that specify paths.
data python dataprocessing concordance indexing datascraping fao imf aquastat wdi uis eia ihme
Language:Python 15
sahilbhange / Facebook-Data-Extraction
#DataPipeLine #ETL - A Facebook data extraction utility that extracts publicly available data on Facebook. It uses the Facebook Graph API and Python to extract the data and load it into CSV files for further analysis.
facebook-api facebook-graph-api facebook datascraping facebook-data-extract
Language:Python 13
easonlai / playstore_reviews_scraping_and_text_analytics
This is a demo repo that demonstrates how to scrape app review data from the Google Play Store using Python with the Google-Play-Scraper library, and then use Azure Text Analytics to perform sentiment analysis on the review content (aka comments).
python python3 datascraping data-scraping text-analytics google-play-store google-play-store-scraper google-play-store-data-analysis azure-text-analytics azure-text-analysis azure microsoft-azure microsoft-cognitive-services sentiment-analysis pandas seaborn
Language:Jupyter Notebook 11
ice-wzl / DataReaper
DataReaper is a powerful Python tool designed to harvest data from publicly accessible HTTP servers. It combines the capabilities of Shodan search with web scraping techniques to efficiently gather information from targeted websites.
data-visualization datascience datascraping osint osint-python osint-tool python3 redteam vulnerability
Language:Python 11
kennymkchan / nba-topshot-scraper
Node script that will use Selenium to scrape card information from NBA Topshot including card names, rarity, and lowest cost at the moment. Data is scraped once per day.
open-source scraper opensource nft nba-topshot topshot nba datascraping database json-data contributions-welcome nft-database puppeteer nba-topshot-scraper node analytics
Language:JavaScript 11
scrape-do / scrapedo-scrapers
Web scraping examples with Scrape.do 😎
antibot crawler datascraping proxies python scraper spider web-crawler web-scraping webscraper
Language:Python 11
Data-Horde / ytcc-archive
archiving community contributions on YouTube: unpublished captions, title and description translations and caption credits
youtube heroku closed-captions caption-contributions archiving datascraping community-contributions
Language:Python 9
VirginiaTech / pyvt
A Python API for the VT timetable of classes
datascraping python virginia-tech webscraping
Language:HTML 8
DeDeDeDer / Personal_Projects
This holds all my personal data-related projects (Automation, Modelling, Analysis)
python3 datascience datascraping insurance-claims actuarial-science actuarial-statistics modelling-framework predictive-modeling excelvba claims-reserving exploratory-data-analysis feature-engineering
Language:Python 7
lavgen / WikileaksAPI-project
javasctipt nodejs wikileaks datascraping mongodb mongoose information-overload sony count leak-groups
Language:JavaScript 7
LynnFernandes23 / Movie-Recommedation-System
I developed a sophisticated movie recommendation system using Python, leveraging key libraries such as Pandas, NumPy, Scikit-Learn, and Natural Language Toolkit (NLTK). The system utilizes data scraping techniques to gather movie information and employs advanced data visualization techniques for insightful analysis.
data-visualization datascraping excel nltk-python numpy pandas pyhton scikit-learn
Language:Jupyter Notebook 5
cchrisnguyen / FlightRadar24
A shell script for scraping FlightRadar24's flight tracking data.
datascraping flightradar24
Language:Shell 4
dimitryzub / py-google-scholar-organic-cite-to-csv-sqlite
Scrape historic Google Scholar Organic and Cite results to CSV, MySQL Lite using Python and SerpApi.
webscraping serpapi python data datascraping webscraper googlescholar google scraper dataextraction datamining sqlite csv dataset datascience
Language:Python 4
greeshmasunil10 / LottoMaxAnalyserBE
A tool for analyzing the results of the Canadian Lotto Max lottery
python beautifulsoup datascraping matplotlib numpy react analysis lotto-numbers lottomax
Language:Python 4
kennymkchan / greater-toronto-area-housing-data
Data scraped from various sites for housing data around the greater Toronto area (GTA). Scrapes happen daily and data is in both JSON and CSV formats. Free to use for analysis.
data housing-prices toronto-open-data data-scraping public-data real-estate data-mining csv json toronto open-source contributions-welcome datascraping housing-data housing-dataset
4
LynnFernandes23 / Loksabha-Election-2024-Analysis-Through-Power-BI
This repository hosts interactive dashboards and detailed data visualizations that provide insights into the 2024 Indian parliamentary elections. Utilizing Power BI, we've analyzed voter demographics, electoral results, constituency-wise trends, and more, offering a comprehensive view of the election dynamics.
datascraping datavisualization excel powerbi
4
TheOwaisShaikh / Langchainwebsitescraper
Extract product details from WooCommerce sites using the langchain web extraction library and OpenAI's GPT models.
chatgpt chatgpt-api datascraping ecommerce-website extraction langchain-python open-ai python scraping-python scraping-web scraping-websites woocommerce langchain-data-scraping langchian-web-extraction langchian-web-scraping wocommerce-product-scraping woocommerce-website-scraping
Language:Python 4
