There is 1 repository under the web-parsing topic.
Web scraper with a simple REST API, running in Docker and using a headless browser and Readability.js for parsing.
Parser that retrieves a list of gaming laptops and their components from dns-shop.ru as xlsx, csv, xml, and json files and adds the data as a table to a PostgreSQL database.
Telegram Music Bot written in Python.
Parser that retrieves a list of gaming laptops and their components from eldorado.ru as xlsx, csv, xml, and json files.
Predict short-term stock prices based on stock price history from the first half of 2020, COVID-19 cases, and related stock news. The goals are to implement machine learning models with TensorFlow, data processing, and a RESTful API. My contribution is Apple and Tesla stock prediction from NASDAQ.
Group project to create a currency Telegram bot
English-Chinese Translation Scraper - Sinatra, Ruby, ERB
Telegram bot that monitors and purchases NFTs using Selenium automation.
Gets the reception hospital in Linz from a public listing and outputs it in a machine-readable format.
Мой ЯГТУ (My YSTU): mobile access to the YSTU personal account.
AI Product Hack 2024 repo: service for recognizing product “infomodels” and generating descriptions and summaries based on them
Parser script that gets information about a word from the https://dictionary.goo.ne.jp explanatory dictionary
Imprint is a lightweight, declarative DOM scraping library for extracting structured data from web pages. Define JSON-like schemas to easily map and extract data from complex websites.
Tracks changes to webpages and sends Telegram notifications
Scraping Flipkart Books Title, Price and Rating and importing into MySQL Database
Simple data optimization program for data entry/analysis
Web scraping with BeautifulSoup in Python to build a dataframe of Safest Cars ratings from the Latin NCAP website for 2020-2023 and for years before 2020.
Extracts images from a website using puppeteer and wget
Preconfigured project for scraping websites using Selenium + Python
Gets the COVID-19 stats from the Austrian government and outputs them in a machine-readable format.
Signs in to Instagram and follows everyone who follows the same account as you, automated using Selenium & Python
A scraper using Phoenix in Elixir
Fast and asynchronous web crawling and scraping library for Python.
This repository contains the code that I wrote in one of my Geocomputing courses (Geog573). It contains my customized K-means clustering and demonstrates my ability to use a machine learning package (Scikit-learn).
Spell corrector for the medical domain in Scala, consisting of modules to build a medical word corpus and correct misspelled words.
Telegram-Bot-Interface — Socially Significant Project on TeleBot & BeautifulSoup4
Amazon Products Review Scraping for Natural Language Processing
Spider Search Engine: a search engine written in C++ for web parsing and data search. Includes a web spider (SpiderApp) for collecting data from websites and an HTTP server (HttpServerApp) for searching a PostgreSQL database. Uses Boost, libpqxx, MyHTML, inih, and OpenSSL.