90h / user-agent_strings_db

Python scraper that grabs user agent strings and populates them into sqlite/JSON files

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

#User Agent Srings DB Scraper

This is a python script which will scrape two web sources and combine them into sqlite3 and json file.

I couldn't find a simple database to sort by crawler, OS, or version type so that's why I worte this. I'm also learning python and gave me a good opportunity to practice. It's not the best script but it does the job.

Screenshot

Viewing the database

Screenshot of database on OSX using SQLProSQLite

Requirements

  • Python 2.7+
  • BeautifulSoup4
  • User_agents

Installation

Using either pip or easy_install:

easy_install user_agents
easy_install beautifulsoup4

Running

python uadown.py

The final output files will be combined.json and useragents.sqlite.

About

Python scraper that grabs user agent strings and populates them into sqlite/JSON files


Languages

Language:Python 100.0%