asross / babynames

Data visualization of US baby name popularity

Home Page: http://babyname-visualization.s3-website-us-east-1.amazonaws.com/

Popular Baby Names in the US

This repo contains a data visualization of baby name popularity, based on data from the US Social Security Administration. It includes the code needed to scrape and parse the baby name popularity data from the ssa.gov website, analyze and augment it with extra features such as similar names, and present it as a webpage with charts built with d3.

To Get The Data

The easiest way to get the data is to go to the visualization website and use the download links at the very end of the page, which provide it as JSON. To generate it from scratch instead, clone this repository and, with Ruby and curl installed, run ruby download.rb to re-download the data by scraping the Social Security Administration's website. Then run ruby parse.rb, which parses the downloaded HTML pages, computes the closest names, and compresses everything into a format optimized for sending to the browser. Finally, cd into visualization/ and run the Middleman app there to view the visualization in your local browser.
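This README does not say how parse.rb decides which names are "closest". As an illustration only, here is a minimal sketch that assumes closeness means spelling similarity measured by Levenshtein (edit) distance. The helper names edit_distance and closest_names are hypothetical and are not taken from this repository, which may well use a different measure.

```ruby
# Hypothetical sketch: rank names by Levenshtein (edit) distance.
# ASSUMPTION: the repository's parse.rb may use a different similarity measure.

# Dynamic-programming edit distance between two strings, ignoring case.
def edit_distance(a, b)
  a = a.downcase
  b = b.downcase
  # prev[j] is the distance between the processed prefix of a and the first j chars of b.
  prev = (0..b.length).to_a
  a.each_char.with_index(1) do |char_a, i|
    curr = [i] # turning i chars of a into the empty string takes i deletions
    b.each_char.with_index(1) do |char_b, j|
      cost = char_a == char_b ? 0 : 1
      curr << [prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost].min
    end
    prev = curr
  end
  prev.last
end

# Return the `limit` names closest to `target`, excluding the target itself.
# Ties in distance are broken alphabetically so the result is deterministic.
def closest_names(target, names, limit: 3)
  names
    .reject { |name| name.casecmp?(target) }
    .min_by(limit) { |name| [edit_distance(target, name), name] }
end
```

For example, closest_names("Anna", %w[Anna Anne Hannah Bob Ana], limit: 2) returns Ana and Anne, each one edit away from Anna. Comparing every name against every other name is quadratic in the number of names, so a real pipeline over the full dataset would probably want to precompute these lists once, which matches the README's description of doing this work in the parse step rather than in the browser.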

Languages

CoffeeScript 44.8%, Haml 25.3%, Ruby 17.1%, CSS 6.7%, JavaScript 4.7%, HTML 1.4%