rafaveguim / vis-passwords-lang

An interactive display comparing the language used in passwords and common language.

Visualizing Language in Passwords

Click here to try it live!

This is an interactive display comparing the language used in passwords with common language. I parsed the 32 million passwords of the RockYou leak and built a frequency distribution of the words found. The parallel coordinates plot shows the 500 words whose frequency in passwords deviates most from their frequency in common language, as represented by the British National Corpus (BNC), according to the G2 (log-likelihood) measure.
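To make the ranking step concrete, here is a minimal sketch of how a G2 score could be computed for each word and used to pick the most deviating ones. It assumes the two-corpus log-likelihood formulation that is common in corpus linguistics; the text above names only "the G2 measure", so the exact variant used in this project may differ. The function names `g2` and `topDeviating`, and the field names `pw` and `ref`, are hypothetical and chosen for illustration.

```javascript
// Assumption: two-corpus log-likelihood (G2), as commonly used in corpus linguistics.
// a = count of the word in the password corpus
// b = count of the word in the reference corpus (e.g. the BNC)
// c = total word tokens in the password corpus
// d = total word tokens in the reference corpus
function g2(a, b, c, d) {
  // Expected counts if the word were equally frequent in both corpora.
  const e1 = (c * (a + b)) / (c + d);
  const e2 = (d * (a + b)) / (c + d);
  // A zero observed count contributes nothing (limit of x * ln(x) as x -> 0).
  const term = (obs, exp) => (obs > 0 ? obs * Math.log(obs / exp) : 0);
  return 2 * (term(a, e1) + term(b, e2));
}

// Hypothetical helper: return the n words with the highest G2 score.
// `counts` is an array of { word, pw, ref } objects.
function topDeviating(counts, c, d, n) {
  return counts
    .map((row) => ({ ...row, score: g2(row.pw, row.ref, c, d) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, n);
}
```

A call such as `topDeviating(counts, totalPasswordTokens, totalBncTokens, 500)` would then yield a list comparable to the 500 words shown in the plot. Note that G2 itself is unsigned: it measures how strongly a word deviates, not in which direction.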

Each word is represented by a polyline, and its color tells whether it appears more (blue) or less (brown) frequently in passwords than in the BNC. You can click a word (polyline) to see the most frequent passwords containing it in the left pane. You can also invert an axis by clicking it, reorder the axes by dragging them horizontally (use the axis title on top), or filter along an axis by clicking and dragging vertically.
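The color encoding and the vertical filtering described above can be sketched as two small pure functions, independent of any drawing library. This is an illustration under stated assumptions, not the project's actual code: the names `lineColor` and `filterByBrushes`, the fields `pwFreq` and `refFreq`, and the specific color values are all hypothetical.

```javascript
// Assumed color values; the text only says "blue" and "brown".
const COLORS = { over: "steelblue", under: "saddlebrown" };

// Color a word by whether its relative frequency is higher in passwords
// (pwFreq) than in the reference corpus (refFreq).
function lineColor(row) {
  return row.pwFreq > row.refFreq ? COLORS.over : COLORS.under;
}

// Keep only the rows whose value lies inside every active brush.
// `brushes` maps an axis name to a [min, max] extent; axes without an
// entry are unconstrained, which mirrors dragging on only some axes.
function filterByBrushes(rows, brushes) {
  return rows.filter((row) =>
    Object.entries(brushes).every(
      ([axis, [min, max]]) => row[axis] >= min && row[axis] <= max
    )
  );
}
```

For example, `filterByBrushes(rows, { pwFreq: [0.001, 0.01] })` would keep only the polylines that pass through the selected range on a hypothetical `pwFreq` axis.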

Languages

JavaScript 68.8%, HTML 20.8%, CSS 10.4%