m-rey / nickname-and-diminutive-names-lookup

A CSV file containing US given names (first names) and their associated nicknames or diminutive names.

nickname-and-diminutive-names-lookup

A simple CSV file containing US given names (first names) and their associated nicknames or diminutive names.

This lookup file was initially created by mining this genealogy page. Because the lookup originates from a dataset used for genealogy purposes, it contains old names that aren't commonly used these days, but there are recent ones as well. Example pairs are "gregory" and "greg", or "geoffrey" and "geoff". Significant effort also went into making the list machine readable: entries are separated with commas and human conventions are removed, so that a shorthand like "rickie(y)" becomes two separate names, "rickie" and "ricky".
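The cleanup step for shorthand like "rickie(y)" can be sketched in Python. This is only an illustration, not the method used to build the list: the function name `expand_variants` is hypothetical, and the rule that the parenthesised ending replaces the trailing vowels of the base name is an assumption that happens to fit the "rickie(y)" example.

```python
import re

# Matches a name followed by a parenthesised alternative ending, e.g. "rickie(y)".
_VARIANT = re.compile(r"^([a-z]+)\(([a-z]+)\)$")


def expand_variants(entry):
    """Expand shorthand such as 'rickie(y)' into separate names.

    Assumed heuristic (not taken from the project): the alternative
    ending replaces the trailing vowels of the base name.
    """
    cleaned = entry.strip().lower()
    match = _VARIANT.match(cleaned)
    if match is None:
        # No shorthand present, so the entry is already a single name.
        return [cleaned]
    base, ending = match.groups()
    stem = base.rstrip("aeiouy")  # "rickie" -> "rick"
    return [base, stem + ending]


print(expand_variants("rickie(y)"))
```

Entries without parentheses pass through unchanged, so the function can be applied to every cell of a raw list. Shorthand that follows a different convention would need its own rule.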

There are Java, Perl, Python, and R parsers provided for convenience.
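As a rough idea of what such a parser does, here is a minimal Python sketch. It is not the parser shipped with the project. It assumes each CSV row holds a given name followed by its nicknames, which may not match the actual file layout, and the function name `load_nicknames` is hypothetical. The sample rows reuse the name pairs mentioned above.

```python
import csv
import io
from collections import defaultdict


def load_nicknames(lines):
    """Build a {given_name: set_of_nicknames} lookup from CSV rows.

    Assumed layout: first column is the given name, the remaining
    columns are its nicknames.
    """
    lookup = defaultdict(set)
    for row in csv.reader(lines):
        names = [cell.strip().lower() for cell in row if cell.strip()]
        if len(names) < 2:
            continue  # skip blank rows and names without nicknames
        given, *nicknames = names
        lookup[given].update(nicknames)
    return dict(lookup)


# In-memory sample standing in for the real file.
sample = io.StringIO("gregory,greg\ngeoffrey,geoff\n")
lookup = load_nicknames(sample)
print(lookup["gregory"])
```

To read a file on disk instead, pass an open file handle, for example `open(path, newline="")`, in place of the `StringIO` object.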

This is a relatively large list with roughly 1600 names. Any help cleaning up the list or adding to it is greatly appreciated.

This project was created by Old Dominion University - Web Science and Digital Libraries Research Group. More information about the creation of this lookup can be found here.

About

License:Apache License 2.0


Languages

Perl 53.2%, R 27.3%, Java 10.9%, Python 8.5%