SajjadPourali / Surnames

Surnames dispersion around the world which sorted by population

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Surname Dispersion Dataset

This repository provides comprehensive data on surname dispersion around the world, sorted by population, in two different formats. The dataset includes information on the distribution of surnames across various countries and is based on data collected by crawling locatefamily.com in July 2019.

Applications

The dataset in this repository can be utilized for a wide range of applications, including:

  • Machine Learning: The dataset can be used for training machine learning models to predict or classify surnames based on geographic location and surnames popularity.
  • Recommender Systems: It can be utilized in recommender systems to enhance personalized recommendations for users.
  • Forensic Analysis: Understanding surname dispersion can be crucial for forensic investigations, might help to identify potential suspects country.

Usage

To use this dataset, simply download the relevant data files in the desired format (i.e., CSV, TXT) from the repository. The data can then be incorporated into your own research, analysis, or machine learning projects.

Please note that the data in this repository is based on crawling locatefamily.com in July 2019 and may not reflect the most up-to-date information.

License

The dataset in this repository is provided under the MIT file, which outlines the terms and conditions for using and distributing the data.

About

Surnames dispersion around the world which sorted by population

License:MIT License


Languages

Language:Roff 100.0%