Results seem more like re-ordered names vs variations
wsiegelman opened this issue · comments
This is a great concept and would be very useful. However, the only variations I see are
- first name, last name
- first name initial, last name
- last name, first name
Should the tool be called name re-ordering vs variations? I assumed variations would show results like
Dmytro Firtash or Dmytrii or Dimitry
Igor Kolomoisky or Ihor or Kolomoiskii or Kolomoisky or Kolomoiskiy
etc.
And this is a very basic one:
Joe Biden or Joseph Biden
![Screen Shot 2023-07-30 at 2 05 36 PM](https://private-user-images.githubusercontent.com/140974956/257068444-92eb7793-6938-4a45-98c4-b9d589f615b6.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTUxNDUxMzksIm5iZiI6MTcxNTE0NDgzOSwicGF0aCI6Ii8xNDA5NzQ5NTYvMjU3MDY4NDQ0LTkyZWI3NzkzLTY5MzgtNGE0NS05OGM0LWI5ZDU4OWY2MTViNi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNTA4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDUwOFQwNTA3MTlaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1jY2IzMDU0ZTc4ZTZhNGQ5NzliZThhNGE3OTE3Y2Q2NjdjMjlkZDczZDNlMWJiM2QwN2U5YWE5NDM5YTA2YTlhJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.QQIRp6V-cDVO8hX-ZbffO4WAWwwq-Y5_ARQUO3ytUlg)
![Screen Shot 2023-07-30 at 2 06 01 PM](https://private-user-images.githubusercontent.com/140974956/257068446-078e46ea-aa77-49c2-8074-e2ed704b9c69.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTUxNDUxMzksIm5iZiI6MTcxNTE0NDgzOSwicGF0aCI6Ii8xNDA5NzQ5NTYvMjU3MDY4NDQ2LTA3OGU0NmVhLWFhNzctNDljMi04MDc0LWUyZWQ3MDRiOWM2OS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNTA4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDUwOFQwNTA3MTlaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kNGI1N2YyNmM4Y2QwZGU0ZDFiMWEyNTgxZjgxYzRjNzhmNzcyMTNlYTkyMWM1MzgwYzY5NDNjY2M2NjcxYjk4JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.uwx2EawfHZq6cLuxABJlvbHfGXeASZqYgD_Vo3JJtFo)
Hi @wsiegelman. You're right. There should be more variations included in the result. Rest assured, the current version is only the bare minimum for what might be useful to a researcher, and I hope to add more features to it soon. 😄
Also, I see you're new to github. Welcome to the world of collaborative software development! I'd like to offer some tips to you and others for contributing to this project, so I wrote them up here: CONTRIBUTING.md
Once you've read that, I would encourage you to open a new issue on the alias-generator codebase. There are already a few open issues about alternative spellings, but none with these specific examples or Ukrainian names in general, which would probably be quite useful to many researchers.
We now have Joe -> Joseph and similar in alias-generator 4.0.0.
Follow bellingcat/alias-generator#8 for work on spelling variations.