Documentation of the project.
Updated with 30000 name dataset test results. The first version of the documentation (with the tests made with the smaller 10000 name dataset) can be downloaded here.
Note: If you want to execute the program, the jupyter notebook is available here.