Text Similarity Analysis of Major Religious Books

This Jupyter notebook tool was developed to enable the visualisation of similarities and differences of the major texts from many of the worlds largest religions. It also includes non-religious works from approximately similar periods (of translation) as a benchmark. While the tool is focused on religious works, it is general enough to be applied to be used to visualize the comparison of books from any genre.

more ...









Kaggle Titanic survival competition

Kaggle Titanic competition - SVM and Random Forest entries. The key to good results was creating the right features and then tuning the classifiers, then back to the features and finally a re-tune of the classifiers. Arguably the classifiers are too finely tuned and a 'real' result should be about 1% less than that submitted.

more ...