-
Python
Yelp: Reverse geocoding businesses to extract detailed location information
I’ve been playing around with the Yelp Open Dataset and wanted to extract more detailed location information for each business.…
Read More » -
JavaScript
Leaflet: Fit polyline in view
I’ve been playing with the Leaflet.js library over the Christmas holidays to visualise running routes drawn onto the map using…
Read More » -
Python
scikit-learn: Using GridSearch to tune the hyper-parameters of VotingClassifier
In my last blog post I showed how to create a multi class classification ensemble using scikit-learn’s VotingClassifier and finished…
Read More » -
Python
scikit-learn: Building a multi class classification ensemble
For the Kaggle Spooky Author Identification I wanted to combine multiple classifiers together into an ensemble and found the VotingClassifier…
Read More » -
Python
Python: Learning about defaultdict’s handling of missing keys
While reading the scikit-learn code I came across a bit of code that I didn’t understand for a while but…
Read More » -
Python
Python: Combinations of values on and off
In my continued exploration of Kaggle’s Spooky Authors competition, I wanted to run a GridSearch turning on and off different…
Read More » -
Python
scikit-learn: Creating a matrix of named entity counts
I’ve been trying to improve my score on Kaggle’s Spooky Author Identification competition, and my latest idea was building a…
Read More » -
Python
Python: polyglot – ModuleNotFoundError: No module named ‘icu’
I wanted to use the polyglot NLP library that my colleague Will Lyon mentioned in his analysis of Russian Twitter…
Read More » -
Python
Python 3: TypeError: unsupported format string passed to numpy.ndarray.__format__
This post explains how to work around a change in how Python string formatting works for numpy arrays between Python…
Read More »