Python
-
scikit-learn: Using GridSearch to tune the hyper-parameters of VotingClassifier
In my last blog post I showed how to create a multi class classification ensemble using scikit-learn’s VotingClassifier and finished…
-
scikit-learn: Building a multi class classification ensemble
For the Kaggle Spooky Author Identification I wanted to combine multiple classifiers together into an ensemble and found the VotingClassifier…
-
Python: Learning about defaultdict’s handling of missing keys
While reading the scikit-learn code I came across a bit of code that I didn’t understand for a while but…
-
Python: Combinations of values on and off
In my continued exploration of Kaggle’s Spooky Authors competition, I wanted to run a GridSearch turning on and off different…
-
scikit-learn: Creating a matrix of named entity counts
I’ve been trying to improve my score on Kaggle’s Spooky Author Identification competition, and my latest idea was building a…
-
Python: polyglot – ModuleNotFoundError: No module named ‘icu’
I wanted to use the polyglot NLP library that my colleague Will Lyon mentioned in his analysis of Russian Twitter…
-
Python 3: TypeError: unsupported format string passed to numpy.ndarray.__format__
This post explains how to work around a change in how Python string formatting works for numpy arrays between Python…
-
Using gRPC in Python
gRPC is an HTTP/2-based Remote Procedure Call (RPC) framework that uses protocol buffers (protobuf) as the underlying data serialization framework.…
-
How to create a Google Drive App in Flask
This is the first in a series of articles for web programmers that explain in detail about using the Google…