-
Python
Python: Scraping elements relative to each other with BeautifulSoup
Last week we hosted a Game of Thrones based intro to Cypher at the Women Who Code London meetup and…
Read More » -
Python
Python: Regex – matching foreign characters/unicode letters
I’ve been back in the land of screen scrapping this week extracting data from the Game of Thrones wiki and…
Read More » -
Python
Python: Squashing ‘duplicate’ pairs together
As part of a data cleaning pipeline I had pairs of ids of duplicate addresses that I wanted to group…
Read More » -
Python
Python: Parsing a JSON HTTP chunking stream
I’ve been playing around with meetup.com’s API again and this time wanted to consume the chunked HTTP RSVP stream and…
Read More » -
Python
Neo4j: Loading JSON documents with Cypher
One of the most commonly asked questions I get asked is how to load JSON documents into Neo4j and although…
Read More » -
Python
Python: Extracting Excel spreadsheet into CSV files
I’ve been playing around with the Road Safety open data set and the download comes with several CSV files and…
Read More » -
Python
Python: Converting WordPress posts in CSV format
Over the weekend I wanted to look into the WordPress data behind this blog (very meta!) and wanted to get…
Read More » -
Python
Python: Look ahead multiple elements in an iterator/generator
As part of the BBC live text scraping code I’ve been working on I needed to take an iterator of…
Read More » -
Python
Python: Refactoring to iterator
Over the last week I’ve been building a set of scripts to scrape the events from the Bayern Munich/Barcelona game…
Read More »