Articles

Command-line Basics and Tips

A collection of tips to help you on the command-line using Linux, Mac OS X or other unixy command-line system. You might already know most but hopefully there are a few new ones or helpful tips on how to save time and use in productive ways. I’ve tried to include real and useful examples for […]

Python Data Science

My notes, resources and examples using Python, NumPy, SciPy and Matplotlib as alternatives to R and Matlab for data science and analysis. Load Data from Text File An example loading comma delimited data using Numpy: Plotting and Graphing Log Scale Labels for Titles and Axes Saving a Graph The following will create a png image […]

Python Date Formatting

A set of examples working with Python date and time functions, including formatting dates, date calculations, and other common date tasks. See my string format examples for working with python strings. First off, all examples use the following import, any additional imports needed will be shown with the example. Creating Date Objects Typically the first […]

Python String Format Cookbook

Python v2.7 introduced a new string fomatting method, that is now the default in Python3. I started this string formatting cookbook as a quick reference to help me format numbers and strings. Thanks to other contributors I’ve expanded the examples over time. Python 3.6 introduced, formatted string literals, often referred to as f-strings as another […]

xkcd graph style in d3

I saw an article on Hacker News about duplicating the xkcd hand-drawn graph style saw I thought I would give it a shot using d3 javascript library. I have very limited experience with d3 but think I came up with something relatively close. View live example, or See Source View source of the live example […]

Use Python + Selenium to Automate Web Timing

I’ve been hearing a lot recently about the Navigation Timing spec, which sets a multitude of timing events as javascript properties. There are numerous events in the flow, from the very first navigation event, which could be when the user clicks a search result in Google, to DNS timing to Dom parsing etc. See the […]

Strata Data Conference Recap 2012

This article is a recap of the Strata 2012 data conference I went to. I did some previous analysis on the attendee directory, so I am also including the results of the connectedness change before and after the conference. Update on Twitter Attendee Analysis From the list of twitter attendees generated before the conference, I […]

Strata 2012 Attendee Analysis

The Strata Conference is next week, so I was browsing the attendee directory curious to see who I know is going or looking for interesting companies to meet with. This got me thinking about the overall connectedness of the attendee list. So… A few scripts later and a bit of analysis I produced the following […]

Running Stats 2011

Being the data minded guy that I am, here are stats and numbers around my running in 2011. This year I ran my first marathon as well as a couple of other races. Summary Total Miles: 532.2 miles Total Time: 80 hrs 10min 58 secs Avg Pace: 9:02/mi Total Days Run: 87 days Races Date […]

Infographic design, give tables a chance

There has been a recent backlash against infographics lately, justified as this graphic illustrates. Infographics have lost their way, their main purpose should be to make data easier to read by adding a visual perspective to the information shown. For example, using visuals can show the difference in sizes or distance better than abstract numbers. […]

Taste of R: An Introduction

Introduction R is a statistical programming language and environment, it is open source and available on most platforms. R is not a replacement of Java, C, Perl, Python or other common language; R is a specific tool for data calculations, manipulations and graphing. If you have a programming background, R can be a great replacement […]

World Revolutions data visualization

The recent events in Tunisia and Egypt got me thinking about revolutions across history. They tend to cluster together both in geography and time. So being on the current data visualization kick I’m on, it sounded like a fun little project. The following is a world map highlighting revolutions going forward in time. The data […]