Reduced Data Exploration Time

The Problem A team of data analysts working with Python and Pandas has ran into productivity issues with their data exploration tasks on important large datasets. The running time of analyses was in the order of minutes, which was unacceptable for interactive analysis. The Analysis Our analysis uncovered several technical difficulties that reduced the productivity […]

Status Statistics Tell Just Part of the Story

TL;DR: Status statistics, such as the coronavirus-tracking daily counts (e.g. of patients ventilated or deceased), are commonly used but are inherently insufficient without additional transition statistics. Audience: data scientists and their managers, experiment designers, data journalists, and anyone with a critical approach to statistics. Read time: 7 minutes. Status statistics are commonly reported in a […]

Scroll to top