Category: Data

  • Privacy puzzles

    Privacy puzzles

    I contributed a database reconstruction attack demonstration to the book Practical Data Privacy by my colleague Katharine Jarmul. While we might think anonymous summary data is safe to share, this attack demonstrates it’s possible to dramatically reduce the search space for re-identification, in this case from half a trillion quadrillion possibilities to just one! My…

  • Perspectives edition #27

    Perspectives edition #27

    I was thrilled to contribute to Thoughtworks Perspectives edition #27: Power squared: How human capabilities will supercharge AI’s business impact. There are a lot of great quotes from my colleagues Barton Friedland and Ossi Syd in the article, and here’s one from me: The ability to build or consume solutions isn’t necessarily going to be…

  • Electrifying the world with AI Augmented decision-making

    Electrifying the world with AI Augmented decision-making

    I wrote an article about optimising the design of EV charging networks. It’s a story of work done by a team at Thoughtworks, demonstrating the potential of AI augmented decision-making (including some cool optimisation techniques), in this rapidly evolving but durably important space. We were able to thread together these many [business problem, AI techniques,…

  • Humour me – DRY vs WRY

    Humour me – DRY vs WRY

    Don’t Repeat Yourself (DRY) is a tenet of software engineering, but – humour me – let’s consider some reasons Why to Repeat Yourself (WRY). LEGO reuse lessons In 2021, I wrote a series of posts analysing LEGO® data about parts appearing in sets to understand what it might tell us about reuse of software components…

  • I did it my way – hand-rolled navigation with open spatial data

    I did it my way – hand-rolled navigation with open spatial data

    Sure commercial maps app directions are great, but have you ever found the customisation options limited? What if you want to use bike paths and back streets when cycling, or avoid winding roads that might make backseat passengers car-sick on a road trip? The paved route OpenStreetMap and OpenRouteService do provide this type of functionality,…

  • 22 rules of generative AI

    22 rules of generative AI

    Thinking about adopting, incorporating or building generative AI products? Here are some things to think about, depending on your role or roles. I assume you’re bringing your own product idea(s) based on an understanding of an opportunity or problems for customers. These rules therefore focus on the solution space. Solutions with generative AI typically involve…

  • Synthesising Semantle Solvers

    Synthesising Semantle Solvers

    Picking up threads from previous posts on solving Semantle word puzzles with machine learning, we’re ready to explore how different solvers might play along with people while playing the game online. Maybe you’d like to play speed Semantle against an artificially intelligent opponent, maybe you’d like a left-of-field hint on a tricky puzzle, or maybe…

  • Second Semantle Solver

    Second Semantle Solver

    In the post Sketching Semantle Solvers, I introduced two methods for solving Semantle word puzzles, but I only wrote up one. The second solver here is based the idea that the target word should appear in the intersection between the cohorts of possible targets generated by each guess. To recap, the first post: Solution source…

  • Data Mesh Radio

    Data Mesh Radio

    I joined Scott Hirleman for an episode (#95) of the Data Mesh Radio podcast. Scott does great work connecting and educating the data mesh community, and we had fun talking about: Fitness functions to define “what good looks like” for data mesh and guide the evolution of analytic data architecture and operating model Team topologies…

  • Data mesh: a lean perspective

    Data mesh: a lean perspective

    Data mesh can be understood as a response to lean wastes identified in data organisations. I paired with Ned Letcher to present this perspective at the LAST Conference 2021, which was much delayed due to COVID restrictions. Lean wastes including overproduction, inventory, etc, may be concealed and made more difficult to address by centralised data…