Category: Data
-
EMLT Q&A
A fun Q&A with Thoughtworks on the drivers, key messages and writing process for Effective Machine Learning Teams (EMLT) with my fellow authors Ada and David. It’s neat to be featured alongside all the other many great books from Thoughtworks authors. Find the book, trial and purchase options at O’Reilly, and find yourself a nice…
-
Dealing with data inventory
Data held by businesses is often described as an asset. This can be misleading or even incorrect. In any case, data managed inappropriately leaves value on the table, inflates cost, reduces responsiveness, and creates risk. Some data held by businesses would better be described as inventory. It might one day be a true asset, but…
-
Effective Machine Learning Teams
I’m very excited to be writing a book with my colleagues David Tan and Ada Leung. The topic and title Effective Machine Learning Teams was born from our combined work on team technical and delivery practices, and wider organisational patterns, applied to developing machine learning applications. The book has two landing pages where you can…
-
7 wastes of data production – when pipelines become sewers
I recently had the chance to present an updated version of my 7 wastes of data production talk at DataEngBytes Melbourne 2023. I think the talk was stronger this time around and I really appreciated all the great feedback from the audience. Check out the video below and the slides. Thanks to Peter Hanssens and…
-
Privacy puzzles
I contributed a database reconstruction attack demonstration to the book Practical Data Privacy by my colleague Katharine Jarmul. While we might think anonymous summary data is safe to share, this attack demonstrates it’s possible to dramatically reduce the search space for re-identification, in this case from half a trillion quadrillion possibilities to just one! My…
-
Perspectives edition #27
I was thrilled to contribute to Thoughtworks Perspectives edition #27: Power squared: How human capabilities will supercharge AI’s business impact. There are a lot of great quotes from my colleagues Barton Friedland and Ossi Syd in the article, and here’s one from me: The ability to build or consume solutions isn’t necessarily going to be…
-
Electrifying the world with AI Augmented decision-making
I wrote an article about optimising the design of EV charging networks. It’s a story of work done by a team at Thoughtworks, demonstrating the potential of AI augmented decision-making (including some cool optimisation techniques), in this rapidly evolving but durably important space. We were able to thread together these many [business problem, AI techniques,…
-
Humour me – DRY vs WRY
Don’t Repeat Yourself (DRY) is a tenet of software engineering, but – humour me – let’s consider some reasons Why to Repeat Yourself (WRY). LEGO reuse lessons In 2021, I wrote a series of posts analysing LEGO® data about parts appearing in sets to understand what it might tell us about reuse of software components…
-
I did it my way – hand-rolled navigation with open spatial data
Sure commercial maps app directions are great, but have you ever found the customisation options limited? What if you want to use bike paths and back streets when cycling, or avoid winding roads that might make backseat passengers car-sick on a road trip? The paved route OpenStreetMap and OpenRouteService do provide this type of functionality,…
-
22 rules of generative AI
Thinking about adopting, incorporating or building generative AI products? Here are some things to think about, depending on your role or roles. I assume you’re bringing your own product idea(s) based on an understanding of an opportunity or problems for customers. These rules therefore focus on the solution space. Solutions with generative AI typically involve…