Home¶

Paul Brabban, Lead Consultant at Equal Experts

With experience in software development, data engineering and machine learning, I specialise in data-intensive problems and decentralised data engineering at scale. My experience extends to leading teams, technical architecture and product development. Find out more about my experience and publications in my portfolio.

Contact me to see how I can help at paul@tempered.works.

Aug 4, 2025
in Insights
7 min read

UDAFs, stored procedures and more in dbt

A screenshot showing a stored procedure in the BigQuery console, with dbt compilation details in a comment block.

It's been over a year since I first wrote about managing UDFs using custom dbt materializations. The approach has held up well, but a recent project required me to go further and bring UDAFs (user-defined aggregate functions) and stored procedures into the fold. The same approach worked well, and dbt Labs are considering adding formal support for UDFs!

Jul 17, 2025
in Insights, Operations
3 min read

BigQuery, safer by default from September 2025

A snippet of an email from Google, showing the change in BigQuery project quota defaults

On September 1st 2025, Google will make BigQuery a lot safer by default, changing the default quotas for projects under the default on-demand pricing model. Instead of unlimited financial damage, the default for new projects will be around $1000 per day. Existing projects will be updated to a custom limit based on prior 30-day usage. No changes to existing limits.

Thanks to for supporting this content.

Jun 7, 2025
in Security, Operations
10 min read

GitHub Codespaces, one year later

A screenshot of the GitHub web UI option to create a new Codespace on main

Back in early 2024 I tried GitHub Codespaces, and quickly ditched my local dev setup entirely. This post shares my experience going cloud-native for development: benefits for onboarding, agility, and security, alongside the real-world snags like network dependency and unexpected billing quirks.

May 29, 2025
3 min read

GROUP BY ALL solves a really annoying SQL problem

Does your SQL still copy most of your columns from SELECT after GROUP BY?

Behold: GROUP BY ALL.

Thanks to for supporting this content.

May 11, 2025
in Operations
12 min read

My path to consultancy

A country road stretches off into the sunny, leafy distance as my little boy cycles his favourite route home from school

I had big doubts about becoming a consultant or contractor. Could I do it? Would I find work? Could I run my own business? Would I need to change who I am, wear a suit, or buy a briefcase? Seven years have flown by since I took the plunge, so I'm going to share my story in case it's helpful for you!

Mar 23, 2025
in Security, Operations
7 min read

Rethinking the guest network to improve my home network security

Network diagram showing the internet connected to a router, linked to four devices: tablet, mobile phone, laptop, and IoT device.

I believe that making my guest network my default network reduces the potential harm a compromised app or device can cause. What was my "trusted" network is now my "untrusted" network, with only a few low-risk devices that really need local network connectivity connected to it, isolated from other devices that matter like my phone and work laptop.

It's a simple change in how I use my home network, and does not require in-depth knowledge and experience. Having operated this way at home for a few months now, I've found little impact on day-to-day usability, but it's really helped me sleep better at night. I'll explain what I changed, why, and how I've adjusted my wider thinking on my home network security to take better advantage of this approach.

Thanks to for supporting this content.

Mar 8, 2025
in Insights, Operations, Performance
8 min read

Generating portable and user-friendly identifiers

A screenshot of the BigQuery console, with example SQL for generating an identfier from a string value as I outline below

I'll share how I generate unique identifiers from data in 2025, avoiding the pitfalls I've seen along the way. TL;DR: I'm using MD5 to produce a digest from a string or bytes value, then I'm using plain old hexadecimal encoding of that digest, specifying upper or lowercase for the alpha characters. This solution meets the needs I describe next.

Thanks to for supporting this content.

Feb 9, 2025
in Operations
7 min read

Using AWS billing to track down lost resources

My AWS bill was higher than I expected, and it wasn't immediately clear what was driving the cost. Here's how I tracked down the culprits.

Jan 18, 2025
in Insights
8 min read

Testing stored procedures

Update August 2025

Since writing this post, I've found a clean way to handle stored procedures and their outputs for dbt users by treating them as materializations. This makes testing them as straightforward as testing any other dbt model. Read more in UDAFs, stored procedures and more in dbt.

Whilst I've used and written about UDFs a lot, I can't recall ever having a reason to work with stored procedures. When I was asked how I'd go about wrapping them in automation and testing, I thought it'd be a good excuse to take a look and see how I might go about testing them!

Nov 18, 2024
in Insights
4 min read

Avoiding CAST ROW in AWS Athena SQL

The Trino issue that makes row aggregations so much easier to work with

There is more than one way to build a ROW in AWS Athena and the underlying Trino engine. It turns out I was doing it the verbose, brittle and really annoying way. Row subqueries are so much better!

Thanks to for supporting this content.