Max Halford ツ
Blog Links Bio

#data-eng

Lower your warehouse costs via DuckDB transpilation
2026-03-12 data-eng sql
Row level lineage at Carbonfact
2026-01-09 python data-eng
Thoughts on DuckLake
2025-06-09 data-eng
Minimizing the runtime of a SQL DAG
2025-02-08 data-eng python
A training set for bike sharing forecasting
2024-04-04 data-eng machine-learning
Efficient ELT refreshes
2023-12-01 data-eng
Sh*t flows downhill, but not at Carbonfact
2023-10-16 data-eng
For analytics, don't use dynamic JSON keys
2023-05-11 data-eng sql
A rant against dbt ref
2022-06-28 data-eng sql rant
Dashboards and GROUPING SETS
2021-09-10 data-eng sql
An overview of dataset time travel
2021-04-07 data-eng
A few intermediate pandas tricks
2020-08-17 data-eng
Finding fuzzy duplicates with pandas
2019-09-16 data-eng
A smooth approach to putting machine learning into production
2019-07-13 machine-learning data-eng
Skyline queries in Python
2019-05-21 data-eng
Kaggle icon
mail