Solving Détrak with brute force
optimization llm
Row level lineage at Carbonfact
python data-eng
No pain no startup
showerthought
Scraping Google Calendar events
python scraping
Warmshowers sparks joy
bike-touring showerthought
Do LLMs identify fonts?
llm scraping
Thoughts on DuckLake
data-eng
Minimizing the runtime of a SQL DAG
data-engineering python
Introducing icanexplain @ PyData Paris 2024
analytics-engineering python
LCA software: exit the matrix
sustainability python
Cutting up shoes to measure their footprint
sustainability data-science
A training set for bike sharing forecasting
data-eng machine-learning
Decomposing funnel metrics
data-science
Efficient ELT refreshes
data-eng
Measuring the carbon footprint of pizzas
sustainability python
Graph components with DuckDB
data-science sql
Online gradient descent written in SQL
online-machine-learning sql
Online active learning in 80 lines of Python
online-machine-learning
Are Airbnb guests less energy efficient than their host?
sustainability data-science
The future of River
online-machine-learning
Parsing garment descriptions with GPT-3
text-processing
NLP at Carbonfact: how would you do it?
text-processing
Matrix inverse mini-batch updates
online-machine-learning
A rant against dbt ref
data-eng sql rant
First IRL meetup with the River developers
online-machine-learning
Online machine learning with River @ GAIA
online-machine-learning
Fuzzy regex matching in Python
text-processing
OCR spelling correction is hard
text-processing
Comic book panel segmentation
image-processing
Online machine learning in practice @ PyData PDX
online-machine-learning
The online machine learning predict/fit switcheroo
online-machine-learning
Online machine learning in practice @ Applied AI
online-machine-learning
Online machine learning in practice @ LVMH
online-machine-learning
Web scraping, upside down
scraping
One year at Alan
job-log
Dashboards and GROUPING SETS
data-eng sql
Automated document processing at Alan
text-processing
Text classification by data compression
machine-learning text-processing
Reducing the memory footprint of a scikit-learn text classifier
machine-learning text-processing
What my PhD was about
job-log
Unsupervised text classification with word embeddings
machine-learning text-processing
Focal loss implementation for LightGBM
machine-learning
Our solution to the IDAO 2020 qualifiers
competitive-machine-learning
Machine learning for streaming data with creme
online-machine-learning
The benefits of online machine learning @ Quantmetry
online-machine-learning
The benefits of online machine learning @ Element AI
online-machine-learning
Skyline queries in Python
data-eng
Morellet crosses with JavaScript
generative-art
Streaming groupbys in pandas for big datasets
online-machine-learning
Target encoding done the right way
machine-learning
Stella triangles with JavaScript
generative-art
Unknown pleasures with JavaScript
generative-art
Halftoning with Go - Part 2
image-processing
Challenge Big Data @ TSE
competitive-machine-learning
Halftoning with Go - Part 1
image-processing
Predire la disponibilité des Velib' @ Toulouse Data Science Meetup
data-science machine-learning data-viz
Recursive polygons with JavaScript
generative-art
The Naïve Bayes classifier
machine-learning
An introduction to genetic algorithms
machine-learning