Too big to fail - a Beam Pattern for enriching a Stream using State and Timers

Posted on Tue 01 August 2023 in Apache Beam

The recording of my talk at the Beam Summit 2023, discussing a pattern for enriching streaming data using state and timers, is now available:


Playing the Long Game: Transforming Ricardo's Data Infrastructure with Apache Beam

Posted on Wed 03 August 2022 in Apache Beam

The recording of my talk at the Beam Summit 2022 about building real-time data pipelines, from running our own Apache Flink cluster on-premise to the fully managed GCP Dataflow service, is now available:


Apache Beam Case Study

Posted on Fri 03 December 2021 in Apache Beam

In a case study for Apache Beam, I described how the framework enables Ricardo to evolve into a smarter second-hand marketplace. You can find it on the Apache Beam website.


You belong together - detecting linked accounts at Ricardo

Posted on Fri 06 August 2021 in Apache Beam

The recording of my talk at the Beam Summit 2021 about one of our first production pipelines created with the Python SDK is now available on YouTube:


Four Apache Technologies Combined for Fun and Profit

Posted on Wed 26 August 2020 in Apache Beam

The recording of my talk at the Online Beam Summit 2020 is now available on YouTube:


From database dumps to streaming - Ricardo.ch's Beam journey

Posted on Tue 18 June 2019 in Apache Beam

My talk at the Beam Summit Europe 2019 is now available on YouTube: