GroupBy #27: Balancing HDFS DataNodes in the Uber DataLake, How Figma’s databases team lived to tell the scale

Plus: Building Meta’s GenAI Infrastructure, How to save millions by optimizing data pipeline shuffling

Vu Trinh

Mar 19, 2024

∙ Paid

This is GroupBy, where I share the resources I learn from people smarter than me in the data engineering field.

Not subscribed yet? Here you go:

Available for iOS and Android

👋 Hi, my name is Vu Trinh, a data engineer.
I enjoy reading good stuff (related to data and engineering), and this newsletter is my effort on the journey to seek the "good stuff" across the entire Internet.
Hope…

VuTrinh.

GroupBy #27: Balancing HDFS DataNodes in the Uber DataLake, How Figma’s databases team lived to tell the scale

Plus: Building Meta’s GenAI Infrastructure, How to save millions by optimizing data pipeline shuffling

This post is for paid subscribers