GroupBy #27: Balancing HDFS DataNodes in the Uber DataLake, How Figma’s databases team lived to tell the scale
Plus: Building Meta’s GenAI Infrastructure, How to save millions by optimizing data pipeline shuffling
This is GroupBy, where I share the resources I learn from people smarter than me in the data engineering field.
Not subscribed yet? Here you go:
👋 Hi, my name is Vu Trinh, a data engineer.
I enjoy reading good stuff (related to data and engineering), and this newsletter is my effort on the journey to seek the "good stuff" across the entire Internet.
Hope…