VuTrinh.
Subscribe
Sign in
Home
Become Membership
Sponsor me
Archive
About
Latest
Top
Discussions
The company that created Kafka is replacing it with a new solution
How did LinkedIn build Northguard, the new scalable log storage
Jul 3
•
Vu Trinh
35
Share this post
VuTrinh.
The company that created Kafka is replacing it with a new solution
Copy link
Facebook
Email
Notes
More
Partitioning and Clustering
8 minutes to understand the two most popular OLAP performance-optimized techniques.
Jul 1
•
Vu Trinh
22
Share this post
VuTrinh.
Partitioning and Clustering
Copy link
Facebook
Email
Notes
More
June 2025
If you're learning Apache Spark, this article is for you
A baseline for your Spark learning and research.
Jun 26
•
Vu Trinh
24
Share this post
VuTrinh.
If you're learning Apache Spark, this article is for you
Copy link
Facebook
Email
Notes
More
1
Stream Kafka Topic to the Iceberg Tables with Zero-ETL
A solution from AutoMQ: open-sourced + no need for ETL pipeline maintenance
Jun 19
•
Vu Trinh
32
Share this post
VuTrinh.
Stream Kafka Topic to the Iceberg Tables with Zero-ETL
Copy link
Facebook
Email
Notes
More
8.2 minutes, and you will understand how most data systems execute joins.
From Spark, Snowflake, to BigQuery, here is how joins are built on
Jun 17
•
Vu Trinh
21
Share this post
VuTrinh.
8.2 minutes, and you will understand how most data systems execute joins.
Copy link
Facebook
Email
Notes
More
2
How did Uber build their data infrastructure to serve 137 million monthly active users
With the help of Kafka, HDFS, Hudi, Spark, Flink, Pinot, and Presto
Jun 12
•
Vu Trinh
27
Share this post
VuTrinh.
How did Uber build their data infrastructure to serve 137 million monthly active users
Copy link
Facebook
Email
Notes
More
I spent 8 hours understanding Apache Spark's memory management
Here's everything you need to know
Jun 10
•
Vu Trinh
23
Share this post
VuTrinh.
I spent 8 hours understanding Apache Spark's memory management
Copy link
Facebook
Email
Notes
More
2
Why Parquet Is the Go-To Format for Data Engineers
With more practical lessons to help you with the data engineering journey
Published on Blog | luminousmen
•
Jun 5
4 Things To Keep In Mind As I Begin The Data Engineering Journey Again
To break into the field quickly and grow more efficiently.
Jun 3
•
Vu Trinh
16
Share this post
VuTrinh.
4 Things To Keep In Mind As I Begin The Data Engineering Journey Again
Copy link
Facebook
Email
Notes
More
May 2025
What is Apache Hive?
Why did Meta create it years ago, and why don't you see it anymore
May 29
•
Vu Trinh
33
Share this post
VuTrinh.
What is Apache Hive?
Copy link
Facebook
Email
Notes
More
1
Why do we need open table formats like Delta Lake or Iceberg?
The hope of data lake + table format = data warehouse
May 27
•
Vu Trinh
31
Share this post
VuTrinh.
Why do we need open table formats like Delta Lake or Iceberg?
Copy link
Facebook
Email
Notes
More
Hi, I have some news
I'm offering paid membership.
May 23
•
Vu Trinh
14
Share this post
VuTrinh.
Hi, I have some news
Copy link
Facebook
Email
Notes
More
8
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts