VuTrinh.
I spent 4 hours learning the architecture of BigQuery's storage engine
Vortex: BigQuery's Stream-Oriented Storage Engine (Part 1)
18 hrs ago
•
Vu Trinh
AutoMQ: Achieving Auto Partition Reassignment In Kafka Without Cruise Control
AutoMQ’s stateless brokers and its self-balancing feature
Nov 20
•
Vu Trinh
I spent 4 hours learning how Netflix operates Apache Iceberg at scale.
Iceberg: The Backbone of Netflix's Data Platform Architecture
Nov 16
•
Vu Trinh
How does Netflix ensure the data quality for thousands of Apache Iceberg tables?
The Write-Audit-Publish pattern with Iceberg Branches
Nov 12
•
Vu Trinh
I spent 8 hours relearning the Delta Lake table format
The format, Read/Write process, Concurrency, Data Mutation and more
Nov 9
•
Vu Trinh
DataHub: The Metadata Platform Developed at LinkedIn
How did LinkedIn manage the data catalog at scale?
Nov 5
•
Vu Trinh
I spent 8 hours learning the ClickHouse MergeTree Table Engine
Concepts, the Write/Read Process, Mutation, and Replication
Nov 2
•
Vu Trinh
October 2024
I spent 3 hours learning the overview of ClickHouse
The overview architecture
Oct 29
•
Vu Trinh
I spent 3 hours learning how Uber manages data quality.
From the standard to the data quality platform
Oct 26
•
Vu Trinh
How AutoMQ Reduces Nearly 100% of Kafka Cross-Zone Data Transfer Cost
Producing data with the broker in the same availability zone with S3 WAL, self-balancing, and leveraging Kafka rack-awareness
Oct 22
•
Vu Trinh
I spent 4 hours learning Apache Spark Resource Allocation
Spark's resource allocation mechanism and the two scheduling modes.
Oct 19
•
Vu Trinh
I spent 8 hours learning the details of the Apache Spark scheduling process.
Anatomy of a Spark job and the typical scheduling process.
Oct 15
•
Vu Trinh