Part One
Foundations of Data Systems
The first four chapters cover the fundamental ideas that apply to all data systems, whether they run on a single machine or are distributed across a cluster. These foundations are essential for understanding the more complex distributed systems discussed in later parts.
Chapters
Reliable, Scalable, and Maintainable Applications
Explore reliability, scalability, and maintainability, and what they mean for modern applications.
Data Models and Query Languages
Compare different data models and understand when to use each one.
Storage and Retrieval
Dive into storage engines and understand how databases lay out data on disk.
Encoding and Evolution
Learn how data is encoded as it flows between systems and how schemas evolve over time.
Recommended Learning Path
- 1
Start with Chapter 1
Understand the core principles of reliability, scalability, and maintainability. This sets the context for everything that follows.
- 2
Explore Data Models (Chapter 2)
Learn about different ways of modeling data and the trade-offs between them. Try the interactive data model comparisons.
- 3
Understand Storage and Encoding (Chapters 3-4)
Dive into how databases actually store and retrieve data, then see how data is encoded and how schemas evolve over time. Watch the visual simulations of LSM-trees and B-trees in action.
- 4
Move to Distributed Systems
Once you understand the foundations, proceed to Part Two to learn about the challenges of distributed data systems and the techniques for addressing them.