🏗️System Design

Data Modeling

Updated 2026-05-15

10 min read

Data Modeling

Introduction

Data modeling is a fundamental aspect of database design and development. It involves creating a blueprint or schema that defines how data will be structured, stored, and accessed within a database system. Effective data modeling ensures efficient data retrieval, storage optimization, and scalability. In this tutorial, we'll explore techniques for effective data modeling, focusing on relational databases.

Concept

Data modeling typically involves several key concepts:

Entities: These are real-world objects or concepts that you want to store in the database. For example, in a library system, entities might include "Books," "Authors," and "Members."
Attributes: These are properties of an entity. Continuing with the library example, attributes for the "Books" entity could be "Title," "ISBN," "AuthorID," and "PublicationDate."
Relationships: These define how different entities interact or relate to each other. For instance, a book is written by one author, indicating a one-to-many relationship between authors and books.
Normalization: This is the process of organizing data in a database to reduce redundancy and improve data integrity. Normalization involves breaking down tables into smaller, related tables and defining relationships between them.
Denormalization: In some cases, denormalization might be used to optimize read performance by intentionally duplicating or combining data across multiple tables.

Examples

Let's walk through a practical example of data modeling for a simple e-commerce system.

Step 1: Identify Entities and Attributes

For an e-commerce system, we might identify the following entities:

Customers
- CustomerID
- FirstName
- LastName
- Email
- PhoneNumber
Products
- ProductID
- Name
- Description
- Price
- CategoryID
Orders
- OrderID
- OrderDate
- CustomerID
- TotalAmount

Step 2: Define Relationships

Next, we define the relationships between these entities:

A customer can place multiple orders, so there is a one-to-many relationship between Customers and Orders.
Each order contains multiple products, so there is a many-to-many relationship between Orders and Products.

To handle the many-to-many relationship between Orders and Products, we introduce an intermediary table called "OrderDetails":

OrderDetails
- OrderDetailID
- OrderID (foreign key referencing Orders)
- ProductID (foreign key referencing Products)
- Quantity

Step 3: Create Tables

Now, let's create the SQL tables based on our data model:

SQL

1CREATE TABLE Customers (
2  CustomerID INT PRIMARY KEY,
3  FirstName VARCHAR(50),
4  LastName VARCHAR(50),
5  Email VARCHAR(100),
6  PhoneNumber VARCHAR(20)
7);
8 
9CREATE TABLE Products (
10  ProductID INT PRIMARY KEY,
11  Name VARCHAR(100),
12  Description TEXT,
13  Price DECIMAL(10, 2),
14  CategoryID INT
15);
16 
17CREATE TABLE Orders (
18  OrderID INT PRIMARY KEY,
19  OrderDate DATETIME,
20  CustomerID INT,
21  TotalAmount DECIMAL(10, 2),
22  FOREIGN KEY (CustomerID) REFERENCES Customers(CustomerID)
23);
24 
25CREATE TABLE OrderDetails (
26  OrderDetailID INT PRIMARY KEY,
27  OrderID INT,
28  ProductID INT,
29  Quantity INT,
30  FOREIGN KEY (OrderID) REFERENCES Orders(OrderID),
31  FOREIGN KEY (ProductID) REFERENCES Products(ProductID)
32);

Step 4: Querying the Data

With our tables in place, we can now perform queries to retrieve data. For example, to get all orders placed by a specific customer:

Terminal

SELECT * FROM Orders WHERE CustomerID = 1;

Output

OrderID | OrderDate       | CustomerID | TotalAmount
--------|-----------------|------------|------------
1       | 2023-01-01 10:00 | 1          | 59.99
2       | 2023-02-15 14:30 | 1          | 75.50

What's Next?

In the next section, we will explore distributed databases and how they differ from traditional relational databases in terms of architecture and data modeling techniques.

By understanding these concepts and following best practices for data modeling, you can design robust and efficient database systems that meet the needs of your applications.

🏗️System Design

Data Modeling

Updated 2026-05-15

10 min read

Data Modeling

Introduction

Concept

Data modeling typically involves several key concepts:

Entities: These are real-world objects or concepts that you want to store in the database. For example, in a library system, entities might include "Books," "Authors," and "Members."
Attributes: These are properties of an entity. Continuing with the library example, attributes for the "Books" entity could be "Title," "ISBN," "AuthorID," and "PublicationDate."
Relationships: These define how different entities interact or relate to each other. For instance, a book is written by one author, indicating a one-to-many relationship between authors and books.
Normalization: This is the process of organizing data in a database to reduce redundancy and improve data integrity. Normalization involves breaking down tables into smaller, related tables and defining relationships between them.
Denormalization: In some cases, denormalization might be used to optimize read performance by intentionally duplicating or combining data across multiple tables.

Examples

Let's walk through a practical example of data modeling for a simple e-commerce system.

Step 1: Identify Entities and Attributes

For an e-commerce system, we might identify the following entities:

Customers
- CustomerID
- FirstName
- LastName
- Email
- PhoneNumber
Products
- ProductID
- Name
- Description
- Price
- CategoryID
Orders
- OrderID
- OrderDate
- CustomerID
- TotalAmount

Step 2: Define Relationships

Next, we define the relationships between these entities:

A customer can place multiple orders, so there is a one-to-many relationship between Customers and Orders.
Each order contains multiple products, so there is a many-to-many relationship between Orders and Products.

To handle the many-to-many relationship between Orders and Products, we introduce an intermediary table called "OrderDetails":

OrderDetails
- OrderDetailID
- OrderID (foreign key referencing Orders)
- ProductID (foreign key referencing Products)
- Quantity

Step 3: Create Tables

Now, let's create the SQL tables based on our data model:

SQL

1CREATE TABLE Customers (
2  CustomerID INT PRIMARY KEY,
3  FirstName VARCHAR(50),
4  LastName VARCHAR(50),
5  Email VARCHAR(100),
6  PhoneNumber VARCHAR(20)
7);
8 
9CREATE TABLE Products (
10  ProductID INT PRIMARY KEY,
11  Name VARCHAR(100),
12  Description TEXT,
13  Price DECIMAL(10, 2),
14  CategoryID INT
15);
16 
17CREATE TABLE Orders (
18  OrderID INT PRIMARY KEY,
19  OrderDate DATETIME,
20  CustomerID INT,
21  TotalAmount DECIMAL(10, 2),
22  FOREIGN KEY (CustomerID) REFERENCES Customers(CustomerID)
23);
24 
25CREATE TABLE OrderDetails (
26  OrderDetailID INT PRIMARY KEY,
27  OrderID INT,
28  ProductID INT,
29  Quantity INT,
30  FOREIGN KEY (OrderID) REFERENCES Orders(OrderID),
31  FOREIGN KEY (ProductID) REFERENCES Products(ProductID)
32);

Step 4: Querying the Data

With our tables in place, we can now perform queries to retrieve data. For example, to get all orders placed by a specific customer:

Terminal

SELECT * FROM Orders WHERE CustomerID = 1;

Output

OrderID | OrderDate       | CustomerID | TotalAmount
--------|-----------------|------------|------------
1       | 2023-01-01 10:00 | 1          | 59.99
2       | 2023-02-15 14:30 | 1          | 75.50

What's Next?

In the next section, we will explore distributed databases and how they differ from traditional relational databases in terms of architecture and data modeling techniques.

By understanding these concepts and following best practices for data modeling, you can design robust and efficient database systems that meet the needs of your applications.