🗄️SQL & Databases

Denormalization

Updated 2026-05-15

10 min read

Denormalization

Introduction

In the world of database design, normalization is a process used to reduce data redundancy and improve data integrity. It involves organizing the fields and tables of a relational database to ensure that their dependencies are logical and consistent. However, in some cases, excessive normalization can lead to performance issues due to the need for multiple joins when retrieving data.

Denormalization is the process of intentionally adding redundancy to a database schema to optimize query performance. By duplicating some data or combining tables, denormalization can reduce the complexity of queries and improve their execution speed, especially in read-heavy applications.

Concept

Denormalization involves relaxing the strict rules of normalization to achieve better performance. This is typically done by merging tables that have been normalized or by adding redundant columns that store precomputed values. The goal is to minimize the number of joins required to retrieve data and reduce the overall query execution time.

Common Denormalization Techniques

Combining Tables: Merging related tables into a single table can reduce the need for joins.
Adding Redundant Columns: Storing duplicate information in multiple places can speed up read operations.
Precomputed Values: Calculating and storing values that are frequently used in queries (like sums, averages) can improve performance.

Trade-offs

While denormalization improves query performance, it also introduces trade-offs:

Increased Data Redundancy: More data duplication means more storage space is required.
Data Integrity Challenges: Maintaining consistency across redundant data can be complex and error-prone.
Complex Updates: Changes to the data may require updates in multiple places.

Examples

Let's explore some practical examples of denormalization using SQL.

Example 1: Combining Tables

Suppose we have two normalized tables, orders and customers, where each order is linked to a customer by a foreign key.

SQL

1CREATE TABLE customers (
2  customer_id INT PRIMARY KEY,
3  name VARCHAR(255),
4  email VARCHAR(255)
5);
6 
7CREATE TABLE orders (
8  order_id INT PRIMARY KEY,
9  customer_id INT,
10  order_date DATE,
11  amount DECIMAL(10, 2),
12  FOREIGN KEY (customer_id) REFERENCES customers(customer_id)
13);

To denormalize this schema, we can combine the orders and customers tables into a single table.

SQL

1CREATE TABLE order_details (
2  order_id INT PRIMARY KEY,
3  customer_name VARCHAR(255),
4  email VARCHAR(255),
5  order_date DATE,
6  amount DECIMAL(10, 2)
7);

When inserting data into the order_details table, we need to join the orders and customers tables.

SQL

1INSERT INTO order_details (order_id, customer_name, email, order_date, amount)
2SELECT o.order_id, c.name, c.email, o.order_date, o.amount
3FROM orders o
4JOIN customers c ON o.customer_id = c.customer_id;

Example 2: Adding Redundant Columns

Consider a scenario where we have a products table and an orders table. We want to optimize the retrieval of product names along with order details.

SQL

1CREATE TABLE products (
2  product_id INT PRIMARY KEY,
3  name VARCHAR(255),
4  price DECIMAL(10, 2)
5);
6 
7CREATE TABLE orders (
8  order_id INT PRIMARY KEY,
9  product_id INT,
10  quantity INT,
11  FOREIGN KEY (product_id) REFERENCES products(product_id)
12);

To denormalize this schema, we can add a product_name column to the orders table.

SQL

1ALTER TABLE orders
2ADD COLUMN product_name VARCHAR(255);

When inserting data into the orders table, we need to update the product_name column based on the corresponding product name in the products table.

SQL

1INSERT INTO orders (order_id, product_id, quantity, product_name)
2SELECT o.order_id, o.product_id, o.quantity, p.name
3FROM orders o
4JOIN products p ON o.product_id = p.product_id;

What's Next?

In this tutorial, we explored the concept of denormalization and how it can be used to improve query performance by adding redundancy. While denormalization offers significant benefits in terms of speed, it also introduces challenges related to data integrity and storage.

For further optimization techniques, you may want to explore other database design strategies such as indexing, partitioning, and caching. Additionally, understanding the specific requirements and usage patterns of your application will help you make informed decisions about when and how to denormalize your database schema.

In the next section, we will delve into more advanced SQL optimization techniques that can further enhance the performance of your queries.

🗄️SQL & Databases

Denormalization

Updated 2026-05-15

10 min read

Denormalization

Introduction

Concept

Common Denormalization Techniques

Combining Tables: Merging related tables into a single table can reduce the need for joins.
Adding Redundant Columns: Storing duplicate information in multiple places can speed up read operations.
Precomputed Values: Calculating and storing values that are frequently used in queries (like sums, averages) can improve performance.

Trade-offs

While denormalization improves query performance, it also introduces trade-offs:

Increased Data Redundancy: More data duplication means more storage space is required.
Data Integrity Challenges: Maintaining consistency across redundant data can be complex and error-prone.
Complex Updates: Changes to the data may require updates in multiple places.

Examples

Let's explore some practical examples of denormalization using SQL.

Example 1: Combining Tables

Suppose we have two normalized tables, orders and customers, where each order is linked to a customer by a foreign key.

SQL

1CREATE TABLE customers (
2  customer_id INT PRIMARY KEY,
3  name VARCHAR(255),
4  email VARCHAR(255)
5);
6 
7CREATE TABLE orders (
8  order_id INT PRIMARY KEY,
9  customer_id INT,
10  order_date DATE,
11  amount DECIMAL(10, 2),
12  FOREIGN KEY (customer_id) REFERENCES customers(customer_id)
13);

To denormalize this schema, we can combine the orders and customers tables into a single table.

SQL

1CREATE TABLE order_details (
2  order_id INT PRIMARY KEY,
3  customer_name VARCHAR(255),
4  email VARCHAR(255),
5  order_date DATE,
6  amount DECIMAL(10, 2)
7);

When inserting data into the order_details table, we need to join the orders and customers tables.

SQL

1INSERT INTO order_details (order_id, customer_name, email, order_date, amount)
2SELECT o.order_id, c.name, c.email, o.order_date, o.amount
3FROM orders o
4JOIN customers c ON o.customer_id = c.customer_id;

Example 2: Adding Redundant Columns

Consider a scenario where we have a products table and an orders table. We want to optimize the retrieval of product names along with order details.

SQL

1CREATE TABLE products (
2  product_id INT PRIMARY KEY,
3  name VARCHAR(255),
4  price DECIMAL(10, 2)
5);
6 
7CREATE TABLE orders (
8  order_id INT PRIMARY KEY,
9  product_id INT,
10  quantity INT,
11  FOREIGN KEY (product_id) REFERENCES products(product_id)
12);

To denormalize this schema, we can add a product_name column to the orders table.

SQL

1ALTER TABLE orders
2ADD COLUMN product_name VARCHAR(255);

When inserting data into the orders table, we need to update the product_name column based on the corresponding product name in the products table.

SQL

1INSERT INTO orders (order_id, product_id, quantity, product_name)
2SELECT o.order_id, o.product_id, o.quantity, p.name
3FROM orders o
4JOIN products p ON o.product_id = p.product_id;

What's Next?

In the next section, we will delve into more advanced SQL optimization techniques that can further enhance the performance of your queries.