Introduction to Database Normalization

Last Updated : 17 Apr, 2026

Normalization is an important process in database design that helps improve the database's efficiency, consistency, and accuracy. It makes it easier to manage and maintain the data and ensures that the database is adaptable to changing business needs.

  • It is the process of organizing the attributes of the database to reduce or eliminate data redundancy (having the same data in different places), which otherwise unnecessarily increases the size of the database.
  • Data redundancy leads to inconsistency problems during insert, delete, and update operations, making data management difficult.
  • Normalization involves splitting a table into multiple tables, which must be linked each time a query requires data from the split tables.
  • Before Normalization: The table is prone to redundancy and anomalies (insertion, update, and deletion).
  • After Normalization: The data is divided into logical tables to ensure consistency, avoid redundancy and remove anomalies making the database efficient and reliable.

Problems in the Employee_Department Relation

This section highlights common anomalies and redundancy issues that occur due to improper database design.

  1. Insertion Anomaly: If a new department is created but no employee is assigned to it yet, we cannot store its location because we need an employee record to insert.
  2. Update Anomaly: If the location of the HR department changes, we must update it in multiple rows (for both Nick Wise and Lily Case). If one row is missed, the data becomes inconsistent.
  3. Deletion Anomaly: If all employees in the IT department leave, we lose the department information, including its location.
  4. Data Redundancy: The department location is repeated for every employee in the same department.

Need of Normalization

The primary objective for normalizing the relations is to eliminate the below anomalies. Failure to reduce anomalies results in data redundancy, which may threaten data integrity and cause additional issues as the database increases.

  • Insertion Anomalies: Insertion anomalies occur when it is not possible to insert data into a database because the required fields are missing or because the data is incomplete. For example, if a database requires that every record has a primary key, but no value is provided for a particular record, it cannot be inserted into the database.
  • Deletion anomalies: Deletion anomalies occur when deleting a record from a database and can result in the unintentional loss of data. For example, if a database contains information about customers and orders, deleting a customer record may also delete all the orders associated with that customer.
  • Updation anomalies: Updation anomalies occur when modifying data in a database and can result in inconsistencies or errors. For example, if a database contains information about employees and their salaries, updating an employee’s salary in one record but not in all related records could lead to incorrect calculations and reporting.

Features of Database Normalization

This section highlights the key benefits of normalization in improving data organization, consistency, and overall database design.

  • Elimination of Data Redundancy: One of the main features of normalization is to eliminate the data redundancy that can occur in a database. Data redundancy refers to the repetition of data in different parts of the database. Normalization helps in reducing or eliminating this redundancy, which can improve the efficiency and consistency of the database.
  • Ensuring Data Consistency: Normalization helps in ensuring that the data in the database is consistent and accurate. By eliminating redundancy, normalization helps in preventing inconsistencies and contradictions that can arise due to different versions of the same data.
  • Simplification of Data Management: Normalization simplifies the process of managing data in a database. By breaking down a complex data structure into simpler tables, normalization makes it easier to manage the data, update it, and retrieve it.
  • Improved Database Design: Normalization helps in improving the overall design of the database. By organizing the data in a structured and systematic way, normalization makes it easier to design and maintain the database. It also makes the database more flexible and adaptable to changing business needs.
  • Standardization: Normalization helps in standardizing the data in the database. By organizing the data into tables and defining relationships between them, normalization helps in ensuring that the data is stored in a consistent and uniform manner.

Normal Forms in DBMS

This section explains the different normal forms used to structure databases and eliminate redundancy while ensuring data integrity.

Normal Forms

Description of Normal Forms

First Normal Form (1NF)

A relation is in first normal form if every attribute in that relation is single-valued attribute. 

Second Normal Form (2NF)

A relation that is in First Normal Form and every non-primary-key attribute is fully functionally dependent on the primary key, then the relation is in Second Normal Form (2NF).

Third Normal Form (3NF)

A relation is in the third normal form, if there is no transitive dependency for non-prime attributes as well as it is in the second normal form. A relation is in 3NF if at least one of the following conditions holds in every non-trivial function dependency X –> Y.

  • X is a super key.
  • Y is a prime attribute (each element of Y is part of some candidate key).

Boyce-Codd Normal Form (BCNF)

For BCNF the relation should satisfy the below conditions

  • The relation should be in the 3rd Normal Form.
  • X should be a super-key for every functional dependency (FD) X−>Y in a given relation. 

Fourth Normal Form (4NF)

A relation R is in 4NF if and only if the following conditions are satisfied: 

Fifth Normal Form (5NF)

 A relation R is in 5NF if and only if it satisfies the following conditions:

  • R should be already in 4NF. 
  • It cannot be further non loss decomposed (join dependency)
Comment

Explore