Topics

SQL Constraints

Database constraints are rules enforced by the database management system to ensure data integrity, accuracy, and consistency within your tables. Think of constraints as guardrails that prevent invalid or inconsistent data from being stored in your database. They act as automatic validators that check every data modification operation against predefined business rules.

Constraints serve multiple purposes in database design. They maintain referential integrity between related tables, ensure data quality by preventing invalid entries, enforce business rules automatically without requiring application logic, and provide the database optimizer with valuable information for query planning.

When you define constraints, you’re essentially building a contract between your application and the database about what constitutes valid data. This contract is enforced automatically, regardless of how the data enters the system, whether through application code, direct SQL statements, or data import processes.

NOT NULL Constraint

The NOT NULL constraint ensures that a column cannot contain empty values. This is one of the most fundamental constraints because it guarantees that essential data fields always have meaningful content.

When you apply a NOT NULL constraint to a column, the database engine will reject any INSERT or UPDATE operation that attempts to leave that column empty. This prevents situations where critical information might be missing, such as customer names, product prices, or employee identification numbers.

CREATE TABLE Employees (
    employee_id INT NOT NULL,
    first_name VARCHAR(50) NOT NULL,
    last_name VARCHAR(50) NOT NULL,
    email VARCHAR(100) NOT NULL,
    phone VARCHAR(15),
    hire_date DATE NOT NULL
);

In this example, essential fields like employee_id, names, email, and hire_date must always contain values, while the phone number is optional. This reflects real world business requirements where certain information is mandatory while other details might be collected later.

The NOT NULL constraint also interacts with default values. If you specify both a NOT NULL constraint and a default value for a column, the default value will be used when no explicit value is provided during insertion, ensuring the NOT NULL constraint is satisfied.

UNIQUE Constraint

The UNIQUE constraint ensures that all values in a column or combination of columns are distinct across all rows in the table. This prevents duplicate entries and maintains data uniqueness for business identifiers that should be one of a kind.

Unlike primary keys, a table can have multiple UNIQUE constraints, and UNIQUE columns can contain NULL values. However, most database systems allow only one NULL value per UNIQUE column, treating NULL as a distinct value for uniqueness purposes.

CREATE TABLE Customers (
    customer_id INT PRIMARY KEY,
    email VARCHAR(100) UNIQUE,
    social_security_number VARCHAR(11) UNIQUE,
    phone VARCHAR(15) UNIQUE,
    customer_code VARCHAR(20) UNIQUE
);

This table ensures that each customer has a unique email address, social security number, phone number, and customer code. If an application attempts to insert a customer with an email address that already exists in the table, the database will reject the operation and return an error.

UNIQUE constraints automatically create indexes on the constrained columns, which improves query performance when searching by those fields. This dual benefit of data integrity and performance optimization makes UNIQUE constraints valuable for columns that are frequently used in WHERE clauses.

PRIMARY KEY Constraint

The PRIMARY KEY constraint combines the features of NOT NULL and UNIQUE constraints while adding the special designation of being the table’s primary identifier. Each table can have only one primary key, and it serves as the definitive way to uniquely identify each row.

Primary keys play a crucial role in database relationships, serving as the target for foreign key references from other tables. They also influence how the database engine physically stores and organizes data, particularly in systems that use clustered indexing.

CREATE TABLE Products (
    product_id INT PRIMARY KEY,
    product_name VARCHAR(100) NOT NULL,
    category_id INT,
    price DECIMAL(10,2) NOT NULL,
    stock_quantity INT DEFAULT 0
);

In this products table, product_id serves as the primary key, ensuring that each product can be uniquely identified. The database automatically enforces that product_id values cannot be NULL and must be unique across all rows.

Composite Primary Keys

Some tables require composite primary keys, where the combination of multiple columns forms the unique identifier. This is common in junction tables that represent many to many relationships or in tables where natural business keys consist of multiple components.

CREATE TABLE OrderDetails (
    order_id INT,
    product_id INT,
    quantity INT NOT NULL,
    unit_price DECIMAL(10,2) NOT NULL,
    PRIMARY KEY (order_id, product_id)
);

Here, the combination of order_id and product_id forms the primary key, ensuring that each product can appear only once per order while allowing the same product to appear in multiple orders and the same order to contain multiple products.

FOREIGN KEY Constraint

Foreign key constraints establish and enforce relationships between tables by ensuring that values in one table correspond to existing values in another table. This maintains referential integrity and prevents orphaned records that reference non existent parent records.

A foreign key in one table points to a primary key or unique key in another table, creating a parent child relationship. The table containing the foreign key is the child table, while the table containing the referenced primary key is the parent table.

CREATE TABLE Orders (
    order_id INT PRIMARY KEY,
    customer_id INT,
    order_date DATE NOT NULL,
    total_amount DECIMAL(12,2),
    FOREIGN KEY (customer_id) REFERENCES Customers(customer_id)
);

This foreign key constraint ensures that every order must be associated with an existing customer. The database will reject any attempt to insert an order with a customer_id that doesn’t exist in the Customers table.

Referential Actions

Foreign key constraints support various referential actions that define what happens when referenced data in the parent table is modified or deleted.

CASCADE actions automatically propagate changes from the parent table to child tables. When you delete a customer record with CASCADE DELETE, all associated orders for that customer are automatically deleted as well. Similarly, CASCADE UPDATE automatically updates foreign key values when the referenced primary key changes.

SET NULL actions set foreign key columns to NULL when the referenced parent record is deleted or updated. This maintains referential integrity while preserving child records, though it requires that the foreign key column allows NULL values.

SET DEFAULT actions assign a default value to foreign key columns when the referenced parent record is removed. This can be useful for maintaining data consistency while avoiding cascading deletions.

RESTRICT actions prevent modifications to parent records that would violate referential integrity. This is often the default behavior, requiring you to handle child records before modifying or deleting parent records.

CREATE TABLE Orders (
    order_id INT PRIMARY KEY,
    customer_id INT,
    order_date DATE NOT NULL,
    FOREIGN KEY (customer_id) REFERENCES Customers(customer_id) 
        ON DELETE CASCADE ON UPDATE RESTRICT
);

CHECK Constraint

CHECK constraints allow you to define custom validation rules using logical expressions that must evaluate to true for each row. This provides powerful flexibility for enforcing complex business rules directly at the database level.

CHECK constraints can reference single columns or multiple columns within the same row, enabling sophisticated validation logic that ensures data meets specific business requirements before it’s stored in the database.

CREATE TABLE Employees (
    employee_id INT PRIMARY KEY,
    first_name VARCHAR(50) NOT NULL,
    last_name VARCHAR(50) NOT NULL,
    salary DECIMAL(10,2),
    hire_date DATE NOT NULL,
    birth_date DATE,
    department_id INT,
    CHECK (salary > 0),
    CHECK (hire_date >= '1900-01-01'),
    CHECK (birth_date < hire_date),
    CHECK (department_id BETWEEN 1 AND 10)
);

These CHECK constraints enforce several business rules: employee salaries must be positive, hire dates must be reasonable, employees must be born before they’re hired, and department IDs must fall within a valid range.

Complex CHECK Constraints

CHECK constraints can incorporate complex logical expressions including multiple conditions, subqueries, and function calls, allowing for sophisticated validation scenarios.

CREATE TABLE Products (
    product_id INT PRIMARY KEY,
    product_name VARCHAR(100) NOT NULL,
    category VARCHAR(50),
    price DECIMAL(10,2),
    discount_price DECIMAL(10,2),
    stock_quantity INT,
    CHECK (price > 0),
    CHECK (discount_price <= price),
    CHECK (stock_quantity >= 0),
    CHECK (
        (category = 'Electronics' AND price <= 10000) OR
        (category = 'Clothing' AND price <= 1000) OR
        (category NOT IN ('Electronics', 'Clothing'))
    )
);

This example demonstrates how CHECK constraints can enforce category specific pricing rules, ensuring that electronics don’t exceed a certain price point while clothing items have different limits.

Constraint Interaction and Best Practices

Constraint Naming Conventions

Establishing consistent naming conventions for constraints improves database maintainability and makes error messages more meaningful. Include the constraint type and affected columns in the constraint name to make its purpose immediately clear.

CREATE TABLE Customers (
    customer_id INT CONSTRAINT PK_Customers_CustomerID PRIMARY KEY,
    email VARCHAR(100) CONSTRAINT UQ_Customers_Email UNIQUE,
    age INT CONSTRAINT CK_Customers_Age_Range CHECK (age BETWEEN 18 AND 120),
    registration_date DATE CONSTRAINT NN_Customers_RegDate NOT NULL
);

Performance Considerations

Constraints can impact database performance in various ways. PRIMARY KEY and UNIQUE constraints automatically create indexes that improve query performance but require additional storage space and maintenance overhead during data modifications.

CHECK constraints are evaluated during every INSERT and UPDATE operation, so complex CHECK expressions can slow down data modification operations. Balance the benefits of data validation against performance requirements, especially for high volume transactional systems.

Foreign key constraints require the database to verify relationships during data modifications, which can add overhead to INSERT, UPDATE, and DELETE operations. However, they also provide the query optimizer with valuable information about table relationships that can improve JOIN performance.

Constraint Violation Handling

When constraints are violated, databases return specific error messages that applications should handle gracefully. Understanding these error patterns helps in building robust applications that can provide meaningful feedback to users when data validation fails.

Constraint violations typically result in the entire transaction being rolled back, ensuring data consistency but requiring applications to handle partial failure scenarios appropriately. Design your application logic to anticipate constraint violations and provide alternative workflows when strict validation rules prevent certain operations.

Understanding and properly implementing these SQL constraints forms the foundation of reliable database design. They ensure data quality, enforce business rules, and provide the database engine with optimization opportunities that improve overall system performance and reliability.

← Indexes Window functions →

Track your progress

Mark this subtopic as completed when you finish reading.