A stored procedure is a precompiled collection of SQL statements and optional control flow logic that is stored in the database and can be executed as a single unit. Think of a stored procedure as a program that lives inside your database, containing a series of SQL commands that work together to accomplish a specific business task or data operation.
The fundamental concept behind stored procedures is encapsulation of business logic at the database level. Instead of writing the same complex sequence of SQL statements repeatedly in your applications, you can define this logic once as a stored procedure and then call it whenever needed. This approach promotes code reusability, consistency, and centralized maintenance of database operations.
Stored procedures accept input parameters, can contain variables and control structures like loops and conditional statements, and can return output parameters or result sets. They execute on the database server, which means all the processing happens where the data resides, potentially reducing network traffic and improving performance for complex operations.
The precompilation aspect of stored procedures means that the database engine parses and validates the procedure when it is created, then builds an optimized execution plan the first time it is executed. That plan is cached and reused on subsequent calls, eliminating the overhead of parsing and optimizing the SQL statements each time the procedure runs.
Stored Procedure Architecture and Execution
When you create a stored procedure, the database engine parses the SQL statements, checks for syntax errors, and stores the procedure definition in the database’s system catalog. Remaining object references are resolved and an optimized execution plan is compiled at first execution, after which the plan is cached for reuse.
The execution model of stored procedures involves several phases. When a stored procedure is called, the database engine locates the compiled procedure in memory or loads it from storage if necessary. Parameters are bound to their corresponding variables within the procedure, and the execution plan is initiated. The procedure executes its statements in sequence, handling any control flow logic, and returns results or output parameters as defined.
CREATE PROCEDURE ProcessMonthlyOrders
    @ProcessMonth INT,
    @ProcessYear INT,
    @ProcessedOrderCount INT OUTPUT
AS
BEGIN
    DECLARE @OrderCount INT = 0;
    DECLARE @CurrentDate DATE = GETDATE();

    -- Update order status for completed orders
    UPDATE orders
    SET order_status = 'Processed',
        processed_date = @CurrentDate
    WHERE MONTH(order_date) = @ProcessMonth
      AND YEAR(order_date) = @ProcessYear
      AND order_status = 'Pending';

    SET @OrderCount = @@ROWCOUNT;

    -- Insert audit record
    INSERT INTO processing_audit (process_date, month_processed, year_processed, orders_affected)
    VALUES (@CurrentDate, @ProcessMonth, @ProcessYear, @OrderCount);

    -- Return count via output parameter
    SET @ProcessedOrderCount = @OrderCount;

    -- Return summary result set
    SELECT
        @ProcessMonth AS month_processed,
        @ProcessYear AS year_processed,
        @OrderCount AS orders_processed,
        @CurrentDate AS processing_date;
END
This stored procedure demonstrates several key concepts: input parameters that control the procedure’s behavior, local variables for internal processing, data modification operations, audit trail creation, and both output parameters and result sets for returning information to the caller.
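To retrieve the output parameter, the caller declares a local variable and passes it with the OUTPUT keyword. A minimal sketch of such a call, assuming the procedure and tables above exist:

```sql
-- Hypothetical call to the procedure defined above
DECLARE @OrdersProcessed INT;

EXEC ProcessMonthlyOrders
    @ProcessMonth = 3,
    @ProcessYear = 2024,
    @ProcessedOrderCount = @OrdersProcessed OUTPUT;  -- OUTPUT is required on the call side too

SELECT @OrdersProcessed AS orders_processed;
```

The summary result set defined inside the procedure is returned to the client as well, so the caller gets both the scalar count and the row of summary data from a single EXEC.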
Parameter Types and Usage
Stored procedures support multiple parameter types that provide flexibility in how data flows into and out of the procedure. Input parameters allow you to pass values into the procedure to customize its behavior. Output parameters enable the procedure to return calculated values or status information back to the calling application. Input/output parameters can both receive values and return modified values.
Input Parameters
Input parameters are the most common type, allowing you to pass values into the procedure to control its execution. These parameters act like variables within the procedure and can be used in SQL statements, calculations, and control flow logic.
CREATE PROCEDURE GetCustomerOrderSummary
    @CustomerID INT,
    @StartDate DATE,
    @EndDate DATE,
    @MinimumOrderAmount DECIMAL(10,2) = 0
AS
BEGIN
    SELECT
        c.customer_name,
        COUNT(o.order_id) AS total_orders,
        SUM(o.order_total) AS total_spent,
        AVG(o.order_total) AS average_order_value,
        MIN(o.order_date) AS first_order_date,
        MAX(o.order_date) AS most_recent_order_date
    FROM customers c
    INNER JOIN orders o ON c.customer_id = o.customer_id
    WHERE c.customer_id = @CustomerID
      AND o.order_date BETWEEN @StartDate AND @EndDate
      AND o.order_total >= @MinimumOrderAmount
    GROUP BY c.customer_id, c.customer_name;
END
This procedure uses input parameters to create a flexible customer analysis tool. The @MinimumOrderAmount parameter has a default value, making it optional when calling the procedure. This demonstrates how default values can make procedures more user-friendly while maintaining flexibility.
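Because @MinimumOrderAmount has a default, callers can omit it entirely or override it with a named argument. A sketch of both calling styles, using a hypothetical customer ID:

```sql
-- Default of 0 applies: all orders in the date range are included
EXEC GetCustomerOrderSummary
    @CustomerID = 1001,
    @StartDate = '2024-01-01',
    @EndDate = '2024-06-30';

-- Named argument overrides the default
EXEC GetCustomerOrderSummary
    @CustomerID = 1001,
    @StartDate = '2024-01-01',
    @EndDate = '2024-06-30',
    @MinimumOrderAmount = 250.00;
```

Named arguments also let callers supply parameters in any order, which keeps call sites readable as a procedure's signature grows.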
Output Parameters and Return Values
Output parameters allow procedures to return calculated values, status codes, or other information back to the calling application. This is particularly useful for returning scalar values like counts, totals, or success indicators alongside result sets.
CREATE PROCEDURE CalculateCustomerMetrics
    @CustomerID INT,
    @AnalysisPeriodMonths INT = 12,
    @TotalOrders INT OUTPUT,
    @TotalSpent DECIMAL(12,2) OUTPUT,
    @AverageMonthlySpending DECIMAL(10,2) OUTPUT,
    @CustomerRiskScore INT OUTPUT
AS
BEGIN
    DECLARE @StartDate DATE;

    -- Guard against division by zero if the caller passes a non-positive period
    IF @AnalysisPeriodMonths <= 0 SET @AnalysisPeriodMonths = 12;

    SET @StartDate = DATEADD(MONTH, -@AnalysisPeriodMonths, GETDATE());

    -- Calculate basic metrics
    SELECT
        @TotalOrders = COUNT(order_id),
        @TotalSpent = ISNULL(SUM(order_total), 0)
    FROM orders
    WHERE customer_id = @CustomerID
      AND order_date >= @StartDate;

    -- Calculate average monthly spending
    SET @AverageMonthlySpending = @TotalSpent / @AnalysisPeriodMonths;

    -- Calculate risk score based on spending patterns
    IF @AverageMonthlySpending > 1000
        SET @CustomerRiskScore = 1; -- Low risk
    ELSE IF @AverageMonthlySpending > 500
        SET @CustomerRiskScore = 2; -- Medium risk
    ELSE
        SET @CustomerRiskScore = 3; -- High risk

    -- Return detailed breakdown
    SELECT
        order_date,
        order_total,
        DATEDIFF(DAY, order_date, GETDATE()) AS days_ago
    FROM orders
    WHERE customer_id = @CustomerID
      AND order_date >= @StartDate
    ORDER BY order_date DESC;
END
This procedure combines multiple output parameters with a result set to provide comprehensive customer analysis. The calling application can access both the calculated metrics through output parameters and the detailed order history through the result set.
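A caller captures the four output parameters by declaring one local variable per parameter; a hedged sketch of such a call, again with a hypothetical customer ID:

```sql
-- Hypothetical call capturing all four output parameters
DECLARE @Orders INT,
        @Spent DECIMAL(12,2),
        @MonthlyAvg DECIMAL(10,2),
        @Risk INT;

EXEC CalculateCustomerMetrics
    @CustomerID = 1001,
    @TotalOrders = @Orders OUTPUT,
    @TotalSpent = @Spent OUTPUT,
    @AverageMonthlySpending = @MonthlyAvg OUTPUT,
    @CustomerRiskScore = @Risk OUTPUT;
-- @AnalysisPeriodMonths is omitted, so its default of 12 applies

SELECT @Orders AS total_orders,
       @Spent AS total_spent,
       @MonthlyAvg AS avg_monthly_spending,
       @Risk AS risk_score;
```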
Understanding Functions
Database functions are similar to stored procedures but with important distinctions in their design philosophy and usage patterns. Functions are designed to return a single value or a table and are typically used within SQL statements as expressions, similar to built-in functions like SUM() or UPPER().
The key philosophical difference is that functions should be deterministic and side-effect free. A function should always return the same result when called with the same parameters, and it should not modify database state through INSERT, UPDATE, or DELETE operations. This makes functions predictable and safe to use within complex SQL expressions.
Functions come in several varieties: scalar functions that return a single value, table-valued functions that return result sets, and inline table-valued functions that essentially act as parameterized views. Each type serves different purposes and has different performance characteristics.
Scalar Functions
Scalar functions return a single value of a specific data type and can be used anywhere in SQL where you would use a literal value or built-in function. They’re particularly useful for encapsulating complex calculations, data transformations, or business rules that need to be applied consistently across multiple queries.
CREATE FUNCTION CalculateCustomerLifetimeValue
    (@CustomerID INT)
RETURNS DECIMAL(12,2)
AS
BEGIN
    DECLARE @TotalSpent DECIMAL(12,2) = 0;
    DECLARE @FirstOrderDate DATE;
    DECLARE @CustomerTenureMonths INT;
    DECLARE @LifetimeValue DECIMAL(12,2);

    -- Get total spending and tenure
    SELECT
        @TotalSpent = ISNULL(SUM(order_total), 0),
        @FirstOrderDate = MIN(order_date)
    FROM orders
    WHERE customer_id = @CustomerID;

    -- Calculate customer tenure in months
    IF @FirstOrderDate IS NOT NULL
    BEGIN
        SET @CustomerTenureMonths = DATEDIFF(MONTH, @FirstOrderDate, GETDATE());
        IF @CustomerTenureMonths = 0 SET @CustomerTenureMonths = 1; -- Treat brand-new customers as one month of tenure

        -- Calculate lifetime value with tenure adjustment
        SET @LifetimeValue = @TotalSpent * (1.0 + (@CustomerTenureMonths / 100.0));
    END
    ELSE
    BEGIN
        SET @LifetimeValue = 0;
    END

    RETURN @LifetimeValue;
END
This scalar function can now be used in any SQL query where you need customer lifetime value calculations, promoting consistency and eliminating the need to duplicate this complex logic across multiple queries.
-- Using the scalar function in a query
SELECT
    customer_id,
    customer_name,
    dbo.CalculateCustomerLifetimeValue(customer_id) AS lifetime_value,
    CASE
        WHEN dbo.CalculateCustomerLifetimeValue(customer_id) > 5000 THEN 'Premium'
        WHEN dbo.CalculateCustomerLifetimeValue(customer_id) > 2000 THEN 'Standard'
        ELSE 'Basic'
    END AS customer_tier
FROM customers
WHERE customer_status = 'Active';
Table-Valued Functions
Table-valued functions return result sets and can be used in the FROM clause of queries like tables or views. They provide a way to encapsulate complex query logic that can be reused across multiple contexts while accepting parameters to customize the results.
Inline Table-Valued Functions
Inline table-valued functions contain a single SELECT statement and return the result set directly. They’re essentially parameterized views and generally offer better performance than multi-statement table-valued functions because the query optimizer can better integrate them with the calling query.
CREATE FUNCTION GetCustomerOrdersByDateRange
    (@CustomerID INT, @StartDate DATE, @EndDate DATE)
RETURNS TABLE
AS
RETURN (
    SELECT
        o.order_id,
        o.order_date,
        o.order_total,
        o.order_status,
        COUNT(od.order_detail_id) AS item_count,
        SUM(od.quantity * od.unit_price) AS calculated_total
    FROM orders o
    INNER JOIN order_details od ON o.order_id = od.order_id
    WHERE o.customer_id = @CustomerID
      AND o.order_date BETWEEN @StartDate AND @EndDate
    GROUP BY o.order_id, o.order_date, o.order_total, o.order_status
);
This inline table-valued function can be used like a table in queries, with the flexibility of accepting parameters to filter the results.
-- Using the table-valued function
SELECT
    c.customer_name,
    orders.order_date,
    orders.order_total,
    orders.item_count
FROM customers c
CROSS APPLY dbo.GetCustomerOrdersByDateRange(c.customer_id, '2024-01-01', '2024-12-31') AS orders
WHERE c.customer_status = 'Active'
ORDER BY c.customer_name, orders.order_date;
Multi-Statement Table-Valued Functions
Multi-statement table-valued functions use a table variable to build their result set through multiple SQL statements. This provides more flexibility for complex logic but typically has higher overhead than inline functions.
CREATE FUNCTION GenerateCustomerAnalysisReport
    (@AnalysisDate DATE)
RETURNS @CustomerAnalysis TABLE (
    customer_id INT,
    customer_name VARCHAR(100),
    total_orders INT,
    total_spent DECIMAL(12,2),
    average_order_value DECIMAL(10,2),
    days_since_last_order INT,
    customer_status VARCHAR(20),
    risk_category VARCHAR(20)
)
AS
BEGIN
    -- Insert basic customer metrics
    INSERT INTO @CustomerAnalysis (
        customer_id, customer_name, total_orders, total_spent,
        average_order_value, days_since_last_order
    )
    SELECT
        c.customer_id,
        c.customer_name,
        ISNULL(order_stats.order_count, 0),
        ISNULL(order_stats.total_amount, 0),
        ISNULL(order_stats.avg_amount, 0),
        ISNULL(DATEDIFF(DAY, order_stats.last_order_date, @AnalysisDate), 999)
    FROM customers c
    LEFT JOIN (
        SELECT
            customer_id,
            COUNT(*) AS order_count,
            SUM(order_total) AS total_amount,
            AVG(order_total) AS avg_amount,
            MAX(order_date) AS last_order_date
        FROM orders
        WHERE order_date <= @AnalysisDate
        GROUP BY customer_id
    ) order_stats ON c.customer_id = order_stats.customer_id;

    -- Update customer status based on recent activity
    UPDATE @CustomerAnalysis
    SET customer_status = CASE
        WHEN days_since_last_order <= 30 THEN 'Active'
        WHEN days_since_last_order <= 90 THEN 'At Risk'
        WHEN days_since_last_order <= 180 THEN 'Dormant'
        ELSE 'Inactive'
    END;

    -- Calculate risk categories
    UPDATE @CustomerAnalysis
    SET risk_category = CASE
        WHEN total_spent > 5000 AND days_since_last_order <= 60 THEN 'Low Risk'
        WHEN total_spent > 2000 AND days_since_last_order <= 90 THEN 'Medium Risk'
        WHEN total_orders > 0 THEN 'High Risk'
        ELSE 'No History'
    END;

    RETURN;
END
This multi-statement function builds a comprehensive customer analysis by performing multiple operations on a table variable, demonstrating how complex business logic can be encapsulated in a reusable function.
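Like the inline variant, the function is queried as if it were a table, and its computed columns can be filtered and sorted directly; a sketch:

```sql
-- Query the function like a table; filter and sort on its computed columns
SELECT customer_id, customer_name, total_spent, customer_status, risk_category
FROM dbo.GenerateCustomerAnalysisReport('2024-12-31')
WHERE risk_category = 'High Risk'
ORDER BY total_spent DESC;
```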
Performance Considerations and Best Practices
Execution Plan Caching
Both stored procedures and functions benefit from execution plan caching, but the effectiveness varies based on their design and usage patterns. Stored procedures typically cache their execution plans more effectively because they represent complete units of work with predictable parameter patterns.
Functions, particularly scalar functions used in WHERE clauses or SELECT lists, may be executed many times within a single query. If a scalar function is called for each row in a large result set, the performance impact can be significant. In such cases, consider whether the function logic could be rewritten as a JOIN to a table-valued function or incorporated directly into the query.
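As a sketch of that rewrite, the earlier customer-tier query can replace its three per-row scalar calls with one aggregating JOIN. Note this simplified version tiers on raw total spending rather than the tenure-adjusted lifetime-value formula, and assumes the same customers and orders tables:

```sql
-- Set-based alternative: compute each customer's total once
-- instead of invoking a scalar function for every row
SELECT
    c.customer_id,
    c.customer_name,
    ISNULL(t.total_spent, 0) AS total_spent,
    CASE
        WHEN ISNULL(t.total_spent, 0) > 5000 THEN 'Premium'
        WHEN ISNULL(t.total_spent, 0) > 2000 THEN 'Standard'
        ELSE 'Basic'
    END AS customer_tier
FROM customers c
LEFT JOIN (
    SELECT customer_id, SUM(order_total) AS total_spent
    FROM orders
    GROUP BY customer_id
) t ON c.customer_id = t.customer_id
WHERE c.customer_status = 'Active';
```

The optimizer can evaluate the derived table once and join it, rather than executing the function body per row per call site.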
Parameter Sniffing
Parameter sniffing is a phenomenon where the database optimizer creates an execution plan based on the parameter values used during the first execution of a stored procedure. While this can lead to optimal plans for typical parameter values, it can cause performance problems when the procedure is later called with atypical parameters that would benefit from a different execution plan.
CREATE PROCEDURE GetOrdersByStatus
    @OrderStatus VARCHAR(20),
    @UseRecompile BIT = 0
AS
BEGIN
    IF @UseRecompile = 1
    BEGIN
        -- Force recompilation for atypical parameter values
        SELECT o.*, c.customer_name
        FROM orders o
        INNER JOIN customers c ON o.customer_id = c.customer_id
        WHERE o.order_status = @OrderStatus
        OPTION (RECOMPILE);
    END
    ELSE
    BEGIN
        -- Use cached plan for typical executions
        SELECT o.*, c.customer_name
        FROM orders o
        INNER JOIN customers c ON o.customer_id = c.customer_id
        WHERE o.order_status = @OrderStatus;
    END
END
This pattern provides flexibility to force recompilation when needed while allowing plan reuse for typical executions.
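An alternative to on-demand recompilation is to ask the optimizer to build the plan from average column statistics rather than the first sniffed value. A sketch of the statement as it might appear inside the procedure body, using the OPTIMIZE FOR UNKNOWN query hint:

```sql
-- Inside a procedure body: the plan is built for an "average" @OrderStatus
-- value rather than whichever value happened to trigger compilation
SELECT o.order_id, o.order_date, o.order_total, c.customer_name
FROM orders o
INNER JOIN customers c ON o.customer_id = c.customer_id
WHERE o.order_status = @OrderStatus
OPTION (OPTIMIZE FOR (@OrderStatus UNKNOWN));
```

This trades the best-case plan for a stable, middle-of-the-road plan, which is often preferable when parameter values are skewed and recompiling on every call is too expensive.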
Security and Access Control
Stored procedures and functions provide excellent security benefits by encapsulating data access logic and allowing you to grant execute permissions without giving users direct access to underlying tables. This enables implementation of the principle of least privilege, where users can perform necessary operations without having broader database access.
CREATE PROCEDURE UpdateCustomerPreferences
    @CustomerID INT,
    @EmailNotifications BIT,
    @SMSNotifications BIT,
    @NewsletterSubscription BIT
AS
BEGIN
    -- Validate that the customer exists before modifying anything
    IF NOT EXISTS (SELECT 1 FROM customers WHERE customer_id = @CustomerID)
    BEGIN
        RAISERROR('Customer not found', 16, 1);
        RETURN;
    END

    -- Update preferences with audit trail
    UPDATE customer_preferences
    SET email_notifications = @EmailNotifications,
        sms_notifications = @SMSNotifications,
        newsletter_subscription = @NewsletterSubscription,
        last_updated = GETDATE(),
        updated_by = SYSTEM_USER
    WHERE customer_id = @CustomerID;

    -- Log the change
    INSERT INTO preference_audit (
        customer_id, change_date, changed_by, change_type
    ) VALUES (
        @CustomerID, GETDATE(), SYSTEM_USER, 'Preference Update'
    );
END
This procedure demonstrates how business logic, validation, and audit requirements can be encapsulated in a secure, reusable unit that provides controlled access to sensitive customer data.
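Granting execute rights on the procedure, without any table-level permissions, is how this least-privilege model is typically enforced. A sketch using a hypothetical CustomerServiceRole:

```sql
-- Hypothetical role: members can call the procedure but cannot
-- read or write customer_preferences directly
CREATE ROLE CustomerServiceRole;
GRANT EXECUTE ON dbo.UpdateCustomerPreferences TO CustomerServiceRole;
-- With ownership chaining, no GRANT on the underlying tables is needed
-- as long as the procedure and tables share an owner
```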
Understanding stored procedures and functions is essential for building robust, maintainable database applications. They provide powerful tools for encapsulating business logic, improving performance, and maintaining data security while promoting code reusability and consistency across your database applications.