The PARTITION BY
clause in SQL is a game-changer for anyone delving into advanced data analysis. By dividing your dataset into partitions and applying window functions, you can extract deeper insights and make data-driven decisions with ease. In this blog, we’ll explore four practical examples showcasing the power of PARTITION BY
and how it can enhance your SQL queries.
Example 1: Cumulative Sales Calculation
Need to track how your sales representatives’ performance evolves over time? Use PARTITION BY
to calculate cumulative sales for each rep.
Sample Data:
CREATE TABLE Sales (
SalesID INT PRIMARY KEY,
SalesRep VARCHAR(50),
SaleDate DATE,
SalesAmount DECIMAL(10, 2)
);
INSERT INTO Sales (SalesID, SalesRep, SaleDate, SalesAmount)
VALUES
(1, 'Alice', '2024-01-10', 500.00),
(2, 'Bob', '2024-01-15', 700.00),
(3, 'Alice', '2024-02-05', 600.00),
(4, 'Bob', '2024-02-12', 800.00),
(5, 'Alice', '2024-03-20', 750.00),
(6, 'Bob', '2024-03-25', 900.00);
Query:
SELECT
SalesRep,
SaleDate,
SalesAmount,
SUM(SalesAmount) OVER (PARTITION BY SalesRep ORDER BY SaleDate) AS CumulativeSales
FROM
Sales;
Result:

Example 2: Year-to-Date Sales
Calculate year-to-date sales for your team members to monitor their progress throughout the year.
Sample Data:
CREATE TABLE SalesData (
SalesID INT PRIMARY KEY,
SalesRep VARCHAR(50),
SaleDate DATE,
SaleAmount DECIMAL(10, 2)
);
INSERT INTO SalesData (SalesID, SalesRep, SaleDate, SaleAmount)
VALUES
(1, 'Alice', '2024-01-15', 3000.00),
(2, 'Alice', '2024-02-20', 4500.00),
(3, 'Alice', '2024-03-18', 6000.00),
(4, 'Bob', '2024-01-25', 3500.00),
(5, 'Bob', '2024-02-10', 4700.00),
(6, 'Bob', '2024-03-22', 6500.00);
Query:
SELECT
SalesRep,
SaleDate,
SaleAmount,
SUM(SaleAmount) OVER (PARTITION BY SalesRep ORDER BY SaleDate ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS YearToDateSales
FROM
SalesData;
Result:

Example 3: Moving Average of Sales
Smooth out sales data fluctuations with a moving average over the past three months.
Sample Data:
CREATE TABLE MonthlySales (
MonthYear CHAR(7), -- Format: YYYY-MM
SalesRep VARCHAR(50),
MonthlySales DECIMAL(10, 2)
);
INSERT INTO MonthlySales (MonthYear, SalesRep, MonthlySales)
VALUES
('2024-01', 'Alice', 5000.00),
('2024-02', 'Alice', 6000.00),
('2024-03', 'Alice', 7000.00),
('2024-01', 'Bob', 7000.00),
('2024-02', 'Bob', 8000.00),
('2024-03', 'Bob', 9000.00);
Query:
SELECT
SalesRep,
MonthYear,
MonthlySales,
AVG(MonthlySales) OVER (PARTITION BY SalesRep ORDER BY MonthYear ROWS BETWEEN 2 PRECEDING AND CURRENT ROW) AS MovingAverage
FROM
MonthlySales;
Result:

Example 4: Total Sales by Product Category
Summarize total sales for each product category to gain insights into category performance.
Sample Data:
CREATE TABLE ProductSales (
ProductID INT PRIMARY KEY,
Category VARCHAR(50),
ProductName VARCHAR(50),
Sales DECIMAL(10, 2)
);
INSERT INTO ProductSales (ProductID, Category, ProductName, Sales)
VALUES
(1, 'Electronics', 'Smartphone', 12000.00),
(2, 'Electronics', 'Laptop', 15000.00),
(3, 'Electronics', 'Tablet', 9000.00),
(4, 'Furniture', 'Sofa', 8000.00),
(5, 'Furniture', 'Table', 5000.00),
(6, 'Furniture', 'Chair', 7000.00);
Query:
SELECT
Category,
ProductName,
Sales,
SUM(Sales) OVER (PARTITION BY Category) AS TotalCategorySales
FROM
ProductSales;
Result:

Real-World Applications
Using PARTITION BY
helps you:
- Track cumulative and year-to-date sales to evaluate performance trends.
- Calculate moving averages to smooth out data fluctuations.
- Summarize and analyze sales data by categories for strategic insights.
Have you explored the power of the PARTITION BY
clause in your SQL queries? How have you used it to solve complex data challenges or improve your analysis? Share your experiences or ask questions in the comments below!
Thanks for reading, and happy querying!