Histogram

Overview

The Histogram calculator shows the distribution of values for a selected attribute across defined ranges (bins). This visualization helps you understand the shape, central tendency, and spread of your data, making it essential for identifying patterns, outliers, and data distributions.

Common Uses

  • See the distribution of invoice payments (early, on-time, or late)
  • Analyze the distribution of case durations
  • View the distribution of processing times
  • Identify peak activity hours throughout the day
  • Analyze purchase order value distributions

Settings

Attribute: Select the attribute you wish to analyze (e.g., Case Duration, Invoice Amount, Time of Day).

Aggregate Function: Choose how to aggregate the data within each bin:

Function Description
Case Count Returns the number of cases in each bin (most common)
Max Returns the maximum value for each bin
Median Returns the median value for each bin
Min Returns the minimum value for each bin
Sum Returns the sum of values for each bin

Number of Bins: Specify the number of bins (ranges/groups/intervals) to distribute the data into. More bins provide finer granularity, fewer bins show broader patterns.

Calculation Mode: Choose how the histogram ranges are determined:

Mode Description When to Use
Auto Mode Automatically selects appropriate values and units Best starting point for most analyses
Full Range Mode Includes the complete range of values, including outliers When you need to see all data including extreme values
Manual Mode Manually specify min/max values and units When you want to focus on a specific range or exclude outliers

Manual Mode Settings:

  • Min Value: Specify the lowest value to include
  • Max Value: Specify the highest value to include
  • Duration Units: For time-based attributes, specify units (minutes, hours, days, weeks)

Examples

Example 1: Activity Distribution Throughout the Day

Scenario: You want to see when cases are started throughout the day to understand peak activity hours.

Settings:

  • Attribute: Time of Day
  • Aggregate Function: Case Count
  • Number of Bins: 24 (one per hour)
  • Calculation Mode: Manual
    • Min Value: 0 (midnight)
    • Max Value: 24 (end of day)

Output:

The histogram shows that most cases start between 6am and 3pm, with some cases falling outside typical working hours.

Insights: This reveals staffing patterns, identifies after-hours activity that may require investigation, and shows peak processing times.

Example 2: Purchase Order Value Distribution

Scenario: You want to analyze the distribution of total costs for purchase orders.

Settings:

  • Attribute: Total Cost
  • Aggregate Function: Case Count
  • Calculation Mode: Auto

Output:

The histogram shows that most purchase orders are on the lower end, below $1,000.

Insights: This helps you understand typical transaction sizes and identify appropriate approval thresholds.

Example 3: Focused Analysis of Lower-Value Orders

Scenario: To analyze the lower-value purchase orders more closely, use manual mode to zoom into a specific range.

Settings:

  • Attribute: Total Cost
  • Calculation Mode: Manual
    • Max Value: 500

Output:

The histogram now shows finer detail for purchase orders under $500, revealing more granular distribution patterns in this range.

Insights: Manual mode lets you focus on specific ranges of interest by excluding outliers or high-value transactions.


This documentation is part of the mindzie Studio process mining platform.

An error has occurred. This application may no longer respond until reloaded. Reload ??