stdevif
This page explains how to use the stdevif aggregation function in APL.
The stdevif
aggregation function in APL computes the standard deviation of values in a group based on a specified condition. This is useful when you want to calculate variability in data, but only for rows that meet a particular condition. For example, you can use stdevif
to find the standard deviation of response times in an HTTP log, but only for requests that resulted in a 200 status code.
The stdevif
function is useful when you want to analyze the spread of data values filtered by specific criteria, such as analyzing request durations in successful transactions or monitoring trace durations of specific services in OpenTelemetry data.
For users of other query languages
If you come from other query languages, this section explains how to adjust your existing queries to achieve the same results in APL.
Usage
Syntax
Parameters
- column: The column that contains the numeric values for which you want to calculate the standard deviation.
- condition: The condition that must be true for the values to be included in the standard deviation calculation.
Returns
The stdevif
function returns a floating-point number representing the standard deviation of the specified column for the rows that satisfy the condition.
Use case examples
In this example, you calculate the standard deviation of request durations (req_duration_ms
), but only for successful HTTP requests (status code 200).
Query
Output
geo.country | stdev_req_duration_ms |
---|---|
US | 120.45 |
Canada | 98.77 |
Germany | 134.92 |
This query calculates the standard deviation of request durations for HTTP 200 responses, grouped by country.
List of related aggregations
- avgif: Similar to
stdevif
, but instead of calculating the standard deviation,avgif
computes the average of values that meet the condition. - sumif: Computes the sum of values that meet the condition. Use
sumif
when you want to aggregate total values instead of analyzing data spread. - varianceif: Returns the variance of values that meet the condition, which is a measure of how spread out the data points are.
- countif: Counts the number of rows that satisfy the specified condition.
- minif: Retrieves the minimum value that satisfies the given condition, useful when finding the smallest value in filtered data.