ποΈ any_value
Obtains an arbitrary row from each aggregated group.
ποΈ approx_count_distinct
Returns the approximate value of aggregate function similar to the result of COUNT(DISTINCT col).
ποΈ approx_top_k
Returns the top k most frequently occurring item values in an expr along with their approximate counts.
ποΈ avg
Returns the average value of selected fields.
ποΈ bitmap
Here is a simple example to illustrate the usage of several aggregate functions in Bitmap.
ποΈ bool_or
Returns true if at least one row for expr is true.
ποΈ corr
Returns the Pearson correlation coefficient between two expressions.
ποΈ count
Returns the total number of rows specified by an expression.
ποΈ count_if
Returns the number of records that meet the specified condition or 0 if no records satisfy the condition.
ποΈ covar_pop
Returns the population covariance of two expressions.
ποΈ covar_samp
Returns the sample covariance of two expressions.
ποΈ ds_hll_accumulate
Accumulates values into a DataSketches HyperLogLog (HLL) sketch and returns the serialized sketch as VARBINARY for approximate distinct counting.
ποΈ ds_hll_combine
Combines multiple serialized DataSketches HyperLogLog (HLL) sketches into a single serialized sketch for approximate distinct counting.
ποΈ ds_hll_count_distinct
Returns the approximate distinct count using DataSketches HyperLogLog (HLL). Similar to APPROX_COUNT_DISTINCT but with higher precision.
ποΈ ds_hll_estimate
Estimates the approximate distinct count from a serialized DataSketches HyperLogLog (HLL) sketch.
ποΈ ds_theta_count_distinct
Returns approximate distinct count using DataSketches Theta sketch. Faster than COUNT(DISTINCT) with lower memory usage for high-cardinality columns.
ποΈ group_concat
Concatenates non-null values from a group into a single string, with a sep argument, which is , by default if not specified.
ποΈ grouping
Indicates whether a column is an aggregate column.
ποΈ grouping_id
grouping_id is used to distinguish the grouping statistics results of the same grouping standard.
ποΈ hll_raw_agg
Aggregate function that combines multiple HLL values into a single HLL for approximate COUNT(DISTINCT).
ποΈ hll_union
Returns the concatenation of a set of HLL values.
ποΈ hll_union_agg
HLL is an engineering implementation based on the HyperLogLog algorithm, which is used to save the intermediate results of HyperLogGog calculation process.
ποΈ mann_whitney_u_test
mann_whitney_u_test performs the Mann-Whitney rank test on samples derived from two populations.
ποΈ max
Returns the maximum value of the expr expression.
ποΈ max_by
Returns the value of x associated with the maximum value of y.
ποΈ min
Returns the minimum value of the expr expression.
ποΈ min_by
Returns the value of x associated with the minimum value of y.
ποΈ MIN_N, MAX_N
Returns the n smallest or largest values from an expression as an array.
ποΈ multi_distinct_count
Returns the total number of rows of the expr, equivalent to count(distinct expr).
ποΈ multi_distinct_sum
Returns the sum of distinct values in expr, equivalent to sum(distinct expr).
ποΈ percentile_approx
Returns the approximate value for a given percentile p, or an array of values for corresponding percentiles if p is an array.
ποΈ percentile_approx_weight
Returns the approximation of the p-th percentile with weight. A weighted version of PERCENTILE_APPROX that accepts a weight column or constant for each...
ποΈ percentile_cont
Computes the percentile value of expr with linear interpolation.
ποΈ percentile_disc
Returns a percentile value based on a discrete distribution of the input column expr.
ποΈ percentile_disc_lc
Returns a percentile value based on a discrete distribution of the input column expr.
ποΈ retention
Calculates the user retention rate within a specified period of time.
ποΈ std
Returns the standard deviation of an expression.
ποΈ stddev,stddev_pop,std
Returns the population standard deviation of the expr expression.
ποΈ stddev_samp
Returns the sample standard deviation of an expression.
ποΈ sum
Returns the sum of non-null values for expr.
ποΈ sum_map
Aggregates MAP values by summing numeric values for matching keys across multiple rows.
ποΈ var_samp,variance_samp
Returns the sample variance of an expression.
ποΈ variance,var_pop,variance_pop
Returns the population variance of an expression.
ποΈ window_funnel
Searches for an event chain in a sliding window and calculates the maximum number of consecutive events in the event chain.