approx_count_distinct

approx_count_distinct(expr[, relativeSD]) – Returns the estimated cardinality by HyperLogLog++. `relativeSD` defines the maximum estimation error.
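To give a feel for how a HyperLogLog-style estimate behaves, here is a minimal Python sketch of plain HyperLogLog. It is not HyperLogLog++ (no bias correction or sparse representation) and it is not the code behind `approx_count_distinct`; the function name `hll_estimate`, the parameter `p`, and the use of SHA-1 as the hash are assumptions made for illustration. The typical relative error of plain HyperLogLog is roughly 1.04 / sqrt(m) for m registers, so a tighter error bound needs more registers.

```python
import hashlib
import math


def hll_estimate(items, p=10):
    """Estimate the number of distinct items with plain HyperLogLog.

    p is the number of index bits, so the sketch uses m = 2**p registers.
    Illustrative only: assumes p >= 7 so the alpha constant below applies.
    """
    m = 1 << p
    registers = [0] * m
    for item in items:
        # 64-bit hash taken from the first 8 bytes of SHA-1 (an arbitrary choice).
        h = int.from_bytes(hashlib.sha1(str(item).encode()).digest()[:8], "big")
        index = h >> (64 - p)                 # top p bits pick a register
        rest = h & ((1 << (64 - p)) - 1)      # remaining 64 - p bits
        # Rank = 1-based position of the leftmost 1-bit in `rest`.
        rank = (64 - p) - rest.bit_length() + 1
        registers[index] = max(registers[index], rank)

    alpha = 0.7213 / (1 + 1.079 / m)          # bias constant for m >= 128
    raw = alpha * m * m / sum(2.0 ** -r for r in registers)

    zeros = registers.count(0)
    if raw <= 2.5 * m and zeros:
        # Small-range correction: fall back to linear counting.
        return m * math.log(m / zeros)
    return raw


values = [i % 50_000 for i in range(200_000)]  # 50,000 distinct values
exact = len(set(values))
estimate = hll_estimate(values)
print(exact, round(estimate))
```

With the default of 1,024 registers the estimate should usually land within a few percent of the exact count while storing only the register array rather than every distinct value.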

approx_percentile

approx_percentile(col, percentage [, accuracy]) – Returns the approximate percentile value of numeric column `col` at the given `percentage`.
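As a reference point for what the approximation is estimating, the following Python sketch computes an exact nearest-rank percentile over a small list. The helper `exact_percentile` is hypothetical, it assumes `percentage` is a fraction between 0.0 and 1.0, and it ignores the optional `accuracy` argument; it is not the algorithm `approx_percentile` uses internally.

```python
import math


def exact_percentile(values, percentage):
    """Nearest-rank percentile: the smallest value such that at least
    `percentage` of the data is less than or equal to it."""
    if not values:
        raise ValueError("values must be non-empty")
    if not 0.0 <= percentage <= 1.0:
        raise ValueError("percentage must be between 0.0 and 1.0")
    ordered = sorted(values)
    rank = max(1, math.ceil(percentage * len(ordered)))  # 1-based rank
    return ordered[rank - 1]


latencies_ms = [12, 15, 11, 40, 13, 14, 90, 16]
print(exact_percentile(latencies_ms, 0.5))   # median by nearest rank
print(exact_percentile(latencies_ms, 1.0))   # maximum
```

An approximate implementation trades this exact sort for a compact summary of the column, which matters when the column is too large to sort cheaply.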

argmin_kld

argmin_kld(float mean, float covar) – Returns the mean or covar that minimizes the KL-distance among distributions.
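The entry does not say how the minimization is carried out. One reading, assumed here, is that each `(mean, covar)` pair describes a one-dimensional Gaussian and that the result is the mean of a single Gaussian q minimizing the summed KL divergence KL(q || N(mean_i, covar_i)). Under that assumption the minimizer has a closed form, the precision-weighted average of the means. The function name `kl_minimizing_mean` is hypothetical and this is a sketch of the math, not the function's actual implementation.

```python
def kl_minimizing_mean(means, variances):
    """Mean of the Gaussian q that minimizes sum_i KL(q || N(mean_i, var_i)).

    Setting the derivative with respect to the mean to zero gives
    sum_i (mu - mean_i) / var_i = 0, i.e. a precision-weighted average.
    """
    if not means or len(means) != len(variances):
        raise ValueError("means and variances must be non-empty and equal length")
    if any(v <= 0 for v in variances):
        raise ValueError("variances must be positive")
    weighted_sum = sum(m / v for m, v in zip(means, variances))
    precision_sum = sum(1.0 / v for v in variances)
    return weighted_sum / precision_sum


# The low-variance (more certain) estimate pulls the result toward itself.
print(kl_minimizing_mean([0.0, 10.0], [1.0, 4.0]))
```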

array

array(expr, …) – Returns an array with the given elements. Platforms: WhereOS, Spark, Hive

angular_distance

angular_distance(ftvec1, ftvec2) – Returns the angular distance between the two given vectors.

angular_similarity

angular_similarity(ftvec1, ftvec2) – Returns the angular similarity of the two given vectors.
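The two angular functions are closely related. A minimal Python sketch, assuming the common definitions angular distance = arccos(cosine similarity) / π and angular similarity = 1 − angular distance, and assuming dense numeric lists as input (the feature-vector format that `ftvec1` and `ftvec2` actually take is not shown in the listing):

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("vectors must be non-zero")
    # Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
    return max(-1.0, min(1.0, dot / (norm_a * norm_b)))


def angular_distance(a, b):
    """Angle between the vectors, scaled to the range [0, 1]."""
    return math.acos(cosine_similarity(a, b)) / math.pi


def angular_similarity(a, b):
    """Complement of the angular distance, also in [0, 1]."""
    return 1.0 - angular_distance(a, b)


print(angular_distance([1.0, 0.0], [0.0, 1.0]))    # orthogonal vectors
print(angular_similarity([1.0, 2.0], [2.0, 4.0]))  # parallel vectors
```

Under these definitions orthogonal vectors are at distance 0.5, parallel vectors have similarity close to 1, and opposite vectors are at distance 1.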