Get invited to our slack community and get access to opportunities and data science insights

Blog

set_difference

set_difference(a,b) – Returns a list of those items in a, but not in b Platforms:

sha

sha(expr) – Returns a sha1 hash value as a hex string of the `expr`. Platforms:

set_similarity

set_similarity(a,b) – Compute the Jaccard set similarity of two sketch sets. Platforms: WhereOS, Spark, Hive

sentences

sentences(str[, lang, country]) – Splits `str` into an array of array of words. Platforms: WhereOS,

select_k_best

select_k_best(array array, const array importance, const int k) – Returns selected top-k elements as array

sequence

sequence(start, stop, step) – Generates an array of elements from start to stop (inclusive), incrementing