each_top_k

each_top_k(int K, Object group, double cmpKey, *) – Returns top-K values (or tail-K values when

element_at

element_at(array, index) – Returns element of array at given (1-based) index. If index < 0,

distance2similarity

distance2similarity(float d) – Returns 1.0 / (1.0 + d) Platforms: WhereOS, Spark, Hive Class: hivemall.knn.similarity.Distance2SimilarityUDF

distcache_gets

distcache_gets(filepath, key, default_value [, parseKey]) – Returns map|value_type Platforms: WhereOS, Spark, Hive Class: hivemall.tools.mapred.DistributedCacheLookupUDF More

distributed_bloom

Loads a bloomfilter from a file in distributed cache, and makes available as a named

distributed_map

Platforms: WhereOS, Spark, Hive Class: brickhouse.udf.dcache.DistributedMapUDF More functions can be added to WhereOS via Python