snowflake.snowpark.functions.approx_count_distinct
- snowflake.snowpark.functions.approx_count_distinct(e: Union[Column, str]) → Column [source] (https://github.com/snowflakedb/snowpark-python/blob/v1.16.0/src/snowflake/snowpark/functions.py#L623-L639)
Uses HyperLogLog to return an approximation of the number of distinct values in the input. In SQL terms, HLL(col1, col2, … ) returns an approximation of COUNT(DISTINCT col1, col2, … ).
- Example::
    >>> from snowflake.snowpark.functions import approx_count_distinct
    >>> df = session.create_dataframe([[1, 2], [3, 4], [5, 6]], schema=["a", "b"])
    >>> df.select(approx_count_distinct("a").alias("result")).show()
    ------------
    |"RESULT"  |
    ------------
    |3         |
    ------------
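
To give a feel for how a HyperLogLog estimate like the one described above can be computed, here is a minimal pure-Python sketch. It is an illustration of the general technique only and is not Snowflake's implementation: the function name hll_estimate, the precision parameter p, and the use of SHA-256 as the hash are all assumptions made for this example. Each value is hashed, the first p bits select a register, and each register remembers the longest run of leading zeros seen in the remaining bits. The estimate is derived from the registers, with a linear-counting fallback for small inputs.

```python
import hashlib
import math


def hll_estimate(values, p=12):
    """Approximate the number of distinct items in `values`.

    Hypothetical helper for illustration; `p` is the number of index
    bits, so the sketch uses m = 2**p registers.
    """
    m = 1 << p
    registers = [0] * m
    for value in values:
        # Derive a 64-bit hash from the value's repr (an arbitrary choice).
        digest = hashlib.sha256(repr(value).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        index = h >> (64 - p)                 # first p bits pick a register
        rest = h & ((1 << (64 - p)) - 1)      # remaining 64 - p bits
        # Rank = 1-based position of the leftmost 1-bit in `rest`.
        rank = (64 - p) - rest.bit_length() + 1
        registers[index] = max(registers[index], rank)

    # Bias-correction constant, valid for m >= 128.
    alpha = 0.7213 / (1 + 1.079 / m)
    raw = alpha * m * m / sum(2.0 ** -r for r in registers)

    # Small-range correction: fall back to linear counting.
    zeros = registers.count(0)
    if raw <= 2.5 * m and zeros:
        return m * math.log(m / zeros)
    return raw


# 10,000 distinct values, each appearing three times.
data = [i % 10_000 for i in range(30_000)]
print(round(hll_estimate(data)))
```

With m registers the typical relative error is on the order of 1.04 / sqrt(m), so the result is close to, but generally not exactly equal to, the true distinct count. That trade-off is the reason to prefer an exact count when precision matters more than speed or memory.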