Machine Learning & Data Science

Machine learning and data science, also referred to as advanced analytics, artificial intelligence (AI), and “Big Data”, cover a broad category of vendors, tools, and technologies that provide advanced capabilities for statistical and predictive modeling.

These tools and technologies often overlap in features and functionality with BI tools; however, they focus less on analyzing and reporting on past data. Instead, they examine large data sets to discover patterns and uncover useful business information that can be used to predict future trends.

The following machine learning and data science platforms and technologies are known to provide native connectivity to Snowflake:

Solution

Version / Installation Requirements

Notes

Alteryx (https://www.alteryx.com)

Available in Partner Connect

Alteryx: Analytics 11.5 (or higher)

Snowflake: ODBC Driver — download from the ODBC Download (https://developers.snowflake.com/odbc/) page

  • Available for trial via Snowflake Partner Connect.

  • Validated by the Snowflake Ready Technology Validation Program.

  • Additional resources:

    • Snowflake In-Database Functionality Now Available (https://community.alteryx.com/t5/Analytics-Blog/Snowflake-In-DB-Functionality-is-Now-Available-Making-11-0-Even/ba-p/77268) (Alteryx Community Blog)

    • Supported Data Sources — Snowflake (https://help.alteryx.com/current/DataSources/Snowflake.htm) (Alteryx Documentation)

Amazon SageMaker (https://aws.amazon.com/sagemaker/)

Amazon SageMaker: No requirements

Snowflake: No requirements

Big Squid (https://www.bigsquid.com/)

Big Squid: No requirements

Snowflake: No requirements

  • Validated by the Snowflake Ready Technology Validation Program.

  • Additional resources:

    • How Do I Create a Snowflake Connection? (https://support.bigsquid.com/hc/en-us/articles/360000232593-How-do-I-create-a-Snowflake-connection-) (Big Squid Support)

BoostKPI (https://boostkpi.com/)

BoostKPI: No requirements

Snowflake: No requirements

  • Additional resources:

    • BoostKPI Integration with Snowflake (https://boostkpi.com/partners/snowflake) (BoostKPI website)

Databricks (https://www.databricks.com/)

Databricks: Runtime 4.2 (or higher)

Snowflake: No requirements

  • Integration implemented through the Snowflake Connector for Spark embedded in Databricks Runtime.

  • Additional resources:

    • Data Sources — Snowflake (https://docs.databricks.com/spark/latest/data-sources/snowflake.html) (Databricks Documentation)

Dataiku (https://www.dataiku.com/dss/editions/)

Available in Partner Connect

Dataiku: DSS

Snowflake: JDBC Driver — download from the JDBC Driver page in the Maven Central Repository (https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc)
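For tools that connect through the JDBC driver, the connection is typically described by a JDBC URL. The following is a minimal sketch of how such a URL could be assembled, not a Dataiku-specific procedure. The helper name `snowflake_jdbc_url` and the account identifier `myorg-myaccount` are hypothetical; `db`, `schema`, `warehouse`, and `role` are connection parameters accepted by the Snowflake JDBC driver.

```python
from urllib.parse import urlencode


def snowflake_jdbc_url(account_identifier, **params):
    """Assemble a Snowflake JDBC URL (hypothetical helper, for illustration only).

    account_identifier is a placeholder such as "myorg-myaccount".
    params are optional connection parameters, e.g. db, schema, warehouse, role.
    """
    base = f"jdbc:snowflake://{account_identifier}.snowflakecomputing.com/"
    query = urlencode(params)  # percent-encodes values and joins them with "&"
    return f"{base}?{query}" if query else base
```

Calling `snowflake_jdbc_url("myorg-myaccount", db="SALES_DB", warehouse="ML_WH")` yields a URL that can be pasted into a tool's JDBC connection settings. Credentials are deliberately left out of the URL and are assumed to be supplied through the tool's own connection form.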

DataRobot (https://www.datarobot.com/)

Available in Partner Connect

DataRobot: No requirements

Snowflake: No requirements

Domino (https://www.dominodatalab.com/)

Domino: 3.6 (or higher)

Snowflake: See the Domino documentation for requirements

  • Additional resources:

    • Connecting to Snowflake from Domino (https://docs.dominodatalab.com/en/latest/reference/data/data_sources/Connecting_to_Snowflake_from_Domino.html) (Domino Documentation)

H2O.ai (https://www.h2o.ai/)

Available in Partner Connect

H2O.ai: Driverless AI 1.4.2 (or higher)

Snowflake: No requirements

Hex (https://hex.tech/)

Available in Partner Connect

Hex: No requirements

Snowflake: No requirements

  • Available for trial via Snowflake Partner Connect.

  • Validated by the Snowflake Ready Technology Validation Program.

  • Additional resources:

    • Query Snowflake data right from a powerful data notebook (https://hex.tech/integrations/snowflake) (Hex website)

    • Connect to Data (https://learn.hex.tech/docs/connect-to-data/data-connections/overview) (Hex Documentation)

KNIME (https://www.knime.com/)

KNIME:

  • Analytics Platform 4.4.0 (or higher)

  • Extension: KNIME Snowflake Integration (https://hub.knime.com/knime/extensions/org.knime.features.snowflake/latest)

Snowflake: No requirements (the JDBC Driver is embedded in the KNIME extension); other Snowflake drivers are also supported

  • Additional resources:

    • Overview: KNIME for Snowflake Users (https://www.knime.com/knime-for-snowflake-users) (KNIME website)

    • Collection: KNIME for Snowflake Users (https://hub.knime.com/knime/collections/KNIME%20for%20Snowflake%20Users~1sIkhkwhAvlptfBj) (KNIME Community Hub)

    • KNIME Snowflake Extension Guide (https://docs.knime.com/latest/snowflake_extension_guide/) (KNIME Documentation)

Qubole (https://www.qubole.com/)

Qubole: Enterprise Edition

Snowflake: No requirements

  • Integration implemented through the Snowflake Connector for Spark embedded in Qubole Data Service (QDS).

  • Additional resources:

    • Qubole Quickstart Guide (http://docs.qubole.com/en/latest/quick-start-guide/index.html) (Qubole Documentation)

    • Qubole-Snowflake Integration Guide (http://docs.qubole.com/en/latest/partner-integration/snowflake-integration/index.html) (Qubole Documentation)

SAS (https://www.sas.com/)

SAS:

  • Cloud Analytic Services 3.4 (or higher)

  • SAS/ACCESS 9.4 (or higher) for Relational Databases

Snowflake: ODBC Driver — download from the ODBC Download (https://developers.snowflake.com/odbc/) page

  • Additional resources:

    • Snowflake: Data Connector Specifics (https://documentation.sas.com/?docsetId=casref&docsetTarget=p183rli8obtde3n10y9bzbrpwnsh.htm&docsetVersion=3.4&locale=en) (SAS Documentation)

    • SAS/ACCESS Interface to Snowflake (https://documentation.sas.com/?docsetId=acreldb&docsetTarget=p19i7uzcbso1szn1pczxn88co3g1.htm&docsetVersion=9.4&locale=en) (SAS Documentation)

Apache Spark (http://spark.apache.org/)

Spark: 3.0 (or higher)

Scala: 2.12 or 2.13

Snowflake:

  • JDBC Driver — download from the JDBC Driver page in the Maven Central Repository (https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc)

  • Connector for Spark — download from the Snowflake Connector for Spark page in the Maven Central Repository (https://central.sonatype.com/search?q=g%3Anet.snowflake%20spark-snowflake&sort=published)

  • Additional resources:

    • Configuring Snowflake to Communicate with Spark Running on EMR (https://community.snowflake.com/s/article/configuring-snowflake-to-communicate-with-apache-spark-running-on-amazon-emr-with-apache-zeppelin) (Snowflake Community)
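Once the JDBC driver and the Connector for Spark are on the Spark classpath, a read from Snowflake can be sketched roughly as follows. This is an illustrative outline rather than a complete setup: the helper names `snowflake_options` and `read_table` are hypothetical, and all connection values are placeholders. The data source name and the `sfURL`, `sfUser`, `sfPassword`, `sfDatabase`, `sfSchema`, `sfWarehouse`, and `dbtable` options are those used by the Snowflake Connector for Spark.

```python
# Fully qualified data source name of the Snowflake Connector for Spark.
SNOWFLAKE_SOURCE_NAME = "net.snowflake.spark.snowflake"


def snowflake_options(account, user, password, database, schema, warehouse):
    """Build the connector's option map (hypothetical helper; values are placeholders)."""
    return {
        "sfURL": f"{account}.snowflakecomputing.com",
        "sfUser": user,
        "sfPassword": password,
        "sfDatabase": database,
        "sfSchema": schema,
        "sfWarehouse": warehouse,
    }


def read_table(spark, options, table):
    """Return a DataFrame for one Snowflake table, given an active SparkSession."""
    return (
        spark.read.format(SNOWFLAKE_SOURCE_NAME)
        .options(**options)        # connection settings
        .option("dbtable", table)  # table to read
        .load()
    )
```

With an existing `SparkSession` named `spark`, a call such as `read_table(spark, snowflake_options("myorg-myaccount", "ANALYST", password, "SALES_DB", "PUBLIC", "ML_WH"), "ORDERS")` would return a DataFrame backed by the `ORDERS` table. The password is assumed to come from a secret store or environment variable rather than being hard-coded.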

Tellius (http://tellius.com)

Tellius: No requirements

Snowflake: No requirements

  • Additional resources:

    • Tellius + Snowflake for Instant Analytics & AI at Scale (https://www.tellius.com/snowflake/) (Tellius website)

    • 4 Things That Make Tellius and Snowflake Great Together (https://www.tellius.com/4-things-that-make-tellius-and-snowflake-great-together/) (Tellius website)

    • Transform Data in Snowflake Data Cloud with Java UDFs in Tellius (https://www.tellius.com/transform-data-in-snowflake-data-cloud-with-java-udfs-in-tellius/) (Tellius website)

Zepl (https://www.zepl.com/)

Zepl: No requirements

Snowflake:

  • JDBC Driver — download from the JDBC Driver page in the Maven Central Repository (https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc)

  • Connector for Spark — download from the Snowflake Connector for Spark page in the Maven Central Repository (https://central.sonatype.com/search?q=g%3Anet.snowflake%20spark-snowflake&sort=published)

  • Additional resources:

    • Zepl and Snowflake Bring Data Science as a Service to Cloud Data Warehouses (https://www.zepl.com/blog/zepl-and-snowflake-bring-data-science-as-a-service-to-cloud-data-warehouses/) (Zepl Blog)

    • Getting Started with Zepl and Snowflake in Minutes (https://www.zepl.com/blog/get-started-with-zepl-and-snowflake-in-minutes/) (Zepl Blog)

    • Zepl and Snowflake — Data Science as a Service for your Cloud Warehouse (https://www.zepl.com/wp-content/uploads/2019/06/Snowflake1s.pdf) (Zepl Solution Brief)
