Snowflake

Snowflake is a cloud-based Data Warehouse solution provided as a Saas (Software-as-a-Service) with full support for ANSI SQL. It also has a unique architecture that enables users to just create tables and start querying data with very less administration or DBA activities needed.

It enables data storage, processing, and analytic solutions that are faster, easier to use, and far more flexible than traditional offerings. The Snowflake data platform is not built on any existing database technology or “big data” software platforms such as Hadoop.

Connecting to Snowflake

Mammoth allows you to connect to your Snowflake Database and get the data into Mammoth.

  1. Select API & Databases from the add menu and click on Snowflake.

    Snowflake selection
  2. Create a new connection and add your database credentials - Host URL, Username, Password, Database and Warehouse.

    Snowflake connection
  3. Select the Schema you want to work on and click on Next.

    Snowflake schema

Once the connection is established, you will be presented with a list of tables and views in that database.

  • Select the desired table to get a preview.

  • Write your own SQL query or run a test query and preview the result.

  • Click on Next

    Snowflake config screen

After you have selected the table you want to work on, you get options to configure it as follows -

  • Rename it in the data pull scheduling window.
  • Save it in a desired location in the the Data Library from Adding file to option.

Scheduling your Data Pulls

You can start retrieving the data now or at a specific time according to your choice. You can also schedule the data pull in order to get the latest data from your Database at a certain time interval - every few minutes, daily, weekly or monthly.

On every data pull from your Database, you also have an option to either replace the older data or combine with older data.

Snowflake data pull

On choosing Combine with older data option, you will get an option to choose a unique sequence column. Using this column, on refresh, Mammoth will pick up all the rows that have greater value in this column than the previous data pull.

Note:

  1. Make sure that Mammoth’s public IP address is added to your whitelist.
  2. Mammoth’s public IP is displayed on the create connection window.