How to Connect to Snowflake using SSIS

ETL (Extract, Transform, Load) processes are fundamental in modern data management. Organizations across industries rely on diverse data sources, creating a need to retrieve, process, and integrate this data efficiently. ETL processes unify and optimize data, preparing it for analysis for making right decisions. A wide range of ETL tools is available on the market to handle these tasks, and Microsoft's SQL Server Integration Services (SSIS) stands out as one of the most popular solutions.

If you use SSIS for your ETL workflows, you can enhance its capabilities by incorporating Devart SSIS Data Flow Components. These tools simplify the process of connecting SSIS to various data sources and configuring data flow tasks. This article focuses on how to use these components to connect SSIS to Snowflake – a popular cloud-based data warehousing and analytics platform.

Create data flow tasks for Snowflake

Devart SSIS Components for Snowflake are integrated with Visual Studio. Before we establish a SSIS connection to this particular data source and start configuring a data flow task, we need to make sure that the following prerequisites are in place:

Now we can create a SSIS data flow task for our data source:

  • 1. Open the necessary SSIS project in Visual Studio and go to the Data Flow tab > SSIS Toolbox
  • 2. Select Devart Snowflake Source and drag and drop it onto the Data Flow field.

Add the Snowflake data source connection

The next step is to establish a connection with our data source.

Configure a SSIS Snowflake connection

Double-click the Devart Snowflake Source and select Create New Connection Manager.

Add a new Snowflake connection

You need to provide the following valid connection details for the Snowflake SSIS connector:

  • Domain - the Snowflake account domain
  • User Id - the username used to sign in to the Snowflake account
  • Password - the password to sign in
  • Database - the name of the target database

The above-mentioned parameters are mandatory and must be entered to connect successfully.

Provide general Snowflake connection details

The remaining parameters are optional:

  • Warehouse - the name of the warehouse used for the database
  • Schema - the name of the schema to be used once a connection is opened
  • Role - the role name used to connect

For more precise connection tuning, you can set parameters in the Advanced section of the Connection Manager.

When all details for the connection are provided, check and verify them by clicking Test Connection.

Verify connection details

Click OK, and you will be able to proceed to import Snowflake data with SSIS.

Select Snowflake data for retrieving using SSIS

When the connection is established, double-click Devart Snowflake Source in the Data Flow area of Visual Studio. It opens the Source Editor. The list of available tables with data is displayed on the left. Drag and drop the desired table onto the Query area, and the SELECT query for that table will be generated automatically.

View tables in the Source Editor

The Source Editor supports advanced custom queries that you can construct and use for SSIS data import. You can select specific columns from the table, apply variables, and execute complex statements against the Snowflake data just as you would do with any regular database.

Use complex SQL queries

Preview the data to be imported with this SSIS Snowflake task, and click OK.

Preview Snowflake data to load

Essential SSIS ETL tasks for Snowflake integration

Devart SSIS Data Flow Components for Snowflake are helpful with all standard ETL tasks operated with SSIS.

Import/Export with CSV, XML, and Excel support

Using SSIS allows you to import and export data in various formats, such as CSV, XML, and Excel. You can automate data extraction and loading operations to transfer data into relational databases or other destinations.

Aggregate, split, and merge data

SSIS is a popular option for aggregating and summarizing data. Also, it allows you to split large datasets into smaller portions and merge data from multiple sources.

Integration with cloud storage

SSIS allows you to move and manage data between on-premises systems and cloud-based storage solutions, which is especially helpful for cloud-based ETL operations.

Replication, backup, data migration

To duplicate databases, configure disaster recovery, or migrate data between heterogeneous systems, you can refer to SSIS, as this tool is excellent for maintaining data continuity and accurate transitions between systems.

Conclusion

SSIS remains one of the most popular ETL solutions for data specialists. It allows simple task configuration due to its intuitive GUI and ensures excellent scalability and data protection, and offers a more cost-effective alternative to cloud-based ETL platforms. Besides, it integrates perfectly with all other Microsoft products.

The possibility of enhancing the default functionality of SSIS with solutions such as Devart SSIS Data Flow Components is another advantage. You can improve all your ETL processes and make such essential operations as configuring data flow tasks much more straightforward. Download and install the SSIS Data Flow Components for Snowflake for ETL processes related to this platform, or opt for the SSIS Integration Universal Bundle, a comprehensive set of solutions for both databases and cloud platforms in a single package. A fully functional free trial is available for all products.

SSIS Data Flow Components

High-performance SSIS components for popular databases and cloud applications