Check out our High Performance Batch Processing API: Match and Enrich Data Using CSV/TSV Files as Input Data to our APIs Learn More

Using Interzoid with Snowflake

Snowflake is a SQL-oriented Cloud data platform for storing and analyzing data in the Cloud. Interzoid has created a native application for Snowflake available on the Snowflake Marketplace. Interzoid also supports Snowflake connectivity, which means that database tables in these platforms can be targets of Interzoid analysis and processing, simply by using a Snowflake connection string.

There are three ways to work with and process Snowflake tables: Our Snowflake native application, through the Cloud Data Connect Wizard, or via an API call.

Our Snowflake native application enables data quality and data matching to occur on the Snowflake Data Cloud platform with a single SQL query. This native application calls the Interzoid matching APIs via a User-Defined Function (UDF) that has been pre-integrated to the Snowflake. It is available as part of any SQL query on the platform, is easy to set up, and is a powerful offering.

Using the Cloud Data Connect Wizard, you simply connect to an instance of Snowflake with a connection string (see below) or providing connection parameters within the form, selecting the database and table you will use as your source. You must also provide the specific column you will be matching on. You need to choose the category of matching you want to perform (company names, individual names, or addresses).

Finally, select the type of matching you want to perform. This can be a match/inconsistency report that shows clusters of similar data, inconsistent, and otherwise matched data. You can also create an output file with a similarity key for every record in the file. You can create a new table that will be created within the source database that will store the similarity keys along with the corresponding value, or you can choose to generate the SQL that allows for the same. Creating a new table to store similarity keys enables you to perform your own custom types of matching using similarity keys as the basis of a join rather than the actual value of the data itself, and also enables matching across tables within your database. This will provide significantly higher match rates than matching on the original data values.

Here is a screen from the Cloud Data Connect Wizard showing a sample configuration. After you select your options, click "Run" and you will shortly have your results.


Snowflake data matching, data cleansing, and data quality example

You can also access a Snowflake table programmatically via an API call. Here is an example (place in the URL address bar of your browser and press 'enter'):

                                            
    https://connect.interzoid.com/run?function=match&apikey=use-your-own-api-key-here&source=snowflake&connection=your-specific-connection-string&table=companies&column=company&process=matchreport&category=company
                                            

For more details and documentation for the parameters of the API call, visit here.


Connection string example:

            
    "user:password@zwa55555/database/public"
            

A "connection string" provides the parameters necessary to initiate the connection to a specific data source on the Cloud for analysis, enrichment, matching, or whatever data function from Interzoid is selected for use.

A connection string enables connecting to a data source using the Interzoid Cloud Data Connect product.

For additional information performing data matching, match reports, and the ability to match otherwise-inconsistent data in Snowflake tables, see here.