Integrations
Integrations are the fundamental connection point between your data warehouse and SDF. Credentials are used by integrations to authenticate to your data warehouse.
The first step towards using SDF to transform queries in your warehouse is configuring an integration
. Integrations enable SDF to talk to your data warehouse and give you fine-grained controls over the databases, schemas, and tables that SDF can read and write to.
Integrations live as a subproperty of the workspace block, so a basic integration configuration will look like this:
Simply put, this Snowflake integration would let SDF pull table metadata from the database w_source
for compilation, then let SDF write and materialize tables to the database w_target
.
The integration above defaults to using a credential called default
. This prevents you from having to add the credential
property to the integration block.
Sources and targets are configurable at the schema and table level, for example if I only wanted to pull the schema for a table called my_table
in schema schema1
, but I wanted to pull the schema for all tables in schema2
, I would write:
Multiple Integrations
You can have multiple integrations in a single workspace. This is useful when you have multiple data warehouses or databases that you want to hydrate tables from. For example, if you have Snowflake and Redshift databases, you can configure your workspace like so:
In this example, we have two integrations, one for Snowflake and one for Redshift. Each integration has its own set of sources and targets, and each integration uses its own set of credentials to authenticate to the respective data warehouse.
Integration Configuration
Integrations have a few key properties that you can configure:
provider
: The name of the provider you want to use. This can besnowflake
,redshift
,s3
, and others.sdf
is a special provider that allows for running SQL queries locally without a data warehouse connection.type
: The type of integration you’re using. This can bedatabase
,data
, ormetadata
. See our integrations overview for more information.credential
: The name of the credential you want to use to authenticate with your data warehouse. This is optional and defaults todefault
.sources
: A list of patterns that SDF will use to hydrate table schemas from your data warehouse when it doesn’t find them locally defined in SQL.targets
: A list of patterns that SDF will use to materialize tables to your data warehouse when you runsdf run
.
This list is not exhaustive. Other properties may be required based on the integration type and provider you’re using. See the Integration YML Reference for more information.
All tables being materialized or output by SDF compilation or execution must match a target pattern. If a table does not match a target pattern, SDF will throw an error.
Next Steps
For a more active example, follow along with the materialization guide specific to your data warehouse: