Orchestrate BigQuery Workloads with Dataform (OBWD) – Outline

Detailed Course Outline

Module 1 - Dataform Core Components

Topics:

  • SQL workflow
  • Repositories and workspaces
  • Default files and folders
  • Compiled graphs

Objectives:

  • Understand the components of Dataflow core.

Module 2 - Table Definitions and Dependencies

Topics:

  • Declare a data source.
  • Create a table.
  • Create an incremental table.
  • Set partitioning and clustering options.
  • Create an empty table.
  • Create an external BigLake table.
  • Create views and materialized views.
  • Define dependencies.

Objectives:

  • Create tables and views in BigQuery using Dataform

Module 3 - Document BigQuery Tables and Views

Topics:

  • Use column descriptions.
  • Use globally defined JavaScript constants.
  • Add labels.

Objectives:

  • Document BigQuery tables and views.

Activities:

  • Lab: Build SQL Workflows with Dependencies in Dataform

Module 4 - BigQuery Security Settings

Topics:

  • IAM dataset and table/view access
  • Column-level security
  • Row-level security

Objectives:

  • Understand BigQuery security settings using Dataform

Module 5 - Assertions

Topics:

  • Use built-in assertions.
  • Create manual assertions.

Objectives:

  • Use assertions to validate data in Dataform workflows.

Activities:

  • Lab: Work with Assertions and BigQuery Security Settings in Dataform.

Module 6 - SQL Workflow Executions

Topics:

  • Dataform code lifecycle.
  • What happens during compilation.
  • Customize and schedule compilation results.
  • Execute workflows (UI, Cloud Scheduler, Cloud Composer).
  • Logging and monitoring.

Objectives:

  • Execute Dataform SQL workflows in an automated fashion.

Activities:

  • Lab: Automate and Monitor SQL Workflow Executions in Dataform

Module 7 - Advanced Use Cases

Topics:

  • Create a BigLake table after file upload using Cloud Run functions.
  • Build a Machine Learning pipeline with BigQuery ML.
  • Work with Slowly Changing Dimensions Type 2.

Objectives:

  • Explore additional use cases for Dataform.

Activities:

  • Lab: Create a BigLake Table with Dataform Using Cloud Run Functions.