Managing a Data Mesh with Dataplex (MDMD) – Outline

Detailed Course Outline

Module 1 - Introduction to Dataplex

Topics:

  • Modern Data Platforms and Data-Oriented Design
  • Pillars of Data Governance
  • What is Dataplex?
  • Dataplex Capabilities
  • Dataplex compared with other products on Google Cloud

Objectives:

  • Identify the importance of a modern data platform
  • Explain the role of Dataplex on Google Cloud

Module 2 - Creating a Data Mesh on Dataplex

Topics:

  • What is a data mesh?
  • Dataplex concepts
  • Creating data lakes and zones
  • Assets in Dataplex

Objectives:

  • Define key Dataplex concepts
  • Configure and set up Dataplex

Activities:

  • Lab: Provision a Data Mesh using Dataplex

Module 3 - Processing Data on Dataplex

Topics:

  • Processing data on Dataplex
  • Data preparation tasks
  • Ingestion jobs
  • Dataflow and Spark tasks

Objectives:

  • Understand the different data processing options in Dataplex
  • Configure and run data preparation tasks on Dataplex

Activities:

  • Lab: Standardize Data using Dataplex Tasks

Module 4 - Managing Data Security through Dataplex

Topics:

  • IAM permissions and roles
  • Securing your data lake
  • Policy management
  • Metadata security

Objectives:

  • Secure data lakes, zones, and assets in Dataplex

Activities:

  • Lab: Manage Data Security using Dataplex

Module 5 - Data Tagging and Data Catalog

Topics:

  • Introduction to Data Catalog
  • Technical metadata vs. business metadata
  • Tags and tag templates
  • Entries and entry groups
  • Data lineage

Objectives:

  • Implement tagging for resources and use tags to search for assets

Activities:

  • Lab: Data Catalog and Data Lineage

Module 6 - Data Quality and Profiling

Topics:

  • Data quality tasks and AutoDQ
  • Reporting on data quality
  • Data profiling

Objectives:

  • Design, execute, and report on data quality processes

Activities:

  • Lab: Data Quality and Profiling your Data in BigQuery

Module 7 - Dataplex Best Practices

Topics:

  • Best practices
  • End-to-end demo

Objectives:

  • Implement best practices for Dataplex

Activities:

  • Challenge Lab: Managing a Data Mesh with Dataplex