Detailed Course Outline
Module 1 - Introduction to Dataplex
Topics:
- Modern Data Platforms and Data-Oriented Design
- Pillars of Data Governance
- What is Dataplex?
- Dataplex Capabilities
- Dataplex compared with other products on Google Cloud
Objectives:
- Identify the importance of a modern data platform
- Explain the role of Dataplex on Google Cloud
Module 2 - Creating a Data Mesh on Dataplex
Topics:
- What is a data mesh?
- Dataplex concepts
- Creating data lakes and zones
- Assets in Dataplex
Objectives:
- Define key Dataplex concepts
- Configure and set up Dataplex
Activities:
- Lab: Provision a Data Mesh using Dataplex
Module 3 - Processing Data on Dataplex
Topics:
- Processing data on Dataplex
- Data preparation tasks
- Ingestion jobs
- Dataflow and Spark tasks
Objectives:
- Understand different data processing options in Dataplex
- Configure and run data preparation tasks on Dataplex
Activities:
- Lab: Standardize Data using Dataplex Tasks
Module 4 - Managing Data Security through Dataplex
Topics:
- IAM permissions and roles
- Securing your data lake
- Policy management
- Metadata security
Objectives:
- Secure data lakes, zones, and assets in Dataplex
Activities:
- Lab: Manage Data Security using Dataplex
Module 5 - Data Tagging and Data Catalog
Topics:
- Introduction to Data Catalog
- Technical metadata vs. business metadata
- Tags and tag templates
- Entries and entry groups
- Data lineage
Objectives:
- Implement tagging for resources and use tags to search for assets
Activities:
- Lab: Data Catalog and Data Lineage
Module 6 - Data Quality and Profiling
Topics:
- Data quality tasks and AutoDQ
- Reporting on data quality
- Data profiling
Objectives:
- Design, execute and report on data quality processes
Activities:
- Lab: Data Quality and Profiling your Data in BigQuery
Module 7 - Dataplex Best Practices
Topics:
- Best practices
- End-to-end demo
Objectives:
- Implement best practices for Dataplex
Activities:
- Challenge Lab: Managing a Data Mesh with Dataplex