<?xml version="1.0" encoding="utf-8" ?>
<!DOCTYPE FL_Course SYSTEM "https://www.flane.de/dtd/fl_course095.dtd"><?xml-stylesheet type="text/xsl" href="https://portal.flane.ch/css/xml-course.xsl"?><course productid="35303" language="en" source="https://portal.flane.ch/swisscom/en/xml-course/google-ideg" lastchanged="2026-03-30T20:05:43+02:00" parent="https://portal.flane.ch/swisscom/en/xml-courses"><title>Introduction to Data Engineering on Google Cloud</title><productcode>IDEG</productcode><vendorcode>GO</vendorcode><vendorname>Google</vendorname><fullproductcode>GO-IDEG</fullproductcode><version>1.0</version><objective>&lt;ul&gt;
&lt;li&gt;Understand the role of a data engineer.&lt;/li&gt;&lt;li&gt;Identify data engineering tasks and core components used on Google Cloud.&lt;/li&gt;&lt;li&gt;Understand how to create and deploy data pipelines of varying patterns on Google Cloud.&lt;/li&gt;&lt;li&gt;Identify and utilize various automation techniques on Google Cloud.&lt;/li&gt;&lt;/ul&gt;</objective><essentials>&lt;ul&gt;
&lt;li&gt;Prior Google Cloud experience at the fundamental level using Cloud Shell and accessing products from the Google Cloud console.&lt;/li&gt;&lt;li&gt;Basic proficiency with a common query language such as SQL.&lt;/li&gt;&lt;li&gt;Experience with data modeling and ETL (extract, transform, load) activities.&lt;/li&gt;&lt;li&gt;Experience developing applications using a common programming language such as Python.&lt;/li&gt;&lt;/ul&gt;</essentials><audience>&lt;ul&gt;
&lt;li&gt;Data engineers&lt;/li&gt;&lt;li&gt;Database administrators&lt;/li&gt;&lt;li&gt;System administrators&lt;/li&gt;&lt;/ul&gt;</audience><outline>&lt;h4&gt;Module 1 - Data Engineering Tasks and Components&lt;/h4&gt;&lt;p&gt;
&lt;strong&gt;Topics:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The role of a data engineer&lt;/li&gt;&lt;li&gt;Data sources versus data sinks&lt;/li&gt;&lt;li&gt;Data formats&lt;/li&gt;&lt;li&gt;Storage solution options on Google Cloud&lt;/li&gt;&lt;li&gt;Metadata management options on Google Cloud&lt;/li&gt;&lt;li&gt;Sharing datasets using Analytics Hub&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Objectives:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Explain the role of a data engineer.&lt;/li&gt;&lt;li&gt;Understand the differences between a data source and a data sink.&lt;/li&gt;&lt;li&gt;Explain the different types of data formats.&lt;/li&gt;&lt;li&gt;Explain the storage solution options on Google Cloud.&lt;/li&gt;&lt;li&gt;Learn about the metadata management options on Google Cloud.&lt;/li&gt;&lt;li&gt;Understand how to share datasets with ease using Analytics Hub.&lt;/li&gt;&lt;li&gt;Understand how to load data into BigQuery using the Google Cloud console or the gcloud CLI.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Activities:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Lab: Loading Data into BigQuery&lt;/li&gt;&lt;li&gt;Quiz&lt;/li&gt;&lt;/ul&gt;&lt;h4&gt;Module 2 - Data Replication and Migration&lt;/h4&gt;&lt;p&gt;
&lt;strong&gt;Topics:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Replication and migration architecture&lt;/li&gt;&lt;li&gt;The gcloud command-line tool&lt;/li&gt;&lt;li&gt;Moving datasets&lt;/li&gt;&lt;li&gt;Datastream&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Objectives:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Explain the baseline Google Cloud data replication and migration architecture.&lt;/li&gt;&lt;li&gt;Understand the options and use cases for the gcloud command-line tool.&lt;/li&gt;&lt;li&gt;Explain the functionality and use cases for Storage Transfer Service.&lt;/li&gt;&lt;li&gt;Explain the functionality and use cases for Transfer Appliance.&lt;/li&gt;&lt;li&gt;Understand the features and deployment of Datastream.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Activities:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Lab: Datastream: PostgreSQL Replication to BigQuery (optional for ILT)&lt;/li&gt;&lt;li&gt;Quiz&lt;/li&gt;&lt;/ul&gt;&lt;h4&gt;Module 3 - The Extract and Load Data Pipeline Pattern&lt;/h4&gt;&lt;p&gt;
&lt;strong&gt;Topics:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Extract and load architecture&lt;/li&gt;&lt;li&gt;The bq command-line tool&lt;/li&gt;&lt;li&gt;BigQuery Data Transfer Service&lt;/li&gt;&lt;li&gt;BigLake&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Objectives:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Explain the baseline extract and load architecture diagram.&lt;/li&gt;&lt;li&gt;Understand the options of the bq command-line tool.&lt;/li&gt;&lt;li&gt;Explain the functionality and use cases for BigQuery Data Transfer Service.&lt;/li&gt;&lt;li&gt;Explain the functionality and use cases for BigLake as a non-extract-load pattern.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Activities:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Lab: BigLake: Qwik Start&lt;/li&gt;&lt;li&gt;Quiz&lt;/li&gt;&lt;/ul&gt;&lt;h4&gt;Module 4 - The Extract, Load, and Transform Data Pipeline Pattern&lt;/h4&gt;&lt;p&gt;
&lt;strong&gt;Topics:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Extract, load, and transform (ELT) architecture&lt;/li&gt;&lt;li&gt;SQL scripting and scheduling with BigQuery&lt;/li&gt;&lt;li&gt;Dataform&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Objectives:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Explain the baseline extract, load, and transform architecture diagram.&lt;/li&gt;&lt;li&gt;Understand a common ELT pipeline on Google Cloud.&lt;/li&gt;&lt;li&gt;Learn about BigQuery&amp;rsquo;s SQL scripting and scheduling capabilities.&lt;/li&gt;&lt;li&gt;Explain the functionality and use cases for Dataform.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Activities:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Lab: Create and Execute a SQL Workflow in Dataform&lt;/li&gt;&lt;li&gt;Quiz&lt;/li&gt;&lt;/ul&gt;&lt;h4&gt;Module 5 - The Extract, Transform, and Load Data Pipeline Pattern&lt;/h4&gt;&lt;p&gt;
&lt;strong&gt;Topics:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Extract, transform, and load (ETL) architecture&lt;/li&gt;&lt;li&gt;Google Cloud GUI tools for ETL data pipelines&lt;/li&gt;&lt;li&gt;Batch data processing using Dataproc&lt;/li&gt;&lt;li&gt;Streaming data processing options&lt;/li&gt;&lt;li&gt;Bigtable and data pipelines&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Objectives:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Explain the baseline extract, transform, and load architecture diagram.&lt;/li&gt;&lt;li&gt;Learn about the GUI tools on Google Cloud used for ETL data pipelines.&lt;/li&gt;&lt;li&gt;Explain batch data processing using Dataproc.&lt;/li&gt;&lt;li&gt;Learn how to use Dataproc Serverless for Spark for ETL.&lt;/li&gt;&lt;li&gt;Explain streaming data processing options.&lt;/li&gt;&lt;li&gt;Explain the role Bigtable plays in data pipelines.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Activities:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Lab: Use Dataproc Serverless for Spark to Load BigQuery (optional for ILT)&lt;/li&gt;&lt;li&gt;Lab: Creating a Streaming Data Pipeline for a Real-Time Dashboard with Dataflow&lt;/li&gt;&lt;li&gt;Quiz&lt;/li&gt;&lt;/ul&gt;&lt;h4&gt;Module 6 - Automation Techniques&lt;/h4&gt;&lt;p&gt;
&lt;strong&gt;Topics:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Automation patterns and options for pipelines&lt;/li&gt;&lt;li&gt;Cloud Scheduler and Workflows&lt;/li&gt;&lt;li&gt;Cloud Composer&lt;/li&gt;&lt;li&gt;Cloud Run Functions&lt;/li&gt;&lt;li&gt;Eventarc&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Objectives:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Explain the automation patterns and options available for pipelines.&lt;/li&gt;&lt;li&gt;Learn about Cloud Scheduler and Workflows.&lt;/li&gt;&lt;li&gt;Learn about Cloud Composer.&lt;/li&gt;&lt;li&gt;Learn about Cloud Run functions.&lt;/li&gt;&lt;li&gt;Explain the functionality and automation use cases for Eventarc.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Activities:&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Lab: Use Cloud Run Functions to Load BigQuery (optional for ILT)&lt;/li&gt;&lt;li&gt;Quiz&lt;/li&gt;&lt;/ul&gt;</outline><objective_plain>- Understand the role of a data engineer.
- Identify data engineering tasks and core components used on Google Cloud.
- Understand how to create and deploy data pipelines of varying patterns on Google Cloud.
- Identify and utilize various automation techniques on Google Cloud.</objective_plain><essentials_plain>- Prior Google Cloud experience at the fundamental level using Cloud Shell and accessing products from the Google Cloud console.
- Basic proficiency with a common query language such as SQL.
- Experience with data modeling and ETL (extract, transform, load) activities.
- Experience developing applications using a common programming language such as Python.</essentials_plain><audience_plain>- Data engineers
- Database administrators
- System administrators</audience_plain><outline_plain>Module 1 - Data Engineering Tasks and Components


Topics:



- The role of a data engineer
- Data sources versus data sinks
- Data formats
- Storage solution options on Google Cloud
- Metadata management options on Google Cloud
- Sharing datasets using Analytics Hub
Objectives:



- Explain the role of a data engineer.
- Understand the differences between a data source and a data sink.
- Explain the different types of data formats.
- Explain the storage solution options on Google Cloud.
- Learn about the metadata management options on Google Cloud.
- Understand how to share datasets with ease using Analytics Hub.
- Understand how to load data into BigQuery using the Google Cloud console or the gcloud CLI.
Activities:



- Lab: Loading Data into BigQuery
- Quiz
Module 2 - Data Replication and Migration


Topics:



- Replication and migration architecture
- The gcloud command-line tool
- Moving datasets
- Datastream
Objectives:



- Explain the baseline Google Cloud data replication and migration architecture.
- Understand the options and use cases for the gcloud command-line tool.
- Explain the functionality and use cases for Storage Transfer Service.
- Explain the functionality and use cases for Transfer Appliance.
- Understand the features and deployment of Datastream.
Activities:



- Lab: Datastream: PostgreSQL Replication to BigQuery (optional for ILT)
- Quiz
Module 3 - The Extract and Load Data Pipeline Pattern


Topics:



- Extract and load architecture
- The bq command-line tool
- BigQuery Data Transfer Service
- BigLake
Objectives:



- Explain the baseline extract and load architecture diagram.
- Understand the options of the bq command-line tool.
- Explain the functionality and use cases for BigQuery Data Transfer Service.
- Explain the functionality and use cases for BigLake as a non-extract-load pattern.
Activities:



- Lab: BigLake: Qwik Start
- Quiz
Module 4 - The Extract, Load, and Transform Data Pipeline Pattern


Topics:



- Extract, load, and transform (ELT) architecture
- SQL scripting and scheduling with BigQuery
- Dataform
Objectives:



- Explain the baseline extract, load, and transform architecture diagram.
- Understand a common ELT pipeline on Google Cloud.
- Learn about BigQuery’s SQL scripting and scheduling capabilities.
- Explain the functionality and use cases for Dataform.
Activities:



- Lab: Create and Execute a SQL Workflow in Dataform
- Quiz
Module 5 - The Extract, Transform, and Load Data Pipeline Pattern


Topics:



- Extract, transform, and load (ETL) architecture
- Google Cloud GUI tools for ETL data pipelines
- Batch data processing using Dataproc
- Streaming data processing options
- Bigtable and data pipelines
Objectives:



- Explain the baseline extract, transform, and load architecture diagram.
- Learn about the GUI tools on Google Cloud used for ETL data pipelines.
- Explain batch data processing using Dataproc.
- Learn how to use Dataproc Serverless for Spark for ETL.
- Explain streaming data processing options.
- Explain the role Bigtable plays in data pipelines.
Activities:



- Lab: Use Dataproc Serverless for Spark to Load BigQuery (optional for ILT)
- Lab: Creating a Streaming Data Pipeline for a Real-Time Dashboard with Dataflow
- Quiz
Module 6 - Automation Techniques


Topics:



- Automation patterns and options for pipelines
- Cloud Scheduler and Workflows
- Cloud Composer
- Cloud Run Functions
- Eventarc
Objectives:



- Explain the automation patterns and options available for pipelines.
- Learn about Cloud Scheduler and Workflows.
- Learn about Cloud Composer.
- Learn about Cloud Run functions.
- Explain the functionality and automation use cases for Eventarc.
Activities:



- Lab: Use Cloud Run Functions to Load BigQuery (optional for ILT)
- Quiz</outline_plain><duration unit="d" days="1">1 day</duration><pricelist><price country="US" currency="USD">595.00</price><price country="GB" currency="GBP">660.00</price><price country="IT" currency="EUR">650.00</price><price country="CA" currency="CAD">820.00</price><price country="DE" currency="EUR">950.00</price><price country="AT" currency="EUR">950.00</price><price country="SE" currency="EUR">950.00</price><price country="FR" currency="EUR">790.00</price><price country="CH" currency="CHF">950.00</price></pricelist><miles/></course>