<?xml version="1.0" encoding="utf-8" ?>
<!DOCTYPE FL_Course SYSTEM "https://www.flane.de/dtd/fl_course095.dtd"><?xml-stylesheet type="text/xsl" href="https://portal.flane.ch/css/xml-course.xsl"?><course productid="18642" language="fr" source="https://portal.flane.ch/swisscom/fr/xml-course/google-degcp" lastchanged="2025-11-18T18:18:14+01:00" parent="https://portal.flane.ch/swisscom/fr/xml-courses"><title>Data Engineering on Google Cloud Platform</title><productcode>DEGCP</productcode><vendorcode>GO</vendorcode><vendorname>Google</vendorname><fullproductcode>GO-DEGCP</fullproductcode><version>3.0</version><objective>&lt;p&gt;This course teaches participants the following skills:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Design and build data processing systems on Google Cloud Platform&lt;/li&gt;&lt;li&gt;Process batch and streaming data by implementing autoscaling data pipelines on Cloud Dataflow&lt;/li&gt;&lt;li&gt;Derive business insights from extremely large datasets using Google BigQuery&lt;/li&gt;&lt;li&gt;Train, evaluate and predict using machine learning models using Tensorflow and Cloud ML&lt;/li&gt;&lt;li&gt;Leverage unstructured data using Spark and ML APIs on Cloud Dataproc&lt;/li&gt;&lt;li&gt;Enable instant insights from streaming data&lt;/li&gt;&lt;/ul&gt;</objective><essentials>&lt;p&gt;To get the most of out of this course, participants should have:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Completed &lt;span class=&quot;attentionbbcode&quot; title=&quot;inactive or disabled course: GO-GCF-BDM&quot;&gt;!&lt;/span&gt;Google Cloud Fundamentals: Big Data and Machine Learning &lt;span class=&quot;fl-prod-pcode&quot;&gt;(GCF-BDM)&lt;/span&gt; course OR have equivalent experience&lt;/li&gt;&lt;li&gt;Basic proficiency with common query language such as SQL&lt;/li&gt;&lt;li&gt;Experience with data modeling, extract, transform, load activities Developing applications using a common programming language such Python&lt;/li&gt;&lt;li&gt;Familiarity with Machine Learning and/or statistics&lt;/li&gt;&lt;/ul&gt;</essentials><audience>&lt;p&gt;This class is intended for experienced developers who are responsible for managing big data transformations including:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Extracting, Loading, Transforming, cleaning, and validating data&lt;/li&gt;&lt;li&gt;Designing pipelines and architectures for data processing&lt;/li&gt;&lt;li&gt;Creating and maintaining machine learning and statistical models&lt;/li&gt;&lt;li&gt;Querying datasets, visualizing query results and creating reports&lt;/li&gt;&lt;/ul&gt;</audience><contents>&lt;h5&gt;Module 1: Google Cloud Dataproc Overview&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Creating and managing clusters.&lt;/li&gt;&lt;li&gt;Leveraging custom machine types and preemptible worker nodes.&lt;/li&gt;&lt;li&gt;Scaling and deleting Clusters.&lt;/li&gt;&lt;li&gt;Lab: Creating Hadoop Clusters with Google Cloud Dataproc.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 2: Running Dataproc Jobs&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Running Pig and Hive jobs.&lt;/li&gt;&lt;li&gt;Separation of storage and compute.&lt;/li&gt;&lt;li&gt;Lab: Running Hadoop and Spark Jobs with Dataproc.&lt;/li&gt;&lt;li&gt;Lab: Submit and monitor jobs.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 3: Integrating Dataproc with Google Cloud Platform&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Customize cluster with initialization actions.&lt;/li&gt;&lt;li&gt;BigQuery Support.&lt;/li&gt;&lt;li&gt;Lab: Leveraging Google Cloud Platform Services.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 4: Making Sense of Unstructured Data with Google&amp;rsquo;s Machine Learning APIs&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Google&amp;rsquo;s Machine Learning APIs.&lt;/li&gt;&lt;li&gt;Common ML Use Cases.&lt;/li&gt;&lt;li&gt;Invoking ML APIs.&lt;/li&gt;&lt;li&gt;Lab: Adding Machine Learning Capabilities to Big Data Analysis.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 5: Serverless data analysis with BigQuery&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;What is BigQuery.&lt;/li&gt;&lt;li&gt;Queries and Functions.&lt;/li&gt;&lt;li&gt;Lab: Writing queries in BigQuery.&lt;/li&gt;&lt;li&gt;Loading data into BigQuery.&lt;/li&gt;&lt;li&gt;Exporting data from BigQuery.&lt;/li&gt;&lt;li&gt;Lab: Loading and exporting data.&lt;/li&gt;&lt;li&gt;Nested and repeated fields.&lt;/li&gt;&lt;li&gt;Querying multiple tables.&lt;/li&gt;&lt;li&gt;Lab: Complex queries.&lt;/li&gt;&lt;li&gt;Performance and pricing.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 6: Serverless, autoscaling data pipelines with Dataflow&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;The Beam programming model.&lt;/li&gt;&lt;li&gt;Data pipelines in Beam Python.&lt;/li&gt;&lt;li&gt;Data pipelines in Beam Java.&lt;/li&gt;&lt;li&gt;Lab: Writing a Dataflow pipeline.&lt;/li&gt;&lt;li&gt;Scalable Big Data processing using Beam.&lt;/li&gt;&lt;li&gt;Lab: MapReduce in Dataflow.&lt;/li&gt;&lt;li&gt;Incorporating additional data.&lt;/li&gt;&lt;li&gt;Lab: Side inputs.&lt;/li&gt;&lt;li&gt;Handling stream data.&lt;/li&gt;&lt;li&gt;GCP Reference architecture.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 7: Getting started with Machine Learning&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;What is machine learning (ML).&lt;/li&gt;&lt;li&gt;Effective ML: concepts, types.&lt;/li&gt;&lt;li&gt;ML datasets: generalization.&lt;/li&gt;&lt;li&gt;Lab: Explore and create ML datasets.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 8: Building ML models with Tensorflow&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Getting started with TensorFlow.&lt;/li&gt;&lt;li&gt;Lab: Using tf.learn.&lt;/li&gt;&lt;li&gt;TensorFlow graphs and loops + lab.&lt;/li&gt;&lt;li&gt;Lab: Using low-level TensorFlow + early stopping.&lt;/li&gt;&lt;li&gt;Monitoring ML training.&lt;/li&gt;&lt;li&gt;Lab: Charts and graphs of TensorFlow training.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 9: Scaling ML models with CloudML&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Why Cloud ML?&lt;/li&gt;&lt;li&gt;Packaging up a TensorFlow model.&lt;/li&gt;&lt;li&gt;End-to-end training.&lt;/li&gt;&lt;li&gt;Lab: Run a ML model locally and on cloud.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 10: Feature Engineering&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Creating good features.&lt;/li&gt;&lt;li&gt;Transforming inputs.&lt;/li&gt;&lt;li&gt;Synthetic features.&lt;/li&gt;&lt;li&gt;Preprocessing with Cloud ML.&lt;/li&gt;&lt;li&gt;Lab: Feature engineering.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 11: Architecture of streaming analytics pipelines&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Stream data processing: Challenges.&lt;/li&gt;&lt;li&gt;Handling variable data volumes.&lt;/li&gt;&lt;li&gt;Dealing with unordered/late data.&lt;/li&gt;&lt;li&gt;Lab: Designing streaming pipeline.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 12: Ingesting Variable Volumes&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;What is Cloud Pub/Sub?&lt;/li&gt;&lt;li&gt;How it works: Topics and Subscriptions.&lt;/li&gt;&lt;li&gt;Lab: Simulator.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 13: Implementing streaming pipelines&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Challenges in stream processing.&lt;/li&gt;&lt;li&gt;Handle late data: watermarks, triggers, accumulation.&lt;/li&gt;&lt;li&gt;Lab: Stream data processing pipeline for live traffic data.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 14: Streaming analytics and dashboards&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Streaming analytics: from data to decisions.&lt;/li&gt;&lt;li&gt;Querying streaming data with BigQuery.&lt;/li&gt;&lt;li&gt;What is Google Data Studio?&lt;/li&gt;&lt;li&gt;Lab: build a real-time dashboard to visualize processed data.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 15: High throughput and low-latency with Bigtable&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;What is Cloud Spanner?&lt;/li&gt;&lt;li&gt;Designing Bigtable schema.&lt;/li&gt;&lt;li&gt;Ingesting into Bigtable.&lt;/li&gt;&lt;li&gt;Lab: streaming into Bigtable.&lt;/li&gt;&lt;/ul&gt;</contents><outline>&lt;h5&gt;Module 1: Google Cloud Dataproc Overview&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Creating and managing clusters.&lt;/li&gt;&lt;li&gt;Leveraging custom machine types and preemptible worker nodes.&lt;/li&gt;&lt;li&gt;Scaling and deleting Clusters.&lt;/li&gt;&lt;li&gt;Lab: Creating Hadoop Clusters with Google Cloud Dataproc.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 2: Running Dataproc Jobs&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Running Pig and Hive jobs.&lt;/li&gt;&lt;li&gt;Separation of storage and compute.&lt;/li&gt;&lt;li&gt;Lab: Running Hadoop and Spark Jobs with Dataproc.&lt;/li&gt;&lt;li&gt;Lab: Submit and monitor jobs.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 3: Integrating Dataproc with Google Cloud Platform&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Customize cluster with initialization actions.&lt;/li&gt;&lt;li&gt;BigQuery Support.&lt;/li&gt;&lt;li&gt;Lab: Leveraging Google Cloud Platform Services.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 4: Making Sense of Unstructured Data with Google&amp;rsquo;s Machine Learning APIs&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Google&amp;rsquo;s Machine Learning APIs.&lt;/li&gt;&lt;li&gt;Common ML Use Cases.&lt;/li&gt;&lt;li&gt;Invoking ML APIs.&lt;/li&gt;&lt;li&gt;Lab: Adding Machine Learning Capabilities to Big Data Analysis.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 5: Serverless data analysis with BigQuery&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;What is BigQuery.&lt;/li&gt;&lt;li&gt;Queries and Functions.&lt;/li&gt;&lt;li&gt;Lab: Writing queries in BigQuery.&lt;/li&gt;&lt;li&gt;Loading data into BigQuery.&lt;/li&gt;&lt;li&gt;Exporting data from BigQuery.&lt;/li&gt;&lt;li&gt;Lab: Loading and exporting data.&lt;/li&gt;&lt;li&gt;Nested and repeated fields.&lt;/li&gt;&lt;li&gt;Querying multiple tables.&lt;/li&gt;&lt;li&gt;Lab: Complex queries.&lt;/li&gt;&lt;li&gt;Performance and pricing.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 6: Serverless, autoscaling data pipelines with Dataflow&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;The Beam programming model.&lt;/li&gt;&lt;li&gt;Data pipelines in Beam Python.&lt;/li&gt;&lt;li&gt;Data pipelines in Beam Java.&lt;/li&gt;&lt;li&gt;Lab: Writing a Dataflow pipeline.&lt;/li&gt;&lt;li&gt;Scalable Big Data processing using Beam.&lt;/li&gt;&lt;li&gt;Lab: MapReduce in Dataflow.&lt;/li&gt;&lt;li&gt;Incorporating additional data.&lt;/li&gt;&lt;li&gt;Lab: Side inputs.&lt;/li&gt;&lt;li&gt;Handling stream data.&lt;/li&gt;&lt;li&gt;GCP Reference architecture.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 7: Getting started with Machine Learning&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;What is machine learning (ML).&lt;/li&gt;&lt;li&gt;Effective ML: concepts, types.&lt;/li&gt;&lt;li&gt;ML datasets: generalization.&lt;/li&gt;&lt;li&gt;Lab: Explore and create ML datasets.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 8: Building ML models with Tensorflow&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Getting started with TensorFlow.&lt;/li&gt;&lt;li&gt;Lab: Using tf.learn.&lt;/li&gt;&lt;li&gt;TensorFlow graphs and loops + lab.&lt;/li&gt;&lt;li&gt;Lab: Using low-level TensorFlow + early stopping.&lt;/li&gt;&lt;li&gt;Monitoring ML training.&lt;/li&gt;&lt;li&gt;Lab: Charts and graphs of TensorFlow training.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 9: Scaling ML models with CloudML&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Why Cloud ML?&lt;/li&gt;&lt;li&gt;Packaging up a TensorFlow model.&lt;/li&gt;&lt;li&gt;End-to-end training.&lt;/li&gt;&lt;li&gt;Lab: Run a ML model locally and on cloud.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 10: Feature Engineering&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Creating good features.&lt;/li&gt;&lt;li&gt;Transforming inputs.&lt;/li&gt;&lt;li&gt;Synthetic features.&lt;/li&gt;&lt;li&gt;Preprocessing with Cloud ML.&lt;/li&gt;&lt;li&gt;Lab: Feature engineering.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 11: Architecture of streaming analytics pipelines&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Stream data processing: Challenges.&lt;/li&gt;&lt;li&gt;Handling variable data volumes.&lt;/li&gt;&lt;li&gt;Dealing with unordered/late data.&lt;/li&gt;&lt;li&gt;Lab: Designing streaming pipeline.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 12: Ingesting Variable Volumes&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;What is Cloud Pub/Sub?&lt;/li&gt;&lt;li&gt;How it works: Topics and Subscriptions.&lt;/li&gt;&lt;li&gt;Lab: Simulator.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 13: Implementing streaming pipelines&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Challenges in stream processing.&lt;/li&gt;&lt;li&gt;Handle late data: watermarks, triggers, accumulation.&lt;/li&gt;&lt;li&gt;Lab: Stream data processing pipeline for live traffic data.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 14: Streaming analytics and dashboards&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Streaming analytics: from data to decisions.&lt;/li&gt;&lt;li&gt;Querying streaming data with BigQuery.&lt;/li&gt;&lt;li&gt;What is Google Data Studio?&lt;/li&gt;&lt;li&gt;Lab: build a real-time dashboard to visualize processed data.&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 15: High throughput and low-latency with Bigtable&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;What is Cloud Spanner?&lt;/li&gt;&lt;li&gt;Designing Bigtable schema.&lt;/li&gt;&lt;li&gt;Ingesting into Bigtable.&lt;/li&gt;&lt;li&gt;Lab: streaming into Bigtable.&lt;/li&gt;&lt;/ul&gt;</outline><objective_plain>This course teaches participants the following skills:


- Design and build data processing systems on Google Cloud Platform
- Process batch and streaming data by implementing autoscaling data pipelines on Cloud Dataflow
- Derive business insights from extremely large datasets using Google BigQuery
- Train, evaluate and predict using machine learning models using Tensorflow and Cloud ML
- Leverage unstructured data using Spark and ML APIs on Cloud Dataproc
- Enable instant insights from streaming data</objective_plain><essentials_plain>To get the most of out of this course, participants should have:


- Completed Google Cloud Fundamentals: Big Data and Machine Learning (GCF-BDM) course OR have equivalent experience
- Basic proficiency with common query language such as SQL
- Experience with data modeling, extract, transform, load activities Developing applications using a common programming language such Python
- Familiarity with Machine Learning and/or statistics</essentials_plain><audience_plain>This class is intended for experienced developers who are responsible for managing big data transformations including:


- Extracting, Loading, Transforming, cleaning, and validating data
- Designing pipelines and architectures for data processing
- Creating and maintaining machine learning and statistical models
- Querying datasets, visualizing query results and creating reports</audience_plain><contents_plain>Module 1: Google Cloud Dataproc Overview


- Creating and managing clusters.
- Leveraging custom machine types and preemptible worker nodes.
- Scaling and deleting Clusters.
- Lab: Creating Hadoop Clusters with Google Cloud Dataproc.
Module 2: Running Dataproc Jobs


- Running Pig and Hive jobs.
- Separation of storage and compute.
- Lab: Running Hadoop and Spark Jobs with Dataproc.
- Lab: Submit and monitor jobs.
Module 3: Integrating Dataproc with Google Cloud Platform


- Customize cluster with initialization actions.
- BigQuery Support.
- Lab: Leveraging Google Cloud Platform Services.
Module 4: Making Sense of Unstructured Data with Google’s Machine Learning APIs


- Google’s Machine Learning APIs.
- Common ML Use Cases.
- Invoking ML APIs.
- Lab: Adding Machine Learning Capabilities to Big Data Analysis.
Module 5: Serverless data analysis with BigQuery


- What is BigQuery.
- Queries and Functions.
- Lab: Writing queries in BigQuery.
- Loading data into BigQuery.
- Exporting data from BigQuery.
- Lab: Loading and exporting data.
- Nested and repeated fields.
- Querying multiple tables.
- Lab: Complex queries.
- Performance and pricing.
Module 6: Serverless, autoscaling data pipelines with Dataflow


- The Beam programming model.
- Data pipelines in Beam Python.
- Data pipelines in Beam Java.
- Lab: Writing a Dataflow pipeline.
- Scalable Big Data processing using Beam.
- Lab: MapReduce in Dataflow.
- Incorporating additional data.
- Lab: Side inputs.
- Handling stream data.
- GCP Reference architecture.
Module 7: Getting started with Machine Learning


- What is machine learning (ML).
- Effective ML: concepts, types.
- ML datasets: generalization.
- Lab: Explore and create ML datasets.
Module 8: Building ML models with Tensorflow


- Getting started with TensorFlow.
- Lab: Using tf.learn.
- TensorFlow graphs and loops + lab.
- Lab: Using low-level TensorFlow + early stopping.
- Monitoring ML training.
- Lab: Charts and graphs of TensorFlow training.
Module 9: Scaling ML models with CloudML


- Why Cloud ML?
- Packaging up a TensorFlow model.
- End-to-end training.
- Lab: Run a ML model locally and on cloud.
Module 10: Feature Engineering


- Creating good features.
- Transforming inputs.
- Synthetic features.
- Preprocessing with Cloud ML.
- Lab: Feature engineering.
Module 11: Architecture of streaming analytics pipelines


- Stream data processing: Challenges.
- Handling variable data volumes.
- Dealing with unordered/late data.
- Lab: Designing streaming pipeline.
Module 12: Ingesting Variable Volumes


- What is Cloud Pub/Sub?
- How it works: Topics and Subscriptions.
- Lab: Simulator.
Module 13: Implementing streaming pipelines


- Challenges in stream processing.
- Handle late data: watermarks, triggers, accumulation.
- Lab: Stream data processing pipeline for live traffic data.
Module 14: Streaming analytics and dashboards


- Streaming analytics: from data to decisions.
- Querying streaming data with BigQuery.
- What is Google Data Studio?
- Lab: build a real-time dashboard to visualize processed data.
Module 15: High throughput and low-latency with Bigtable


- What is Cloud Spanner?
- Designing Bigtable schema.
- Ingesting into Bigtable.
- Lab: streaming into Bigtable.</contents_plain><outline_plain>Module 1: Google Cloud Dataproc Overview


- Creating and managing clusters.
- Leveraging custom machine types and preemptible worker nodes.
- Scaling and deleting Clusters.
- Lab: Creating Hadoop Clusters with Google Cloud Dataproc.
Module 2: Running Dataproc Jobs


- Running Pig and Hive jobs.
- Separation of storage and compute.
- Lab: Running Hadoop and Spark Jobs with Dataproc.
- Lab: Submit and monitor jobs.
Module 3: Integrating Dataproc with Google Cloud Platform


- Customize cluster with initialization actions.
- BigQuery Support.
- Lab: Leveraging Google Cloud Platform Services.
Module 4: Making Sense of Unstructured Data with Google’s Machine Learning APIs


- Google’s Machine Learning APIs.
- Common ML Use Cases.
- Invoking ML APIs.
- Lab: Adding Machine Learning Capabilities to Big Data Analysis.
Module 5: Serverless data analysis with BigQuery


- What is BigQuery.
- Queries and Functions.
- Lab: Writing queries in BigQuery.
- Loading data into BigQuery.
- Exporting data from BigQuery.
- Lab: Loading and exporting data.
- Nested and repeated fields.
- Querying multiple tables.
- Lab: Complex queries.
- Performance and pricing.
Module 6: Serverless, autoscaling data pipelines with Dataflow


- The Beam programming model.
- Data pipelines in Beam Python.
- Data pipelines in Beam Java.
- Lab: Writing a Dataflow pipeline.
- Scalable Big Data processing using Beam.
- Lab: MapReduce in Dataflow.
- Incorporating additional data.
- Lab: Side inputs.
- Handling stream data.
- GCP Reference architecture.
Module 7: Getting started with Machine Learning


- What is machine learning (ML).
- Effective ML: concepts, types.
- ML datasets: generalization.
- Lab: Explore and create ML datasets.
Module 8: Building ML models with Tensorflow


- Getting started with TensorFlow.
- Lab: Using tf.learn.
- TensorFlow graphs and loops + lab.
- Lab: Using low-level TensorFlow + early stopping.
- Monitoring ML training.
- Lab: Charts and graphs of TensorFlow training.
Module 9: Scaling ML models with CloudML


- Why Cloud ML?
- Packaging up a TensorFlow model.
- End-to-end training.
- Lab: Run a ML model locally and on cloud.
Module 10: Feature Engineering


- Creating good features.
- Transforming inputs.
- Synthetic features.
- Preprocessing with Cloud ML.
- Lab: Feature engineering.
Module 11: Architecture of streaming analytics pipelines


- Stream data processing: Challenges.
- Handling variable data volumes.
- Dealing with unordered/late data.
- Lab: Designing streaming pipeline.
Module 12: Ingesting Variable Volumes


- What is Cloud Pub/Sub?
- How it works: Topics and Subscriptions.
- Lab: Simulator.
Module 13: Implementing streaming pipelines


- Challenges in stream processing.
- Handle late data: watermarks, triggers, accumulation.
- Lab: Stream data processing pipeline for live traffic data.
Module 14: Streaming analytics and dashboards


- Streaming analytics: from data to decisions.
- Querying streaming data with BigQuery.
- What is Google Data Studio?
- Lab: build a real-time dashboard to visualize processed data.
Module 15: High throughput and low-latency with Bigtable


- What is Cloud Spanner?
- Designing Bigtable schema.
- Ingesting into Bigtable.
- Lab: streaming into Bigtable.</outline_plain><duration unit="d" days="4">4 jours</duration><pricelist><price country="IT" currency="EUR">2600.00</price><price country="DE" currency="EUR">2600.00</price><price country="NL" currency="EUR">2695.00</price><price country="BE" currency="EUR">2695.00</price><price country="AT" currency="EUR">2600.00</price><price country="US" currency="USD">2495.00</price><price country="ES" currency="EUR">1950.00</price><price country="SG" currency="SGD">3450.00</price><price country="SE" currency="EUR">2600.00</price><price country="AE" currency="USD">2600.00</price><price country="CH" currency="CHF">3380.00</price><price country="IN" currency="USD">1500.00</price><price country="RU" currency="RUB">221000.00</price><price country="IL" currency="ILS">9020.00</price><price country="GR" currency="EUR">1950.00</price><price country="MK" currency="EUR">1950.00</price><price country="HU" currency="EUR">1950.00</price><price country="SI" currency="EUR">2600.00</price><price country="GB" currency="GBP">2640.00</price><price country="CA" currency="CAD">3445.00</price><price country="FR" currency="EUR">2990.00</price></pricelist><miles/></course>