<?xml version="1.0" encoding="utf-8" ?>
<!DOCTYPE FL_Course SYSTEM "https://www.flane.de/dtd/fl_course095.dtd"><?xml-stylesheet type="text/xsl" href="https://portal.flane.ch/css/xml-course.xsl"?><course productid="36493" language="fr" source="https://portal.flane.ch/swisscom/fr/xml-course/amazon-deaws" lastchanged="2026-02-02T10:34:37+01:00" parent="https://portal.flane.ch/swisscom/fr/xml-courses"><title>Data Engineering on AWS</title><productcode>DEAWS</productcode><vendorcode>AW</vendorcode><vendorname>Amazon Web Services</vendorname><fullproductcode>AW-DEAWS</fullproductcode><version>1.0</version><objective>&lt;p&gt;In this course, you will learn to do the following:
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Understand the foundational roles and key concepts of data engineering, including data personas, data discovery, and relevant AWS services.&lt;/li&gt;&lt;li&gt;Identify and explain the various AWS tools and services crucial for data engineering, encompassing orchestration, security, monitoring, CI/CD, IaC, networking, and cost optimization.&lt;/li&gt;&lt;li&gt;Design and implement a data lake solution on AWS, including storage, data ingestion, transformation, and serving data for consumption.&lt;/li&gt;&lt;li&gt;Optimize and secure a data lake solution by implementing open table formats, security measures, and troubleshooting common issues.&lt;/li&gt;&lt;li&gt;Design and set up a data warehouse using Amazon Redshift Serverless, understanding its architecture, data ingestion, processing, and serving capabilities.&lt;/li&gt;&lt;li&gt;Apply performance optimization techniques to data warehouses in Amazon Redshift, including monitoring, data optimization, query optimization, and orchestration.&lt;/li&gt;&lt;li&gt;Manage security and access control for data warehouses in Amazon Redshift, understanding authentication, data security, auditing, and compliance.&lt;/li&gt;&lt;li&gt;Design effective batch data pipelines using appropriate AWS services for processing and transforming data.&lt;/li&gt;&lt;li&gt;Implement comprehensive strategies for batch data pipelines, covering data processing, transformation, integration, cataloging, and serving data for consumption.&lt;/li&gt;&lt;li&gt;Optimize, orchestrate, and secure batch data pipelines, demonstrating advanced skills in data processing automation and security.&lt;/li&gt;&lt;li&gt;Architect streaming data pipelines, understanding various use cases, ingestion, storage, processing, and analysis using AWS services.&lt;/li&gt;&lt;li&gt;Optimize and secure streaming data solutions, including compliance considerations and access control.&lt;/li&gt;&lt;/ul&gt;</objective><essentials>&lt;p&gt;We recommend that attendees of this course have:
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Familiarity with basic machine learning concepts, such as supervised and unsupervised learning, regression, classification, and clustering algorithms.&lt;/li&gt;&lt;li&gt;Working knowledge of Python programming language and common data science libraries like NumPy, Pandas, and Scikit-learn.&lt;/li&gt;&lt;li&gt;Basic understanding of cloud computing concepts and familiarity with the AWS platform.&lt;/li&gt;&lt;li&gt;Familiarity with SQL and relational databases is recommended but not mandatory.&lt;/li&gt;&lt;li&gt;Experience with version control systems like Git is beneficial but not required.&lt;/li&gt;&lt;/ul&gt;</essentials><audience>&lt;p&gt;This course is designed for professionals who are interested in designing, building, optimizing, and securing data engineering solutions using AWS services.&lt;/p&gt;</audience><outline>&lt;h4&gt;Day 1&lt;/h4&gt;&lt;p&gt;&lt;strong&gt;Module 1: Data Engineering Roles and Key Concepts&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Role of a Data Engineer&lt;/li&gt;&lt;li&gt;Key functions of a Data Engineer&lt;/li&gt;&lt;li&gt;Data Personas&lt;/li&gt;&lt;li&gt;Data Discovery&lt;/li&gt;&lt;li&gt;AWS Data Services&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Module 2: AWS Data Engineering Tools and Services&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Orchestration and Automation&lt;/li&gt;&lt;li&gt;Data Engineering Security&lt;/li&gt;&lt;li&gt;Monitoring&lt;/li&gt;&lt;li&gt;Continuous Integration and Continuous Delivery&lt;/li&gt;&lt;li&gt;Infrastructure as Code&lt;/li&gt;&lt;li&gt;AWS Serverless Application Model&lt;/li&gt;&lt;li&gt;Networking Considerations&lt;/li&gt;&lt;li&gt;Cost Optimization Tools&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Module 3: Designing and Implementing Data Lakes&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Data lake introduction&lt;/li&gt;&lt;li&gt;Data lake storage&lt;/li&gt;&lt;li&gt;Ingest data into a data lake&lt;/li&gt;&lt;li&gt;Catalog data&lt;/li&gt;&lt;li&gt;Transform data&lt;/li&gt;&lt;li&gt;Server data for consumption&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Hands-on lab: Setting up a Data Lake on AWS&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Module 4: Optimizing and Securing a Data Lake Solution&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Open Table Formats&lt;/li&gt;&lt;li&gt;Security using AWS Lake Formation&lt;/li&gt;&lt;li&gt;Setting permissions with Lake Formation&lt;/li&gt;&lt;li&gt;Security and governance&lt;/li&gt;&lt;li&gt;Troubleshooting&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Hand-on lab: Automating Data Lake Creation using AWS Lake Formation Blueprints&lt;/p&gt;
&lt;h4&gt;Day 2&lt;/h4&gt;&lt;p&gt;
&lt;strong&gt;Module 5: Data Warehouse Architecture and Design Principles&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Introduction to data warehouses&lt;/li&gt;&lt;li&gt;Amazon Redshift Overview&lt;/li&gt;&lt;li&gt;Ingesting data into Redshift&lt;/li&gt;&lt;li&gt;Processing data&lt;/li&gt;&lt;li&gt;Serving data for consumption&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Hands-on Lab: Setting up a Data Warehouse using Amazon Redshift Serverless&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Module 6: Performance Optimization Techniques for Data Warehouses&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Monitoring and optimization options&lt;/li&gt;&lt;li&gt;Data optimization in Amazon Redshift&lt;/li&gt;&lt;li&gt;Query optimization in Amazon Redshift&lt;/li&gt;&lt;li&gt;Orchestration options&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Module 7: Security and Access Control for Data Warehouses&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Authentication and access control in Amazon Redshift&lt;/li&gt;&lt;li&gt;Data security in Amazon Redshift&lt;/li&gt;&lt;li&gt;Auditing and compliance in Amazon Redshift&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Hands-on lab: Managing Access Control in Redshift&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Module 8: Designing Batch Data Pipelines&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Introduction to batch data pipelines&lt;/li&gt;&lt;li&gt;Designing a batch data pipeline&lt;/li&gt;&lt;li&gt;AWS services for batch data processing&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Module 9: Implementing Strategies for Batch Data Pipeline&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Elements of a batch data pipeline&lt;/li&gt;&lt;li&gt;Processing and transforming data&lt;/li&gt;&lt;li&gt;Integrating and cataloging your data&lt;/li&gt;&lt;li&gt;Serving data for consumption&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Hands-on lab: A Day in the Life of a Data Engineer&lt;/p&gt;
&lt;h4&gt;Day 3&lt;/h4&gt;&lt;p&gt;&lt;strong&gt;Module 10: Optimizing, Orchestrating, and Securing Batch Data Pipelines&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Optimizing the batch data pipeline&lt;/li&gt;&lt;li&gt;Orchestrating the batch data pipeline&lt;/li&gt;&lt;li&gt;Securing the batch data pipeline&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Hands-on lab: Orchestrating Data Processing in Spark using AWS Step Functions&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Module 11: Streaming Data Architecture Patterns&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Introduction to streaming data pipelines&lt;/li&gt;&lt;li&gt;Ingesting data from stream sources&lt;/li&gt;&lt;li&gt;Streaming data ingestion services&lt;/li&gt;&lt;li&gt;Storing streaming data&lt;/li&gt;&lt;li&gt;Processing Streaming Data&lt;/li&gt;&lt;li&gt;Analyzing Streaming Data with AWS Services&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Hands-on lab: Streaming Analytics with Amazon Managed Service for Apache Flink&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Module 12: Optimizing and Securing Streaming Solutions&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Optimizing a streaming data solution&lt;/li&gt;&lt;li&gt;Securing a streaming data pipeline&lt;/li&gt;&lt;li&gt;Compliance considerations&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;
Hands-on lab: Access Control with Amazon Managed Streaming for Apache Kafka&lt;/p&gt;</outline><objective_plain>In this course, you will learn to do the following:



- Understand the foundational roles and key concepts of data engineering, including data personas, data discovery, and relevant AWS services.
- Identify and explain the various AWS tools and services crucial for data engineering, encompassing orchestration, security, monitoring, CI/CD, IaC, networking, and cost optimization.
- Design and implement a data lake solution on AWS, including storage, data ingestion, transformation, and serving data for consumption.
- Optimize and secure a data lake solution by implementing open table formats, security measures, and troubleshooting common issues.
- Design and set up a data warehouse using Amazon Redshift Serverless, understanding its architecture, data ingestion, processing, and serving capabilities.
- Apply performance optimization techniques to data warehouses in Amazon Redshift, including monitoring, data optimization, query optimization, and orchestration.
- Manage security and access control for data warehouses in Amazon Redshift, understanding authentication, data security, auditing, and compliance.
- Design effective batch data pipelines using appropriate AWS services for processing and transforming data.
- Implement comprehensive strategies for batch data pipelines, covering data processing, transformation, integration, cataloging, and serving data for consumption.
- Optimize, orchestrate, and secure batch data pipelines, demonstrating advanced skills in data processing automation and security.
- Architect streaming data pipelines, understanding various use cases, ingestion, storage, processing, and analysis using AWS services.
- Optimize and secure streaming data solutions, including compliance considerations and access control.</objective_plain><essentials_plain>We recommend that attendees of this course have:



- Familiarity with basic machine learning concepts, such as supervised and unsupervised learning, regression, classification, and clustering algorithms.
- Working knowledge of Python programming language and common data science libraries like NumPy, Pandas, and Scikit-learn.
- Basic understanding of cloud computing concepts and familiarity with the AWS platform.
- Familiarity with SQL and relational databases is recommended but not mandatory.
- Experience with version control systems like Git is beneficial but not required.</essentials_plain><audience_plain>This course is designed for professionals who are interested in designing, building, optimizing, and securing data engineering solutions using AWS services.</audience_plain><outline_plain>Day 1

Module 1: Data Engineering Roles and Key Concepts



- Role of a Data Engineer
- Key functions of a Data Engineer
- Data Personas
- Data Discovery
- AWS Data Services
Module 2: AWS Data Engineering Tools and Services



- Orchestration and Automation
- Data Engineering Security
- Monitoring
- Continuous Integration and Continuous Delivery
- Infrastructure as Code
- AWS Serverless Application Model
- Networking Considerations
- Cost Optimization Tools
Module 3: Designing and Implementing Data Lakes



- Data lake introduction
- Data lake storage
- Ingest data into a data lake
- Catalog data
- Transform data
- Server data for consumption

Hands-on lab: Setting up a Data Lake on AWS

Module 4: Optimizing and Securing a Data Lake Solution



- Open Table Formats
- Security using AWS Lake Formation
- Setting permissions with Lake Formation
- Security and governance
- Troubleshooting

Hand-on lab: Automating Data Lake Creation using AWS Lake Formation Blueprints

Day 2


Module 5: Data Warehouse Architecture and Design Principles



- Introduction to data warehouses
- Amazon Redshift Overview
- Ingesting data into Redshift
- Processing data
- Serving data for consumption

Hands-on Lab: Setting up a Data Warehouse using Amazon Redshift Serverless

Module 6: Performance Optimization Techniques for Data Warehouses



- Monitoring and optimization options
- Data optimization in Amazon Redshift
- Query optimization in Amazon Redshift
- Orchestration options
Module 7: Security and Access Control for Data Warehouses



- Authentication and access control in Amazon Redshift
- Data security in Amazon Redshift
- Auditing and compliance in Amazon Redshift

Hands-on lab: Managing Access Control in Redshift

Module 8: Designing Batch Data Pipelines



- Introduction to batch data pipelines
- Designing a batch data pipeline
- AWS services for batch data processing
Module 9: Implementing Strategies for Batch Data Pipeline



- Elements of a batch data pipeline
- Processing and transforming data
- Integrating and cataloging your data
- Serving data for consumption

Hands-on lab: A Day in the Life of a Data Engineer

Day 3

Module 10: Optimizing, Orchestrating, and Securing Batch Data Pipelines



- Optimizing the batch data pipeline
- Orchestrating the batch data pipeline
- Securing the batch data pipeline

Hands-on lab: Orchestrating Data Processing in Spark using AWS Step Functions

Module 11: Streaming Data Architecture Patterns



- Introduction to streaming data pipelines
- Ingesting data from stream sources
- Streaming data ingestion services
- Storing streaming data
- Processing Streaming Data
- Analyzing Streaming Data with AWS Services

Hands-on lab: Streaming Analytics with Amazon Managed Service for Apache Flink

Module 12: Optimizing and Securing Streaming Solutions



- Optimizing a streaming data solution
- Securing a streaming data pipeline
- Compliance considerations

Hands-on lab: Access Control with Amazon Managed Streaming for Apache Kafka</outline_plain><duration unit="d" days="3">3 jours</duration><pricelist><price country="US" currency="USD">2025.00</price><price country="CA" currency="CAD">2795.00</price><price country="GB" currency="GBP">2655.00</price><price country="FR" currency="EUR">2550.00</price><price country="DE" currency="EUR">1995.00</price><price country="CH" currency="CHF">1995.00</price><price country="AT" currency="EUR">1995.00</price><price country="SE" currency="EUR">1995.00</price><price country="SI" currency="EUR">1995.00</price><price country="NL" currency="EUR">1995.00</price><price country="IT" currency="EUR">1650.00</price></pricelist><miles/></course>