<?xml version="1.0" encoding="utf-8" ?>
<!DOCTYPE FL_Course SYSTEM "https://www.flane.de/dtd/fl_course095.dtd"><?xml-stylesheet type="text/xsl" href="https://portal.flane.ch/css/xml-course.xsl"?><course productid="29831" language="fr" source="https://portal.flane.ch/swisscom/fr/xml-course/amazon-bbdas" lastchanged="2025-08-27T16:56:02+02:00" parent="https://portal.flane.ch/swisscom/fr/xml-courses"><title>Building Batch Data Analytics Solutions on AWS</title><productcode>BBDAS</productcode><vendorcode>AW</vendorcode><vendorname>Amazon Web Services</vendorname><fullproductcode>AW-BBDAS</fullproductcode><version>1.0</version><objective>&lt;p&gt;In this course, you will learn to:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Compare the features and benefits of data warehouses, data lakes, and modern data architectures&lt;/li&gt;&lt;li&gt;Design and implement a batch data analytics solution&lt;/li&gt;&lt;li&gt;Identify and apply appropriate techniques, including compression, to optimize data storage&lt;/li&gt;&lt;li&gt;Select and deploy appropriate options to ingest, transform, and store data&lt;/li&gt;&lt;li&gt;Choose the appropriate instance and node types, clusters, auto scaling, and network topology for a particular business use case&lt;/li&gt;&lt;li&gt;Understand how data storage and processing affect the analysis and visualization mechanisms needed to gain actionable business insights&lt;/li&gt;&lt;li&gt;Secure data at rest and in transit&lt;/li&gt;&lt;li&gt;Monitor analytics workloads to identify and remediate problems&lt;/li&gt;&lt;li&gt;Apply cost management best practices&lt;/li&gt;&lt;/ul&gt;</objective><essentials>&lt;p&gt;Students with a minimum one-year experience managing open-source data frameworks such as Apache Spark or Apache Hadoop will benefit from this course.&lt;/p&gt;
&lt;p&gt;We recommend that attendees of this course have: 
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Completed either &lt;span class=&quot;cms-link-marked&quot;&gt;&lt;a class=&quot;fl-href-prod&quot; href=&quot;/swisscom/fr/course/amazon-awsa&quot;&gt;&lt;svg role=&quot;img&quot; aria-hidden=&quot;true&quot; focusable=&quot;false&quot; data-nosnippet class=&quot;cms-linkmark&quot;&gt;&lt;use xlink:href=&quot;/css/img/icnset-linkmarks.svg#linkmark&quot;&gt;&lt;/use&gt;&lt;/svg&gt;Architecting on AWS &lt;span class=&quot;fl-prod-pcode&quot;&gt;(AWSA)&lt;/span&gt;&lt;/a&gt;&lt;/span&gt; or &lt;span class=&quot;cms-link-marked&quot;&gt;&lt;a class=&quot;fl-href-prod&quot; href=&quot;/swisscom/fr/course/amazon-awse&quot;&gt;&lt;svg role=&quot;img&quot; aria-hidden=&quot;true&quot; focusable=&quot;false&quot; data-nosnippet class=&quot;cms-linkmark&quot;&gt;&lt;use xlink:href=&quot;/css/img/icnset-linkmarks.svg#linkmark&quot;&gt;&lt;/use&gt;&lt;/svg&gt;AWS Technical Essentials &lt;span class=&quot;fl-prod-pcode&quot;&gt;(AWSE)&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;Completed either &lt;span class=&quot;cms-link-marked&quot;&gt;&lt;a class=&quot;fl-href-prod&quot; href=&quot;/swisscom/fr/course/amazon-bdla&quot;&gt;&lt;svg role=&quot;img&quot; aria-hidden=&quot;true&quot; focusable=&quot;false&quot; data-nosnippet class=&quot;cms-linkmark&quot;&gt;&lt;use xlink:href=&quot;/css/img/icnset-linkmarks.svg#linkmark&quot;&gt;&lt;/use&gt;&lt;/svg&gt;Building Data Lakes on AWS &lt;span class=&quot;fl-prod-pcode&quot;&gt;(BDLA)&lt;/span&gt;&lt;/a&gt;&lt;/span&gt; or Getting Started with AWS Glue&lt;/li&gt;&lt;/ul&gt;</essentials><audience>&lt;p&gt;This course is intended for:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Data platform engineers&lt;/li&gt;&lt;li&gt;Architects and operators who build and manage data analytics pipelines&lt;/li&gt;&lt;/ul&gt;</audience><contents>&lt;h5&gt;Module A: Overview of Data Analytics and the Data Pipeline&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Data analytics use cases&lt;/li&gt;&lt;li&gt;Using the data pipeline for analytics&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 1: Introduction to Amazon EMR&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Using Amazon EMR in analytics solutions&lt;/li&gt;&lt;li&gt;Amazon EMR cluster architecture&lt;/li&gt;&lt;li&gt;Interactive Demo 1: Launching an Amazon EMR cluster&lt;/li&gt;&lt;li&gt;Cost management strategies&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 2: Data Analytics Pipeline Using Amazon EMR: Ingestion and Storage&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Storage optimization with Amazon EMR&lt;/li&gt;&lt;li&gt;Data ingestion techniques&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 3: High-Performance Batch Data Analytics Using Apache Spark on Amazon EMR&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Apache Spark on Amazon EMR use cases&lt;/li&gt;&lt;li&gt;Why Apache Spark on Amazon EMR&lt;/li&gt;&lt;li&gt;Spark concepts&lt;/li&gt;&lt;li&gt;Interactive Demo 2: Connect to an EMR cluster and perform Scala commands using the Spark shell&lt;/li&gt;&lt;li&gt;Transformation, processing, and analytics&lt;/li&gt;&lt;li&gt;Using notebooks with Amazon EMR&lt;/li&gt;&lt;li&gt;Practice Lab 1: Low-latency data analytics using Apache Spark on Amazon EMR&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 4: Processing and Analyzing Batch Data with Amazon EMR and Apache Hive&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Using Amazon EMR with Hive to process batch data&lt;/li&gt;&lt;li&gt;Transformation, processing, and analytics&lt;/li&gt;&lt;li&gt;Practice Lab 2: Batch data processing using Amazon EMR with Hive&lt;/li&gt;&lt;li&gt;Introduction to Apache HBase on Amazon EMR&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 5: Serverless Data Processing&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Serverless data processing, transformation, and analytics&lt;/li&gt;&lt;li&gt;Using AWS Glue with Amazon EMR workloads&lt;/li&gt;&lt;li&gt;Practice Lab 3: Orchestrate data processing in Spark using AWS Step Functions&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 6: Security and Monitoring of Amazon EMR Clusters&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Securing EMR clusters&lt;/li&gt;&lt;li&gt;Interactive Demo 3: Client-side encryption with EMRFS&lt;/li&gt;&lt;li&gt;Monitoring and troubleshooting Amazon EMR clusters&lt;/li&gt;&lt;li&gt;Demo: Reviewing Apache Spark cluster history&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module 7: Designing Batch Data Analytics Solutions&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Batch data analytics use cases&lt;/li&gt;&lt;li&gt;Activity: Designing a batch data analytics workflow&lt;/li&gt;&lt;/ul&gt;&lt;h5&gt;Module B: Developing Modern Data Architectures on AWS&lt;/h5&gt;&lt;ul&gt;
&lt;li&gt;Modern data architectures&lt;/li&gt;&lt;/ul&gt;</contents><objective_plain>In this course, you will learn to:


- Compare the features and benefits of data warehouses, data lakes, and modern data architectures
- Design and implement a batch data analytics solution
- Identify and apply appropriate techniques, including compression, to optimize data storage
- Select and deploy appropriate options to ingest, transform, and store data
- Choose the appropriate instance and node types, clusters, auto scaling, and network topology for a particular business use case
- Understand how data storage and processing affect the analysis and visualization mechanisms needed to gain actionable business insights
- Secure data at rest and in transit
- Monitor analytics workloads to identify and remediate problems
- Apply cost management best practices</objective_plain><essentials_plain>Students with a minimum one-year experience managing open-source data frameworks such as Apache Spark or Apache Hadoop will benefit from this course.

We recommend that attendees of this course have: 



- Completed either Architecting on AWS (AWSA) or AWS Technical Essentials (AWSE)
- Completed either Building Data Lakes on AWS (BDLA) or Getting Started with AWS Glue</essentials_plain><audience_plain>This course is intended for:


- Data platform engineers
- Architects and operators who build and manage data analytics pipelines</audience_plain><contents_plain>Module A: Overview of Data Analytics and the Data Pipeline


- Data analytics use cases
- Using the data pipeline for analytics
Module 1: Introduction to Amazon EMR


- Using Amazon EMR in analytics solutions
- Amazon EMR cluster architecture
- Interactive Demo 1: Launching an Amazon EMR cluster
- Cost management strategies
Module 2: Data Analytics Pipeline Using Amazon EMR: Ingestion and Storage


- Storage optimization with Amazon EMR
- Data ingestion techniques
Module 3: High-Performance Batch Data Analytics Using Apache Spark on Amazon EMR


- Apache Spark on Amazon EMR use cases
- Why Apache Spark on Amazon EMR
- Spark concepts
- Interactive Demo 2: Connect to an EMR cluster and perform Scala commands using the Spark shell
- Transformation, processing, and analytics
- Using notebooks with Amazon EMR
- Practice Lab 1: Low-latency data analytics using Apache Spark on Amazon EMR
Module 4: Processing and Analyzing Batch Data with Amazon EMR and Apache Hive


- Using Amazon EMR with Hive to process batch data
- Transformation, processing, and analytics
- Practice Lab 2: Batch data processing using Amazon EMR with Hive
- Introduction to Apache HBase on Amazon EMR
Module 5: Serverless Data Processing


- Serverless data processing, transformation, and analytics
- Using AWS Glue with Amazon EMR workloads
- Practice Lab 3: Orchestrate data processing in Spark using AWS Step Functions
Module 6: Security and Monitoring of Amazon EMR Clusters


- Securing EMR clusters
- Interactive Demo 3: Client-side encryption with EMRFS
- Monitoring and troubleshooting Amazon EMR clusters
- Demo: Reviewing Apache Spark cluster history
Module 7: Designing Batch Data Analytics Solutions


- Batch data analytics use cases
- Activity: Designing a batch data analytics workflow
Module B: Developing Modern Data Architectures on AWS


- Modern data architectures</contents_plain><duration unit="d" days="1">1 jour</duration><pricelist><price country="IT" currency="EUR">590.00</price><price country="SI" currency="EUR">795.00</price><price country="DE" currency="EUR">750.00</price><price country="AT" currency="EUR">750.00</price><price country="SE" currency="EUR">750.00</price><price country="PL" currency="PLN">2000.00</price><price country="US" currency="USD">675.00</price><price country="IL" currency="ILS">2610.00</price><price country="AE" currency="USD">750.00</price><price country="GR" currency="EUR">795.00</price><price country="MK" currency="EUR">795.00</price><price country="HU" currency="EUR">795.00</price><price country="NL" currency="EUR">795.00</price><price country="BE" currency="EUR">795.00</price><price country="GB" currency="GBP">900.00</price><price country="CH" currency="CHF">870.00</price><price country="CA" currency="CAD">930.00</price><price country="FR" currency="EUR">850.00</price></pricelist><miles/></course>