<?xml version="1.0" encoding="utf-8" ?>
<!DOCTYPE FL_Course SYSTEM "https://www.flane.de/dtd/fl_course095.dtd"><?xml-stylesheet type="text/xsl" href="https://portal.flane.ch/css/xml-course.xsl"?><course productid="34488" language="fr" source="https://portal.flane.ch/swisscom/fr/xml-course/nvidia-adep" lastchanged="2025-07-29T12:18:27+02:00" parent="https://portal.flane.ch/swisscom/fr/xml-courses"><title>Accelerating Data Engineering Pipelines</title><productcode>ADEP</productcode><vendorcode>NV</vendorcode><vendorname>Nvidia</vendorname><fullproductcode>NV-ADEP</fullproductcode><version>1.0</version><objective>&lt;ul&gt;
&lt;li&gt;How data moves within a computer. How to build the right balance between CPU, DRAM, Disk Memory, and GPUs.&lt;/li&gt;&lt;li&gt;How different file formats can be read and manipulated by hardware.&lt;/li&gt;&lt;li&gt;How to scale an ETL pipeline with multiple GPUs using NVTabular.&lt;/li&gt;&lt;li&gt;How to build an interactive Plotly dashboard where users can filter on millions of data points in less than a second.&lt;/li&gt;&lt;/ul&gt;</objective><essentials>&lt;ul&gt;
&lt;li&gt;Intermediate knowledge of Python (list comprehension, objects)&lt;/li&gt;&lt;li&gt;Familiarity with pandas a plus&lt;/li&gt;&lt;li&gt;Introductory statistics (mean, median, mode)&lt;/li&gt;&lt;/ul&gt;</essentials><outline>&lt;p&gt;&lt;strong&gt;Introduction	&lt;/strong&gt;
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Meet the instructor.&lt;/li&gt;&lt;li&gt;Create an account at courses.nvidia.com/join&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Data on the Hardware Level&lt;/strong&gt;	
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Explore the strengths and weaknesses of different hardware approaches to data and the frameworks that support them:&lt;ul&gt;
&lt;li&gt;Pandas&lt;/li&gt;&lt;li&gt;CuDF&lt;/li&gt;&lt;li&gt;Dask&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;ETL with NVTabular&lt;/strong&gt;	
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Learn how to scale an ETL pipeline from 1 GPU to many with NVTabular through the perspective of a big data recommender system.&lt;ul&gt;
&lt;li&gt;Transform raw json into analysis-ready parquet files&lt;/li&gt;&lt;li&gt;Learn how to quickly add features to a dataset, such as Categorify and Lambda operators&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Data Visualization&lt;/strong&gt;	
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Step into the shoes of a meteorologist and learn how to plot precipitation data on a map.&lt;/li&gt;&lt;li&gt;Learn how to use descriptive statistics and plots like histograms in order to assess data quality&lt;/li&gt;&lt;li&gt;Learn effective memory usage, so users can quickly filter data through a graphical interface&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Final Project: Data Detective&lt;/strong&gt;	
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Users are complaining that the dashboard is too slow. Apply the techniques learned in class to find and eliminate efficiencies in the backend code&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt;Final Review&lt;/strong&gt;	
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Review key learnings and answer questions.&lt;/li&gt;&lt;li&gt;Complete the assessment and earn your certificate.&lt;/li&gt;&lt;li&gt;Complete the workshop survey.&lt;/li&gt;&lt;li&gt;Learn how to set up your own AI application development environment.&lt;/li&gt;&lt;/ul&gt;</outline><objective_plain>- How data moves within a computer. How to build the right balance between CPU, DRAM, Disk Memory, and GPUs.
- How different file formats can be read and manipulated by hardware.
- How to scale an ETL pipeline with multiple GPUs using NVTabular.
- How to build an interactive Plotly dashboard where users can filter on millions of data points in less than a second.</objective_plain><essentials_plain>- Intermediate knowledge of Python (list comprehension, objects)
- Familiarity with pandas a plus
- Introductory statistics (mean, median, mode)</essentials_plain><outline_plain>Introduction	



- Meet the instructor.
- Create an account at courses.nvidia.com/join
Data on the Hardware Level	



- Explore the strengths and weaknesses of different hardware approaches to data and the frameworks that support them:
- Pandas
- CuDF
- Dask
ETL with NVTabular	



- Learn how to scale an ETL pipeline from 1 GPU to many with NVTabular through the perspective of a big data recommender system.
- Transform raw json into analysis-ready parquet files
- Learn how to quickly add features to a dataset, such as Categorify and Lambda operators
Data Visualization	



- Step into the shoes of a meteorologist and learn how to plot precipitation data on a map.
- Learn how to use descriptive statistics and plots like histograms in order to assess data quality
- Learn effective memory usage, so users can quickly filter data through a graphical interface
Final Project: Data Detective	



- Users are complaining that the dashboard is too slow. Apply the techniques learned in class to find and eliminate efficiencies in the backend code
Final Review	



- Review key learnings and answer questions.
- Complete the assessment and earn your certificate.
- Complete the workshop survey.
- Learn how to set up your own AI application development environment.</outline_plain><duration unit="d" days="1">1 jour</duration><pricelist><price country="US" currency="USD">500.00</price><price country="DE" currency="EUR">500.00</price><price country="AT" currency="EUR">500.00</price><price country="SE" currency="EUR">500.00</price><price country="SI" currency="EUR">500.00</price><price country="GB" currency="GBP">420.00</price><price country="IT" currency="EUR">500.00</price><price country="CA" currency="CAD">690.00</price></pricelist><miles/></course>