AWS Data Engineering Project Labs
0 Get Ready for the Roller Coaster Ride
Comment - Your effort can change your career trajectory!
1 Create AWS Account & Set Up AWS CLI
1 Intro - AWS Setup
2 Create AWS Account
3 Log in to AWS using the Root User and go to IAM
4 Create Admin User with Console and Programmatic Access
5 Download and Install AWS CLI
6 Create Access Key for AWS CLI Access
7 Configure AWS CLI on your system
8 VIMP - Set up three AWS Budgets
8.5 Set Default Region to us-east-1
9 Comment Needed - Outro - AWS Set Up
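Once the CLI is configured, it is worth sanity-checking the setup before moving on. A minimal sketch, assuming boto3 is installed and the admin user's access key from lessons 6-7 is the default profile with us-east-1 as the region (lesson 8.5):

```python
# Sanity-check the AWS CLI / boto3 setup from this section.
import boto3

session = boto3.Session()
print("Region:", session.region_name)        # expect: us-east-1

sts = session.client("sts")
identity = sts.get_caller_identity()         # fails fast if the keys are wrong
print("Account:", identity["Account"])
print("Caller ARN:", identity["Arn"])        # should be the admin user, not root
```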
2 Set Up for Data Analysis
1a. Intro - Set up for Analysis
1b. Datalake Setup
2. Download Data from NYC TLC
3. Move the downloaded file to your project folder
4. Create the datalake S3 bucket & upload the data to it
5. Comment Needed - Hands On Starts
6. Create Glue Catalog Database
7. Create a crawler to crawl the data and run the crawler
8. View the Parquet Data in Athena
9. Comment Needed - Outro
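For reference, a minimal boto3 sketch of the setup steps in this section: bucket, Glue Catalog database, and a crawler over the raw parquet. All names (bucket, database, crawler, IAM role, local filename) are placeholders, not the course's actual values.

```python
# Sketch of the data lake setup: S3 bucket, Glue database, crawler.
import boto3

s3 = boto3.client("s3", region_name="us-east-1")
glue = boto3.client("glue", region_name="us-east-1")

bucket = "my-nyc-taxi-datalake"        # placeholder bucket name
s3.create_bucket(Bucket=bucket)        # us-east-1 needs no LocationConstraint

# Upload the NYC TLC parquet file downloaded earlier (filename is a placeholder).
s3.upload_file("yellow_tripdata_2024-01.parquet",
               bucket, "raw/yellow_tripdata_2024-01.parquet")

# Glue Catalog database to hold the crawled table definitions.
glue.create_database(DatabaseInput={"Name": "nyc_taxi_db"})

# Crawler that infers the parquet schema and registers a table.
glue.create_crawler(
    Name="nyc-taxi-raw-crawler",
    Role="arn:aws:iam::123456789012:role/GlueCrawlerRole",  # placeholder role
    DatabaseName="nyc_taxi_db",
    Targets={"S3Targets": [{"Path": f"s3://{bucket}/raw/"}]},
)
glue.start_crawler(Name="nyc-taxi-raw-crawler")
# Once the crawler finishes, the table can be queried from Athena.
```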
3 Build an Ingestion (Extract of ETL) Job
1. Comment Below - Mandatory detour to Course RADE™ Agentic Data Engineering with Amazon Q
2. Before the Hands On Continues - VIMP!
3. Create the Glue Ingestion Job
4. Run Glue Crawler again to update the Table Metadata with partitions
5. Create Crawler for Zone Table
6. Comment Below - Next Step is Very Important
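A sketch of what a Glue ingestion job like the one in lesson 3 might look like: read the raw TLC parquet and write it back out partitioned, which is why lesson 4 re-runs the crawler to pick up the new partitions. Paths are placeholders; tpep_pickup_datetime is the yellow-taxi pickup column in the TLC data, but the course's actual script may differ.

```python
# Sketch of a Glue ingestion job: read raw parquet, write it out
# partitioned by year/month so the crawler can register the partitions.
import sys
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job
from pyspark.sql import functions as F

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
sc = SparkContext()
glueContext = GlueContext(sc)
spark = glueContext.spark_session
job = Job(glueContext)
job.init(args["JOB_NAME"], args)

df = spark.read.parquet("s3://my-nyc-taxi-datalake/raw/")   # placeholder path

# Derive partition columns from the pickup timestamp.
df = (df.withColumn("year", F.year("tpep_pickup_datetime"))
        .withColumn("month", F.month("tpep_pickup_datetime")))

(df.write.mode("overwrite")
   .partitionBy("year", "month")
   .parquet("s3://my-nyc-taxi-datalake/ingested/"))         # placeholder path

job.commit()
```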
4 Understanding and Gathering Requirements
1. Intro - Understanding and Gathering Requirements
2. Comment - Capstone Requirements and Guidance on Completing It - 3 Options
3. Analysis by Data Analysts, Meeting with Them, and Proposing a 3-Layer Structure
4. Understanding Requirements using the Mapping Sheet
5. Comment - Decide on Technical Requirements & Gear Up for Development
5 Build the data transformation job - Raw to Curated
1 Open the notebook file in your VS Code
2 Start Iterative Development - Load the data
3 Comment - Transform, Validate and Load to Curated
4 Create and Run Glue Script Job
5 Comment - Why did the script run longer?
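The transform-validate-load pattern this section develops iteratively might look roughly like the sketch below. Column names, validation rules, and paths are illustrative assumptions, not the course's actual script.

```python
# Sketch of the raw-to-curated step: load, validate, transform, write.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("raw-to-curated").getOrCreate()

raw = spark.read.parquet("s3://my-nyc-taxi-datalake/ingested/")  # placeholder

curated = (
    raw
    # Basic validation: drop records that make no physical sense.
    .filter(F.col("trip_distance") > 0)
    .filter(F.col("fare_amount") >= 0)
    .filter(F.col("tpep_dropoff_datetime") > F.col("tpep_pickup_datetime"))
    # Example derived column.
    .withColumn(
        "trip_duration_min",
        (F.unix_timestamp("tpep_dropoff_datetime")
         - F.unix_timestamp("tpep_pickup_datetime")) / 60.0,
    )
)

(curated.write.mode("overwrite")
        .partitionBy("year", "month")
        .parquet("s3://my-nyc-taxi-datalake/curated/"))          # placeholder
```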
6 Unit Testing, Analysis, and Further Development
0 Comment - What will you learn in this section?
1 Time to unit test before we hand over the data to the data analysts
2 Unit test the curated layer, fill in the unit test document, and hand it over to the data analysts
3 Analyse the Curated to Aggregated Mapping Sheet
4 Create a Glue Notebook, load the curated data, and enrich it with Zone information
5 Create the aggregated tables and load the aggregated data into S3
6 Get the script from the notebook and stop the notebook
7 Get the script vetted by AI
8 Create and Run the Glue ETL job to load from Curated to Aggregated
9 What's the next step?
10 Create aggregated tables with the help of the Crawler
11 Unit test the aggregated data, provide access to the data analysts, and let them know
12 Comment - What have you learnt so far?
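A sketch of the curated-to-aggregated step this section builds, with zone enrichment and a couple of unit-test-style checks before handover. Table, column, and path names are placeholders mirroring the lessons above (PULocationID/LocationID/Zone follow the TLC zone lookup), not the course's actual code.

```python
# Sketch: enrich trips with zone names, aggregate, sanity-check, write.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("curated-to-aggregated").getOrCreate()

trips = spark.read.parquet("s3://my-nyc-taxi-datalake/curated/")   # placeholder
zones = spark.read.parquet("s3://my-nyc-taxi-datalake/zones/")     # zone lookup

# Enrich with pickup zone info, then aggregate per zone and month.
agg = (
    trips.join(zones, trips.PULocationID == zones.LocationID, "left")
         .groupBy("Zone", "year", "month")
         .agg(
             F.count("*").alias("trip_count"),
             F.sum("fare_amount").alias("total_fare"),
             F.avg("trip_distance").alias("avg_distance"),
         )
)

# Unit-test style checks before handing over to the analysts.
assert agg.count() > 0, "aggregated output is empty"
assert agg.filter(F.col("total_fare") < 0).count() == 0, "negative fares leaked through"

(agg.write.mode("overwrite")
    .partitionBy("year", "month")
    .parquet("s3://my-nyc-taxi-datalake/aggregated/trips_by_zone/"))  # placeholder
```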
7 Very Important Section - Historical & Incremental Loading
1 Historical vs Monthly Cadence
2 Comment - Code for both historical and current - run the job
3 Why do we even need to think of incremental processing of data?
4 Extremely Important from Interview Perspective - Enable Job bookmarks for incremental processing
4.1 Glue DynamicFrame vs Spark DataFrame
5 Comment - Incremental Processing from Curated to Aggregated
6 Solution to the Previous Exercise
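The job-bookmark pattern from lesson 4, in sketch form. Bookmarks track what a job has already read, so reruns only process new data; this requires the job to be created or updated with --job-bookmark-option job-bookmark-enable and a stable transformation_ctx on the source read. Database and table names are placeholders.

```python
# Sketch of incremental processing with Glue job bookmarks.
import sys
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glueContext = GlueContext(SparkContext())
job = Job(glueContext)
job.init(args["JOB_NAME"], args)   # bookmark state is keyed by the job name

# Reading via the catalog with a transformation_ctx lets the bookmark
# remember which partitions/files were already processed.
dyf = glueContext.create_dynamic_frame.from_catalog(
    database="nyc_taxi_db",        # placeholder
    table_name="curated_trips",    # placeholder
    transformation_ctx="read_curated",
)

df = dyf.toDF()   # DynamicFrame -> Spark DataFrame (lesson 4.1) for transforms
# ... transforms and the write to the aggregated layer go here ...

job.commit()      # advances the bookmark only after a successful run
```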
8 Orchestration - Build the Pipeline
1. Background for Orchestration
2. Create SNS Topic and Subscribe your email
3. Get the Step Functions Code
4. Build the sequential pipeline using Step Functions and run it
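A boto3 sketch of the pipeline this section assembles: two Glue jobs run sequentially via Step Functions, with an SNS notification if either fails. Job names, topic name, email, and ARNs are all placeholders, not the course's actual state machine.

```python
# Sketch: SNS topic + sequential Glue pipeline as a Step Functions state machine.
import json
import boto3

sns = boto3.client("sns", region_name="us-east-1")
sfn = boto3.client("stepfunctions", region_name="us-east-1")

topic_arn = sns.create_topic(Name="pipeline-alerts")["TopicArn"]
# Subscribe your email, then confirm via the message AWS sends you.
sns.subscribe(TopicArn=topic_arn, Protocol="email", Endpoint="you@example.com")

definition = {
    "StartAt": "RawToCurated",
    "States": {
        "RawToCurated": {
            "Type": "Task",
            "Resource": "arn:aws:states:::glue:startJobRun.sync",  # waits for the job
            "Parameters": {"JobName": "raw-to-curated-job"},       # placeholder
            "Catch": [{"ErrorEquals": ["States.ALL"], "Next": "NotifyFailure"}],
            "Next": "CuratedToAggregated",
        },
        "CuratedToAggregated": {
            "Type": "Task",
            "Resource": "arn:aws:states:::glue:startJobRun.sync",
            "Parameters": {"JobName": "curated-to-aggregated-job"},  # placeholder
            "Catch": [{"ErrorEquals": ["States.ALL"], "Next": "NotifyFailure"}],
            "End": True,
        },
        "NotifyFailure": {
            "Type": "Task",
            "Resource": "arn:aws:states:::sns:publish",
            "Parameters": {"TopicArn": topic_arn, "Message": "Taxi pipeline failed"},
            "End": True,
        },
    },
}

sfn.create_state_machine(
    name="nyc-taxi-pipeline",
    definition=json.dumps(definition),
    roleArn="arn:aws:iam::123456789012:role/StepFunctionsGlueRole",  # placeholder
)
```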
9 Schedule the job - monthly cadence
1. EventBridge Schedule - One-Time Test
2. Set up the Monthly Cadence!
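The monthly cadence might be wired up roughly as below: an EventBridge rule on a cron expression targeting the Step Functions state machine. The rule name, state machine ARN, role ARN, and run time are placeholder assumptions.

```python
# Sketch: trigger the pipeline on the 1st of each month via EventBridge.
import boto3

events = boto3.client("events", region_name="us-east-1")

# cron(minute hour day-of-month month day-of-week year)
events.put_rule(
    Name="nyc-taxi-monthly",
    ScheduleExpression="cron(0 6 1 * ? *)",   # 06:00 UTC on day 1, every month
    State="ENABLED",
)

events.put_targets(
    Rule="nyc-taxi-monthly",
    Targets=[{
        "Id": "taxi-pipeline",
        "Arn": "arn:aws:states:us-east-1:123456789012:stateMachine:nyc-taxi-pipeline",
        "RoleArn": "arn:aws:iam::123456789012:role/EventBridgeSfnRole",  # placeholder
    }],
)
```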
10 Where Do You Go Next?