0 Get Ready for the Roller Coaster Ride (1 Lesson)
  Preview: Comment - Your effort can change your career trajectory!

1 Create AWS Account & Set Up AWS CLI (11 Lessons)
  AWS Set Up
  1. Intro - AWS Setup
  2. Create AWS Account
  3. Log in to AWS using the Root User and go to IAM
  4. Create an Admin User with Console and Programmatic Access
  5. Download and Install the AWS CLI
  6. Create an Access Key for AWS CLI Access
  7. Configure the AWS CLI on your system
  8. VIMP - Set Up Three AWS Budgets
  8.5 Set the Default Region to us-east-1
  9. Comment Needed - Outro - AWS Set Up

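Lesson 7's `aws configure` step persists your credentials and default region as two INI files under `~/.aws/` (`credentials` and `config`). A minimal sketch of what those files end up containing, using placeholder key values (never commit real keys):

```python
import configparser
from io import StringIO

# `aws configure` writes two INI files: ~/.aws/credentials (keys)
# and ~/.aws/config (region, output format). The key values below
# are placeholders, not real credentials.

def render_aws_config(access_key_id, secret_access_key, region="us-east-1"):
    credentials = configparser.ConfigParser()
    credentials["default"] = {
        "aws_access_key_id": access_key_id,
        "aws_secret_access_key": secret_access_key,
    }
    config = configparser.ConfigParser()
    config["default"] = {"region": region, "output": "json"}

    def dump(parser):
        buf = StringIO()
        parser.write(buf)
        return buf.getvalue()

    return dump(credentials), dump(config)

creds_text, config_text = render_aws_config("AKIAEXAMPLE", "wJalrEXAMPLEKEY")
print(config_text)
```

Setting `us-east-1` as the default (lesson 8.5) means every CLI call and SDK session picks up that region without an explicit `--region` flag.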
2 Set-Up for Data Analysis (11 Lessons)
  Set Up for Data Analysis
  1a. Intro - Set Up for Analysis
  1b. Datalake Setup
  2. Download Data from NYC TLC
  3. Move the downloaded file to your project folder
  4. Create an S3 bucket in the datalake & upload data to it
  5. Comment Needed - Hands-On Starts
  6. Create a Glue Catalog Database
  7. Create a crawler to crawl the data, then run it
  8. View the Parquet Data in Athena
  9. Comment Needed - Outro

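Lessons 2-4 download a monthly NYC TLC Parquet file and place it in the datalake bucket. A sketch of the two naming conventions involved, assuming the TLC's public CloudFront URL pattern and a hypothetical Hive-style `year=/month=` prefix for the bucket (your own layout may differ):

```python
# The NYC TLC publishes monthly trip data as Parquet files. The URL
# pattern below matches the public CloudFront distribution; the S3
# prefix layout is an assumed example, not prescribed by the course.
TLC_BASE = "https://d37ci6vzurychx.cloudfront.net/trip-data"

def tlc_download_url(dataset: str, year: int, month: int) -> str:
    return f"{TLC_BASE}/{dataset}_tripdata_{year}-{month:02d}.parquet"

def raw_s3_key(dataset: str, year: int, month: int) -> str:
    # Hive-style partition folders (year=/month=) let a Glue crawler
    # register year and month as partition columns automatically.
    return (f"raw/{dataset}/year={year}/month={month:02d}/"
            f"{dataset}_tripdata_{year}-{month:02d}.parquet")

url = tlc_download_url("yellow", 2024, 1)
key = raw_s3_key("yellow", 2024, 1)
print(url)
print(key)
```

Once the crawler (lesson 7) has catalogued the data, Athena (lesson 8) queries it by table name; the partition folders become filterable columns.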
3 Build an Ingestion (Extract of ETL) Job (6 Lessons)
  1. Comment Below - Mandatory Detour to the Course RADE™ Agentic Data Engineering with Amazon Q
  2. Before the Hands-On Continues - VIMP!
  3. Create the Glue Ingestion Job
  4. Run the Glue Crawler Again to Update the Table Metadata with Partitions
  5. Create a Crawler for the Zone Table
  6. Comment Below - The Next Step Is Very Important

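Lesson 4's point is that after the ingestion job writes new files under the `year=/month=` folders, re-running the crawler is what registers those folders as partitions in the catalog. A pure-Python sketch of that discovery step over a list of object keys (the names here are illustrative, not the Glue API):

```python
import re

# Mimic the crawler's partition discovery: scan object keys for
# Hive-style year=/month= folders and collect the distinct pairs.
PARTITION_RE = re.compile(r"year=(\d{4})/month=(\d{2})")

def discover_partitions(keys):
    partitions = set()
    for key in keys:
        m = PARTITION_RE.search(key)
        if m:
            partitions.add((m.group(1), m.group(2)))
    return sorted(partitions)

keys = [
    "raw/yellow/year=2024/month=01/part-0000.parquet",
    "raw/yellow/year=2024/month=01/part-0001.parquet",
    "raw/yellow/year=2024/month=02/part-0000.parquet",
]
print(discover_partitions(keys))  # [('2024', '01'), ('2024', '02')]
```

Until the crawler runs, Athena has no record of the new month's partition, so queries silently miss the new data.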
4 Understanding and Gathering Requirements (6 Lessons)
  Before the Development Begins
  1. Intro - Understanding and Gathering Requirements
  2. Comment - Capstone Requirements and Guidance on Completing It (3 Options)
  3. Analysis by the Data Analysts, a Meeting with Them, and Proposing a 3-Layer Structure
  4. Understanding the Requirements Using the Mapping Sheet
  5. Comment - Decide on Technical Requirements & Gear Up for Development

5 Build the Data Transformation Job - Raw to Curated (5 Lessons)
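The raw-to-curated transformation boils down to rejecting malformed records and normalising types before analysts touch the data. A pure-Python sketch of typical rules for taxi trips, using plain dicts; in the course the same logic would live in a Glue/Spark job, and the column names follow the NYC TLC schema while the thresholds are assumptions:

```python
# Raw-to-curated cleanup: drop incomplete or implausible trip
# records, normalise the fare to a rounded float.

def curate(records):
    curated = []
    for r in records:
        fare = r.get("fare_amount")
        dist = r.get("trip_distance")
        if fare is None or dist is None:
            continue            # reject incomplete rows
        if fare < 0 or dist <= 0:
            continue            # reject refunds / zero-distance trips
        curated.append({**r, "fare_amount": round(float(fare), 2)})
    return curated

raw = [
    {"fare_amount": 12.5, "trip_distance": 3.1},
    {"fare_amount": -4.0, "trip_distance": 1.0},   # refund: dropped
    {"fare_amount": None, "trip_distance": 2.0},   # incomplete: dropped
]
print(curate(raw))
```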
6 Unit Testing, Analysis, and Further Development (13 Lessons)
  0. Comment - What Will You Learn in This Section
  1. Time to Unit Test Before We Hand the Data Over to the Data Analysts
  2. Unit Test the Curated Layer, Fill In the Unit Test Document, Then Hand Over to the Data Analysts
  3. Analyse the Curated-to-Aggregated Mapping Sheet
  4. Create a Glue Notebook, Load the Curated Data, and Enrich It with Zone Information
  5. Create the Aggregated Tables and Load the Aggregated Data into S3
  6. Get the Script from the Notebook and Stop the Notebook
  7. Get the Script Vetted by AI
  8. Create and Run the Glue ETL Job to Load from Curated to Aggregated
  9. What's the Next Step?
  10. Create the Aggregated Tables with the Help of a Crawler
  11. Unit Test the Aggregated Data, Give the Data Analysts Access, and Let Them Know
  12. Comment - What Have You Learnt So Far

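Lessons 4-5 enrich trips with Zone information and build the aggregated tables. A pure-Python sketch of one such aggregate, trips and average fare per pickup zone; the zone lookup values and column names here are illustrative stand-ins for the TLC zone table:

```python
from collections import defaultdict

# Join trips against a (made-up) zone lookup, then aggregate:
# trip count and average fare per pickup zone.
ZONES = {1: "Newark Airport", 4: "Alphabet City"}

def aggregate_by_zone(trips):
    totals = defaultdict(lambda: {"trips": 0, "fare_sum": 0.0})
    for t in trips:
        zone = ZONES.get(t["pu_location_id"], "Unknown")
        totals[zone]["trips"] += 1
        totals[zone]["fare_sum"] += t["fare_amount"]
    return {
        zone: {"trips": v["trips"],
               "avg_fare": round(v["fare_sum"] / v["trips"], 2)}
        for zone, v in totals.items()
    }

trips = [
    {"pu_location_id": 1, "fare_amount": 70.0},
    {"pu_location_id": 1, "fare_amount": 60.0},
    {"pu_location_id": 4, "fare_amount": 10.0},
]
print(aggregate_by_zone(trips))
```

Unit testing the aggregated layer (lesson 11) amounts to asserting exactly these kinds of expected counts and averages against a small known input.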
7 Very Important Section - Historical & Incremental Loading (7 Lessons)
  1. Historical vs Monthly Cadence
  2. Comment - Code for Both Historical and Current - Run the Job
  3. Why Do We Even Need Incremental Processing of Data?
  4. Extremely Important from an Interview Perspective - Enable Job Bookmarks for Incremental Processing
  4.1 Glue DynamicFrame vs Spark DataFrame
  5. Comment - Incremental Processing from Curated to Aggregated
  6. Solution to the Previous Ask

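The idea behind lesson 4's Glue job bookmarks is that a job remembers which input it has already processed and, on the next run, picks up only what is new. A sketch of that behaviour with a simple in-memory set of processed keys (Glue's real bookmark state is managed by the service, not by your code):

```python
# Simulate Glue job-bookmark semantics: each run sees only the
# input files that arrived since the previous run.

class Bookmark:
    def __init__(self):
        self.processed = set()

    def new_keys(self, available):
        # Keep only keys not seen before, then record them as done.
        fresh = [k for k in available if k not in self.processed]
        self.processed.update(fresh)
        return fresh

bm = Bookmark()
first_run = bm.new_keys(["2024-01.parquet", "2024-02.parquet"])
second_run = bm.new_keys(["2024-01.parquet", "2024-02.parquet",
                          "2024-03.parquet"])
print(second_run)  # only the newly arrived file
```

This is why bookmarks matter for the monthly cadence of lesson 1: without them, every run would reprocess the full history instead of just the latest month.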
