Last Updated: 26-11-2023
Team: Gia Hưng
AWS Data Pipeline Workshop
4.1 Create New Glue Job
Open AWS Glue → go to Jobs → Add Job
Job name: ProcessJob
IAM Role: select the role created earlier in step 2.1 (Create IAM Role)
Type: Spark
Script: process data from sales_data_raw and write the output to s3://sales-data-bucket-2025/processed/
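The same job settings could also be created programmatically rather than through the console. The following is a minimal sketch using the boto3 Glue client, not the workshop's own method. The role name, the script location, the Glue version, and the worker settings are assumptions chosen for illustration; only the job name, the Spark job type, and the output path come from the steps above.

```python
"""Sketch: define the ProcessJob Glue job in code instead of the console."""

# Values taken from the steps above.
JOB_NAME = "ProcessJob"
OUTPUT_PATH = "s3://sales-data-bucket-2025/processed/"

# Assumptions (not stated in the workshop text): replace with your own values.
ROLE_NAME = "GlueWorkshopRole"  # hypothetical name of the IAM role from step 2.1
SCRIPT_LOCATION = "s3://sales-data-bucket-2025/scripts/process_job.py"  # hypothetical


def build_job_definition(role=ROLE_NAME, script_location=SCRIPT_LOCATION):
    """Return the keyword arguments for the Glue CreateJob API call."""
    return {
        "Name": JOB_NAME,
        "Role": role,
        "Command": {
            "Name": "glueetl",  # command name used for Spark ETL jobs
            "ScriptLocation": script_location,
            "PythonVersion": "3",
        },
        "DefaultArguments": {
            "--job-language": "python",
            # Custom job parameter (assumed name) that the ETL script can read.
            "--output_path": OUTPUT_PATH,
        },
        # Assumed sizing; adjust to your account and data volume.
        "GlueVersion": "4.0",
        "WorkerType": "G.1X",
        "NumberOfWorkers": 2,
    }


def create_job(definition):
    """Create the job in AWS Glue. Requires boto3 and valid AWS credentials."""
    import boto3  # imported lazily so the definition can be inspected offline

    glue = boto3.client("glue")
    response = glue.create_job(**definition)
    return response["Name"]
```

Calling `create_job(build_job_definition())` with valid credentials would register the job. Keeping the definition in a separate function makes it possible to review or print the settings before anything is created in the account.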