AWS Cloud can seem intimidating and overwhelming to many people because of its vast ecosystem, but this course makes it easier for anyone who wants hands-on expertise in setting up a data warehouse in Redshift or building a BI infrastructure from scratch.
Data scientists, analysts, and business analysts will soon be expected (if they are not already) to become all-rounders and handle the technical aspects of data ingestion, engineering, and warehousing.
Anyone with a basic understanding of how the cloud works can benefit from this course because:
– It is designed with the end-to-end life cycle of a typical data engineering project in mind
– It provides practical solutions to real-world use cases
This course covers:
• Setting up a data warehouse in AWS Redshift from scratch
• Basic data warehousing concepts
• Writing serverless AWS Glue jobs (PySpark and Python shell) for ETL and batch processing
• AWS Athena for ad-hoc analysis (and when to use Athena)
• AWS Data Pipeline to sync incremental data
• Lambda functions to trigger and automate ETL and data-syncing processes
• QuickSight setup, analyses, and dashboards
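As a rough illustration of the "Lambda functions to trigger ETL" item above, the sketch below shows one way a Lambda handler might start a Glue job when a file lands in S3. It is not taken from the course: the job name, the argument names, and the optional `glue_client` parameter (added so the handler can be exercised without AWS access) are all assumptions made for this example.

```python
import urllib.parse

# Hypothetical Glue job name; replace with a job that exists in your account.
GLUE_JOB_NAME = "example-etl-job"


def build_job_arguments(event):
    """Map the first record of an S3 event notification to Glue job arguments."""
    record = event["Records"][0]
    bucket = record["s3"]["bucket"]["name"]
    # Object keys arrive URL-encoded in S3 event notifications.
    key = urllib.parse.unquote_plus(record["s3"]["object"]["key"])
    # Argument names are illustrative; a Glue script would typically read them
    # with awsglue.utils.getResolvedOptions.
    return {"--source_bucket": bucket, "--source_key": key}


def lambda_handler(event, context, glue_client=None):
    """Start one Glue job run for the uploaded object and return its run ID."""
    if glue_client is None:
        # Imported lazily so the module can be tested without AWS access.
        import boto3
        glue_client = boto3.client("glue")
    response = glue_client.start_job_run(
        JobName=GLUE_JOB_NAME,
        Arguments=build_job_arguments(event),
    )
    return {"job_run_id": response["JobRunId"]}
```

In a real deployment the function's IAM role would also need permission to start the Glue job, and the S3 bucket would need an event notification pointing at the function.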
Prerequisites for this course are:
• Python / SQL (absolute must)
• PySpark (you should know how to write some basic PySpark scripts)
• Willingness to explore, learn, and put in the extra effort to succeed
• An active AWS account
Important note: This course uses the free tiers for Redshift and RDS, so you will not be billed for them unless you exceed the free-tier usage, which should be more than enough for practising the material in this course.
Also, this course uses the AWS UI in the browser for creating clusters and setting up jobs; there is no bash scripting involved. You can use any operating system for the lab sessions in this course.
This course is not code-intensive: only 35% of it involves coding, and the rest is execution, understanding, and chaining different components together. The purpose of this course is to make everyone aware of, and comfortable with, all the tools and features it uses.
Some tips:
• Try to watch the videos at 1.2x speed
• Every time you work on a new component or feature, research the other tools meant for the same purpose and see how they differ and in what respects, e.g. Redshift/Athena vs. Snowflake or BigQuery, QuickSight vs. Power BI vs. MicroStrategy
Data Engineering, Serverless ETL & BI on Amazon Cloud
Data warehousing & ETL on AWS Cloud
Created by
9.6
CourseMarks Score®
Freshness
Feedback
Content
Detailed Analysis
CourseMarks Score®
CourseMarks Score® helps students to find the best classes. We aggregate 18 factors, including freshness, student feedback and content diversity.
Freshness Score
Course content can become outdated quite quickly. After analysing 71,530 courses, we found that the highest rated courses are updated every year. If a course has not been updated for more than 2 years, you should carefully evaluate the course before enrolling.
Student Feedback
New courses are hard to evaluate because they have few or no student ratings, but the Student Feedback Score helps you find great courses even when reviews are scarce.
Content Score
Top online courses contain a detailed course description, a list of what you will learn, and detailed information about the instructor.
Tests, exercises, articles, and other resources help students deepen their understanding of the topic.
You will learn
✓ Learn and understand AWS Athena and when to use it
✓ Learn how to store data in S3 data lakes using the Parquet columnar file format and optimize data scans with Athena
✓ Learn and automate ETL processes using serverless components such as AWS Glue, Data Pipeline, and Lambda functions
✓ Data centralization using Redshift Spectrum
✓ Trigger and automate Glue jobs using Lambda functions
✓ Understand how to pull data into QuickSight, the BI reporting and visualization offering from AWS
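One common way to reduce the amount of data Athena scans, alongside a columnar format such as Parquet, is to store files under Hive-style partition prefixes and filter on the partition columns. The sketch below illustrates that idea only; the course may take a different approach. The table name, file name, and the assumption that `year`, `month`, and `day` are declared as string partition columns are all hypothetical.

```python
from datetime import date


def partitioned_key(table, day, filename):
    """Build a Hive-style partitioned S3 object key for one day's data."""
    return (
        f"{table}/year={day.year:04d}/month={day.month:02d}/"
        f"day={day.day:02d}/{filename}"
    )


def partition_filter(day):
    """Build a WHERE-clause fragment that restricts a query to one partition.

    Assumes year, month, and day are string-typed partition columns.
    """
    return (
        f"year = '{day.year:04d}' AND month = '{day.month:02d}' "
        f"AND day = '{day.day:02d}'"
    )


if __name__ == "__main__":
    d = date(2021, 3, 7)
    print(partitioned_key("sales", d, "part-0000.parquet"))
    print(f"SELECT * FROM sales WHERE {partition_filter(d)}")
```

With a layout like this, a query that filters on the partition columns only needs to read the objects under the matching prefix rather than the whole table.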
Requirements
• Should have a technical background or prior experience in PySpark (at least beginner level)
• Basic understanding of different cloud components (AWS, GCP, or Azure)
This course is for
• Software developers who are curious to learn data engineering
• Anyone with coding experience who wants to get into data engineering, analytics, or data science
How much does the Data Engineering, Serverless ETL & BI on Amazon Cloud course cost? Is it worth it?
Does the Data Engineering, Serverless ETL & BI on Amazon Cloud course have a money back guarantee or refund policy?
Are there any SCHOLARSHIPS for this course?
Who is the instructor? Is Siddharth Raghunath a SCAM or a TRUSTED instructor?