Poke | Building a Powerful Data Pipeline with AWS Athena, Glue, AWSWrangler, Jupyter Notebook, and Python: A Step-by-Step Guide

Building a Powerful Data Pipeline with AWS Athena, Glue, AWSWrangler, Jupyter Notebook, and Python: A Step-by-Step Guide

16 videos • 1,366 views • by Kamalraj M M A playlist that removes all the jargons in Data Engineering field and teaches what you need to become the best Data Engineer. In this Comprehensive playlist, you will learn how to build a robust data pipeline using some of the most powerful AWS tools, including Athena, Glue, AWSWrangler, Jupyter Notebook, and Python. The playlist explains each product or service from the perspective of Open Source Big Data Ecosystem. From creating the boto3 session, loading the dataset into S3 buckets, writing the table schema in Glue catalog and transforming data with Athena and finally writing the transformed data back to S3 bucket, this playlist covers all the essential steps in creating an efficient data pipeline. The underlying Big Data concepts are introduced as if a layman can understand. Data set can be found at https://github.com/Kamalabot/s3-wareh... Each tutorial in this playlist is designed to provide a step-by-step guide, starting from the basics and moving on to more advanced topics. Whether you are a seasoned data professional or just starting with AWS, this playlist will help you develop the skills you need to create a powerful data pipeline that can handle any data challenge. So, get ready to dive into the world of AWS and transform your data pipeline today!