Using AWS Data Wrangler with AWS Glue Job 2.0 and Amazon Redshift connection

Published in

Analytics Vidhya

6 min readNov 21, 2020

I will admit, AWS Data Wrangler has become my go to package for developing extract, transform, and load (ETL) data pipelines and other day-to-day scripts. AWS Data Wrangler integration with multiple big data AWS services like S3, Glue Catalog, Athena, Databases, EMR, and others makes life simple for engineers. It also provides the ability to import packages like Pandas and PyArrow to help writing transformations.

In this blog post I will walk you through a hypothetical use-case to read data from glue…

Using AWS Data Wrangler with AWS Glue Job 2.0 and Amazon Redshift connection

Written by Anand Prakash