Published in AWS Tip
Amazon SageMaker: Data Processing
· Overview · Manual Data Processing in Jupyter Notebook ∘ Manual Steps · Using built-in SKLearnProcessor container · Bring Your Own…
Feb 16, 2023

Machine Learning System Design — Resources
Recently I completed the Machine Learning System Design course from educative.io.
Jan 7, 2023

GoodBye 2022, Hello 2023!
2022 has been a peaceful and kind year both professionally and personally for me. I began the year as an AWS Data Lab Solution Architect…
Dec 31, 2022

Serverless ETL and Analytics with AWS Glue — Book Review
I have personally worked with two of the book's authors, Vishal Pathak and Noritaka Sekiyama, when writing the AWS blog: Ingest streaming…
Dec 19, 2022

Published in AWS Tip
Detect, Redact, and Mask PII data with AWS Services
· Overview · AWS Glue (Studio) · AWS Glue Studio — Find sensitive data in each row ∘ Actions — enrich data with detection results ∘…
Sep 24, 2022

Published in AWS Tip
AWS Glue DataBrew — Overview
· Overview · Components of AWS Glue DataBrew ∘ Dataset ∘ Data Profile ∘ Projects ∘ Recipes ∘ Jobs · Conclusion
Sep 6, 2022

Published in AWS Tip
Use Amazon Athena Federated Query to query data from Aurora PostgreSQL running in Private Subnet
Amazon Athena is an interactive query service that enables users to analyze data in Amazon S3 using standard SQL. In most cases, Amazon…
Mar 31, 2022

Published in AWS Tip
Use Amazon MSK Connect with Lenses plugin to sink data from Amazon MSK to Amazon S3
Apache Kafka is an open-source distributed event streaming platform consisting of servers and clients communicating via high performance…
Feb 18, 2022

Published in AWS Tip
Use AWS Glue Job Bookmark feature with Aurora PostgreSQL Database
· Overview · Load data into Aurora PostgreSQL using Glue Catalog table · Job Bookmark for JDBC source · Create AWS Glue Job with Job…
Jan 23, 2022