Amazon Web Services Makes AWS Glue Available to All Customers

7 years ago

Amazon Web Services (AWS), an Amazon.com company launched AWS Glue, a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data into Amazon Simple Storage Service (Amazon S3), Amazon Redshift, Amazon Relational Database Service (Amazon RDS), and databases running on Amazon Elastic Compute Cloud (Amazon EC2) for query and analysis. Customers can create and run an ETL job with a few clicks in the AWS Management Console. Customers simply point AWS Glue at their data stored on AWS, and AWS Glue discovers the associated metadata and classifies it, generates ETL scripts for data transformation, and loads the transformed data into a destination data store, provisioning the infrastructure needed to complete the job. With AWS Glue, data can be available for analysis in minutes, and because AWS Glue is server less, customers only pay for the compute resources they consume while executing data preparation and loading jobs.

After crawling a customer’s selected data sources, AWS Glue identifies data formats and schemas to build a unified Data Catalog that provides a central view of customers’ selected data. This makes it easy for customers to search and manage all of their data across various data stores without having to manually move it. When a customer identifies a data source and target from the Data Catalog, AWS Glue matches the schemas and generates data transformation code that is customizable, reusable, portable, and sharable. Developers can schedule any number of ETL jobs, and AWS Glue manages the rest – automatically spinning compute resources up or down depending on customer ETL workloads. By streamlining the process of creating ETL jobs, AWS Glue allows customers to build scalable and reliable data preparation platforms spanning thousands of jobs, with built-in dependency resolution, scheduling, resource management, and monitoring.

“We developed AWS Glue to eliminate much of the undifferentiated heavy lifting involved with ETL. By cataloguing all of a customer’s data and automating the ETL process, AWS Glue not only takes a lot of the hassle out of analytics. It also makes it possible for customers to store their data in as many sources as they want, and very quickly start analyzing all of it with whatever AWS service they choose” said Raju Gulabani, Vice President, Databases, Analytics, and AI, Amazon Web Services.