AWS Launches Quick Start for Building Data Lakes in Its Cloud
Amazon Web Services (AWS) has made it easier for users to organize and analyze large amounts of native data using its infrastructure.
Announced earlier this month, the new "Data Lake Foundation on AWS" Quick Start deployment solution expedites the process of setting up a data lake foundation on AWS that integrates with many of its data storage, streaming and analytics services.
Quick Starts are deployment templates co-developed with AWS partners that are designed to automate and streamline the process of setting up, configuring and launching new solutions on AWS. This particular Quick Start was developed with AWS Premier Consulting Partner 47Lining and lets users build an AWS data lake environment in about 50 minutes, according to this .PDF reference guide.
The data lake foundations created by this Quick Start work with multiple AWS services to stream, organize and manage data. For instance, it uses the Amazon Simple Storage Service (S3) and Amazon Kinesis Firehouse for batch and streaming submissions, respectively.
Ingest processing is done through integration with AWS Lambda, Amazon Elasticsearch Service (ES), Kinesis Analytics and Amazon Simple Notification Service (SNS).
Users can manage their datasets using Amazon Redshift and Kinesis Analytics, and perform transformations and analyses using Redshift and Amazon Athena. The Quick Start also includes integration with the open source Kibana tool for searches.
Finally, integration with Amazon QuickSight lets users publish data to their S3 buckets.
Users can also bolt on third-party tools to their data lakes, if they choose.
"The deployment also includes an optional wizard and a sample dataset that is loaded into the Amazon Redshift cluster and Kinesis streams," AWS said in its announcement. "The data lake wizard uses the dataset to demonstrate data lake capabilities such as search, transforms, queries, analytics, and visualization."
More information on this Quick Sight is available here.
About the Author
Gladys Rama (@GladysRama3) is the editor of Redmondmag.com, RCPmag.com and AWSInsider.net, and the editorial director of Converge360.