Member-only story
Spring Boot with Amazon Athena
How to integrate Spring Boot with your Data Lake. An SQL interface to your S3 data on Amazon.
data:image/s3,"s3://crabby-images/f6e6a/f6e6aeecf7b02aed8da999c008b1ec2dbf97cc77" alt=""
Introduction
We are going to look at interfacing a SpringBoot application with Amazon Athena. We will start out however by first learning what is Athena and how to work with it. Then we build a SpringBoot 3 application that queries Athena via an exposed Rest API.
Athena Introduced
Amazon Athena is a query service that allows you to run standard SQL over data stored in a variety of sources and formats, mainly used with S3 (see below). Athena is based on Presto, a distributed SQL engine that was open sourced by Facebook. Athena is serverless, so there is no infrastructure to setup or manage.
Athena is used to analyze the data which is present in Amazon S3. Athena can operate with various types of structured and unstructured data types which includes data formats like CSV, ORC, Apache Parquet and Apache Avro and JSON.
Amazon’s Simple Storage Service, or S3 for short, is easily the most recommended data store to use with Athena. S3 is a object storage service on AWS that provides low cost storage of any file format. S3 stores data as objects within buckets. An object is a file and any metadata that describes the file. A bucket is a container for objects.