AWS Certified Big Data - Specialty (#60)

An organization would like to run analytics on their Elastic Load Balancing logs stored in Amazon S3 and join this data with other tables in Amazon S3. The users are currently using a BI tool connecting with JDBC and would like to keep using this BI tool. Which solution would result in the LEAST operational overhead?

Trigger a Lambda function when a new log file is added to the bucket to transform and load it into Amazon Redshift. Run the VACUUM command on the Amazon Redshift cluster every night.
Launch a long-running Amazon EMR cluster that continuously downloads and transforms new files from Amazon S3 into its HDFS storage. Use Presto to expose the data through JDBC.
Trigger a Lambda function when a new log file is added to the bucket to transform and move it to another bucket with an optimized data structure. Use Amazon Athena to query the optimized bucket.
Launch a transient Amazon EMR cluster every night that transforms new log files and loads them into Amazon Redshift.
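
To make the "optimized data structure" in the Lambda-plus-Athena option concrete, here is a minimal sketch that re-keys an incoming log file into Hive-style date partitions (region=/year=/month=/day=), a layout Athena can use to prune the data it scans. It is an illustration under stated assumptions, not a reference implementation: the key pattern assumes the standard ELB log layout (.../AWSLogs/<account>/elasticloadbalancing/<region>/<yyyy>/<mm>/<dd>/<file>), and DEST_BUCKET, the "elb-logs" prefix, and the function names are hypothetical. It only copies the object to a new key; converting the logs to a columnar format would be a separate step.

```python
import re
from urllib.parse import unquote_plus

# Hypothetical destination bucket for the re-keyed logs.
DEST_BUCKET = "example-optimized-logs"

# Assumed ELB log key layout:
# .../AWSLogs/<account>/elasticloadbalancing/<region>/<yyyy>/<mm>/<dd>/<file>
_KEY_PATTERN = re.compile(
    r"AWSLogs/(?P<account>\d+)/elasticloadbalancing/(?P<region>[^/]+)/"
    r"(?P<year>\d{4})/(?P<month>\d{2})/(?P<day>\d{2})/(?P<name>[^/]+)$"
)


def partitioned_key(source_key, prefix="elb-logs"):
    """Map an ELB log key to a Hive-style partitioned key."""
    match = _KEY_PATTERN.search(source_key)
    if match is None:
        raise ValueError(f"not an ELB log key: {source_key!r}")
    p = match.groupdict()
    return (
        f"{prefix}/region={p['region']}/year={p['year']}"
        f"/month={p['month']}/day={p['day']}/{p['name']}"
    )


def handler(event, context):
    """Lambda entry point for S3 ObjectCreated notifications."""
    # Imported lazily so partitioned_key can be exercised without the AWS SDK.
    import boto3

    s3 = boto3.client("s3")
    for record in event["Records"]:
        bucket = record["s3"]["bucket"]["name"]
        # Object keys in S3 event notifications are URL-encoded.
        key = unquote_plus(record["s3"]["object"]["key"])
        s3.copy_object(
            Bucket=DEST_BUCKET,
            Key=partitioned_key(key),
            CopySource={"Bucket": bucket, "Key": key},
        )
```

With objects laid out this way, a table defined over the destination prefix and partitioned by region, year, month, and day lets date-filtered queries read only the matching prefixes, and the existing BI tool can reach Athena through its JDBC driver.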