Best practices for implementing a data lake in Amazon S3 (Level 200)

August 7, 2020
"Flexibility is key when building and scaling data lakes, and by choosing the right storage architecture, you can have the agility necessary to quickly experiment and migrate with the latest analytics solutions. In this session, we explore the best practices for building a data lake on Amazon S3, which allow you to leverage an entire array of AWS, open-source, and third-party analytics tools, helping you remain at the cutting edge. We explore use cases for analytics tools, including Amazon EMR and AWS Glue, and query-in-place tools like Amazon Athena, Amazon Redshift Spectrum, Amazon S3 Select, and Amazon S3 Glacier Select. Speaker: Kumar Nachiketa, Senior Partner Solutions Architect, Amazon Web Services"
Previous Video
Streaming and real-time analytics (Level 300)
Streaming and real-time analytics (Level 300)

"Learn how streaming technologies can help analyze data in real time, move data between systems in real tim...

Next Video
Turn data into insights (Level 200)
Turn data into insights (Level 200)

"Data is incredibly valuable, but extracting it is getting harder as the volume, variety, and velocity of d...