Analysis with Athena

Analyze with Athena

Because Athena uses the AWS Glue Catalog to keep track of data sources, all tables in Glue will be queryable by Athena.

  1. Go to AWS Management Console
  • Find Athena
  • Select Athena

Analyze Data with Athena

  1. In the Athena interface
  • Data Source, select AwsDataCatalog
  • Database, select summitdb We execute the following SQL query:
SELECT artist_name,
         count(artist_name) AS count
FROM processed_data
GROUP BY artist_name
ORDER BY count desc
  • Run Run Query
  • After running Complete
  • View query results

Analyze Data with Athena

  1. Try to explore Athena’s interface and do some more queries.
  • The query below will return the list of tracks that are played repeatedly by the device, we will also use the query below to create a chart through QuickSight.
SELECT device_id,
         track_name,
         count(track_name) AS count
FROM processed_data
GROUP BY device_id, track_name
ORDER BY count desc

Analyze Data with Athena