Data Check

Data Check

  1. Go to AWS Management Console
  • Find S3
  • Select S3

Verify Ingested Data

  1. In the S3 interface
  • Select Buckets
  • Select asg-datalake-demo-bucket bucket

Verify Ingested Data

  1. We will create a folder for Athena
  • Select Create folder

Verify Ingested Data

  1. In the Create folder interface
  • Folder name, enter Athena
  • Select Create folder

Verify Ingested Data

  1. Successfully created folder

Verify Ingested Data

  1. Go to AWS Management Console
  • Find Athena
  • Select Athena

Verify Ingested Data

  1. In the Athena interface
  • Select View settings to set the path to store query results

Verify Ingested Data

  1. In the Amazon Athena interface
  • Select Settings
  • Select Manage

Verify Ingested Data

  1. Select the path to the newly created Athena folder, then select Choose

Verify Ingested Data

  1. Return to Manage settings interface, check again and select Save

Verify Ingested Data

  1. We use Amazon Athena to query data
  • Data Source, select AwsDataCatalog
  • Database, select summitdb
  • Select raw2022 table
  • Select Preview Table

Verify Ingested Data

  1. Make a query of 10 rows of data from the raw2022 table in the database summitdb
  • Query successful
  • Test data

Verify Ingested Data

  1. Continue checking the data. In the query editor pane, copy and paste the query below
SELECT activity_type,
         count(activity_type)
FROM raw2022
GROUP BY activity_type
ORDER BY activity_type
  • Select Run Query
  • Check the result

Verify Ingested Data