amazon-athenaathena-pricing

Athena Pricing Calculator - SQL Queries, Provisioned Capacity & Spark

Calculate Amazon Athena costs for SQL queries, provisioned capacity, and Apache Spark. Estimate monthly expenses based on data scanned or DPU hours with real-time pricing.

Frequently Asked Questions

How does Amazon Athena pricing work?

Amazon Athena pricing is based on two models: On-demand charges $5 per TB of data scanned by your SQL queries, or provisioned capacity where you pay for DPU-hours (Data Processing Units). Queries on partitioned data and compressed formats like Parquet reduce costs by scanning less data. DDL statements and failed queries are not charged.

How can I reduce my Athena query costs?

Reduce Athena costs by: (1) Partitioning tables to scan only relevant data, (2) Using columnar formats like Parquet or ORC with compression, (3) Limiting columns in SELECT statements instead of SELECT *, (4) Using provisioned capacity for predictable workloads, and (5) Implementing query result caching to avoid re-scanning data.

When should I use Athena provisioned capacity?

Use Athena provisioned capacity when you have predictable, high-volume query workloads that scan more than 100 TB per month. Provisioned capacity offers consistent performance with DPU-hour pricing, which can be more cost-effective than on-demand ($5/TB) for heavy users. For sporadic or unpredictable workloads, on-demand pricing is typically better.

How much do Athena for Apache Spark sessions cost?

Athena for Apache Spark charges based on DPU-hours consumed by your notebook sessions. A DPU (Data Processing Unit) provides 4 vCPUs and 16 GB of memory. Pricing varies by region, typically around $0.30-0.40 per DPU-hour. Spark applications automatically scale compute resources based on workload demands, and you only pay for actual usage time.

What counts as data scanned in Athena queries?

Athena charges based on the amount of data read from S3 to execute your query, measured in TB. Partitioning, columnar formats (Parquet/ORC), and compression significantly reduce data scanned. For example, a query on a 1 TB Parquet table might only scan 100 GB if properly partitioned and selecting specific columns, costing $0.50 instead of $5.

CloudBurn

Catch AWS cost mistakes before they ship.

Use the calculator for quick estimates, then use CloudBurn when you need a deterministic cost review workflow. Run scan against Terraform and CloudFormation before deploy, then run discover against live AWS to find the waste that is already burning.

Read the docs to learn how it works.