AWS Lambda Pricing: Enterprise Serverless Cost Strategy and Negotiation Guide

$0.20

Per 1 million Lambda requests — deceptively cheap at small scale, adds up fast at enterprise volume

$0.0000166

Per GB-second of Lambda duration — the primary cost driver for compute-intensive functions

30-40%

Typical Lambda cost reduction achievable through architecture optimisation before EDP negotiation

AWS Lambda is the world's most widely deployed serverless compute platform. It's the backbone of event-driven architectures, microservice APIs, data pipeline processing, and automation workflows across thousands of enterprise AWS deployments. Its pricing model — pay only for what you use, with no idle infrastructure cost — is genuinely advantageous for intermittent or unpredictable workloads.

The commercial complexity starts when Lambda becomes a foundational compute platform rather than an occasional helper function. Enterprises that have gone "serverless-first" with Lambda sometimes discover their monthly Lambda bill exceeds what equivalent EC2 Reserved Instance workloads would cost — and unlike EC2, they have no Reserved Instance or Savings Plan commitment to reduce it.

The core issue: Lambda is not included in Compute Savings Plans commitments by default in the way EC2 is. While AWS offers a Compute Savings Plan that does cover Lambda, many enterprise negotiators focus their EDP and commitment structures primarily on EC2 and RDS — leaving Lambda spend at full on-demand rates. At significant Lambda volumes, this oversight is costly.

AWS Lambda Pricing: The Full Cost Structure

No Save, No Pay

Overpaying for AWS? We handle AWS EDP, Reserved Instance and savings plan negotiation on a 25% gainshare basis — you keep 75% of every dollar saved. No retainer. No risk.

Get a free AWS savings estimate →

Lambda billing has four distinct components, each of which requires separate optimisation attention:

Pricing Component	Price	Free Tier	Enterprise Relevance
Requests	$0.20 per 1M requests	1M requests/month	High-volume APIs can generate billions of requests monthly — $200+ per billion
Duration (GB-seconds)	$0.0000166667 per GB-second	400,000 GB-seconds/month	Primary cost driver — memory × execution time × invocations compounds rapidly
Provisioned Concurrency	$0.0000097 per GB-second allocated	None	Billed continuously whether invoked or not — easy to over-provision
Data Transfer	$0.09/GB out to internet (standard AWS rates)	1GB/month out	Often overlooked in Lambda cost analysis — relevant for APIs with large response payloads

In most enterprise Lambda deployments, duration billing (GB-seconds) accounts for 70–80% of the total Lambda cost. Requests account for most of the remainder. Data transfer and provisioned concurrency are significant for specific workload patterns.

Duration Billing: Where Enterprise Lambda Costs Really Live

Lambda duration billing charges for the amount of memory allocated to a function multiplied by the execution duration in milliseconds, rounded up to the nearest 1ms. This means:

A 1GB-memory function running for 100ms costs 0.1 GB-seconds
Running 1 billion such invocations per month = 100 million GB-seconds = $1,667/month in duration charges alone
A 512MB-memory function running the same workload = $833/month — 50% savings just from right-sizing memory

The critical insight: Lambda memory allocation affects both duration cost and execution speed. AWS Lambda runs on proportional CPU — more memory means more vCPU allocated. A function allocated 1GB of memory gets twice the CPU of a 512MB function and may execute in half the time. The optimal memory setting for cost is not the minimum — it's the configuration where price × performance is minimised.

AWS's Lambda Power Tuning tool (an open-source AWS Step Functions application) automates this analysis. Running it across your top 20–30 Lambda functions by cost often identifies 15–30% duration savings from memory right-sizing alone, before any code changes.

💡 Graviton2 Lambda: 20% Cost Reduction, Easy Migration

AWS Lambda functions running on Graviton2 (arm64 architecture) are priced 20% lower than x86 Lambda for equivalent GB-seconds. For workloads written in Python, Node.js, Java, Go, or Ruby, migration to arm64 Lambda typically requires changing a single configuration parameter. This is the single highest-ROI optimisation available to most enterprise Lambda users and should be evaluated before any EDP negotiation.

Provisioned Concurrency: The Always-On Cost Trap

Lambda's default execution model scales from zero — the first request to a cold Lambda function incurs a "cold start" latency penalty of 100ms to several seconds depending on runtime and package size. For latency-sensitive applications (customer-facing APIs, real-time event processing), cold starts are unacceptable.

Provisioned Concurrency pre-warms Lambda function instances to eliminate cold starts — but it is billed continuously for all allocated concurrency, whether or not any invocations occur. At $0.0000097 per GB-second of allocated concurrency:

10 provisioned concurrency units × 1GB memory × 730 hours/month × 3,600 seconds/hour = $255/month
100 provisioned concurrency units × 2GB × same calculation = $5,103/month
500 provisioned concurrency units across a large microservice fleet: $25K+/month in provisioned concurrency alone

Common enterprise misconfiguration: provisioned concurrency is set for 24/7 at peak-load levels when the actual peak period is only 8–12 hours per day. Implementing Application Auto Scaling for provisioned concurrency — scaling it up before business hours, down overnight — reduces provisioned concurrency costs by 40–60% for typical B2B workloads without any latency impact during business hours.

AWS cost exceeding plan? We identify and negotiate the savings.

Our AWS cloud cost negotiation service covers the full AWS stack — Lambda, EC2, RDS, S3, data transfer, and EDP structure. We identify engineering optimisations and negotiate commercial discounts. We work on a 25% gainshare basis — no savings, no fee.

Get Your Free AWS Cost Assessment →

Lambda vs. Alternative Compute: When Serverless Costs More

Lambda is not always the most cost-effective compute option at enterprise scale. Understanding when to shift workloads away from Lambda to containers or reserved EC2 is an important part of cloud cost optimisation strategy:

Workload Pattern	Lambda Cost Efficiency	Better Alternative
Sporadic/event-driven (< 100K invocations/day)	Excellent — true pay-per-use	Lambda is the right choice
High-volume API (10M+ invocations/day)	Moderate — compare to ECS Fargate or EC2 t-series	ECS/Fargate with Compute Savings Plan
Long-running batch processing (>15 min)	Poor — Lambda has 15-minute max execution limit	AWS Batch, ECS, or Fargate Spot
Always-on background workers	Poor — provisioned concurrency cost exceeds EC2	ECS Fargate or EC2 Reserved Instances
ML inference at low volume	Good — avoid idle GPU costs	Lambda with container images for larger models
ML inference at high volume	Poor — per-invocation cost exceeds dedicated endpoint	SageMaker Inference Endpoints with Reserved Capacity

Including Lambda in Your AWS EDP Negotiation

The AWS Enterprise Discount Program (EDP) provides rate card discounts applied across all AWS services based on a committed annual spend. Lambda is included in the EDP rate card — but the way you structure your EDP commitment significantly affects how much discount Lambda receives.

Ensure Lambda Is Explicitly Included in Your EDP Scope

Some EDP agreements specify discounted service categories explicitly. Ensure your EDP contract includes Lambda in the discount scope — both for on-demand Lambda charges and provisioned concurrency. If Lambda is your fastest-growing compute cost, it needs explicit inclusion rather than assumption.

Use Lambda Spend as Leverage for a Larger EDP Commitment

If your Lambda spend is growing 50–100%+ year-over-year (common in organisations undergoing serverless-first migrations), use your projected Lambda growth to justify a larger overall EDP commitment. A larger commitment unlocks higher discount tiers across all services, and the Lambda growth trajectory supports the increased commitment size.

Negotiate Compute Savings Plan as an EDP Component

AWS's Compute Savings Plan covers Lambda in addition to EC2 and Fargate. Unlike Lambda On-Demand, Compute Savings Plans provide a discount (up to 17% on Lambda duration) in exchange for a spend commitment. For predictable Lambda workloads — production APIs, scheduled jobs — structuring a Compute Savings Plan commitment as part of your EDP negotiation optimises both the plan discount and the EDP overlay discount.

Challenge Lambda Data Transfer Charges

Lambda-to-API-Gateway and Lambda-to-services-within-the-same-region data transfer charges are often overlooked in EDP negotiations. At significant Lambda volumes, internal data transfer charges can add $5K–$50K+/month. In an EDP negotiation, push for data transfer waiver or reduced rates on intra-region Lambda traffic, particularly for high-volume event-driven architectures.

The 10-Point Lambda Cost Optimisation Checklist

Before negotiating AWS EDP pricing, implement these Lambda engineering optimisations to reduce your baseline cost — each point of waste you eliminate means more savings available through EDP:

Run Lambda Power Tuning on your top functions to find optimal memory configurations
Migrate Python, Node.js, Java, Go functions to arm64/Graviton2 for an immediate 20% duration cost reduction
Implement Application Auto Scaling for provisioned concurrency to eliminate overnight and weekend idle costs
Audit Lambda timeouts — functions timing out at 15 minutes burn full duration charges; reduce timeouts to match actual expected execution time
Enable Lambda SnapStart for Java functions to reduce cold starts without provisioned concurrency
Review Lambda concurrency limits — reserved concurrency that limits invocations below your actual needs causes throttling and retries, inflating request count
Audit zombie functions — functions that haven't been invoked in 90+ days but are still deployed and potentially incurring provisioned concurrency charges
Consolidate small functions — very small functions (sub-millisecond execution) are relatively expensive per unit of work due to per-request billing overhead
Implement connection pooling for database-connected Lambda functions using RDS Proxy to reduce cold-start latency without provisioned concurrency
Review data transfer patterns — reduce cross-AZ data transfer by co-locating Lambda functions with their downstream services

Lambda Costs in the Context of Your Total AWS Estate

Lambda is rarely the largest single line item in an enterprise AWS bill — EC2, RDS, S3, and data transfer typically dominate. But Lambda is often the fastest-growing cost category as organisations build more event-driven and API-centric architectures, and it's the most likely to be unoptimised because engineering teams don't apply the same cost-consciousness to serverless as they do to provisioned infrastructure.

A comprehensive AWS cost optimisation engagement covers Lambda alongside EC2 right-sizing, Reserved Instance and Savings Plan strategy, data transfer optimisation, and EDP structure. For most enterprise AWS customers, the combination of engineering optimisation and EDP renegotiation delivers 25–40% total cloud cost reduction. We work on a 25% gainshare model — you pay nothing unless we reduce your AWS bill.

See our related guides on AWS EDP negotiation strategy, Spot vs Reserved vs Savings Plans, AWS Graviton migration ROI, and our comprehensive AWS EDP Negotiation Handbook. For a broader cloud cost perspective, see our guide on enterprise FinOps strategy and our cloud cost negotiation service.