AWS Lambda is the world's most widely deployed serverless compute platform. It's the backbone of event-driven architectures, microservice APIs, data pipeline processing, and automation workflows across thousands of enterprise AWS deployments. Its pricing model — pay only for what you use, with no idle infrastructure cost — is genuinely advantageous for intermittent or unpredictable workloads.
The commercial complexity starts when Lambda becomes a foundational compute platform rather than an occasional helper function. Enterprises that have gone "serverless-first" with Lambda sometimes discover their monthly Lambda bill exceeds what equivalent EC2 Reserved Instance workloads would cost — and unlike EC2, they have no Reserved Instance or Savings Plan commitment to reduce it.
The core issue: Lambda is not included in Compute Savings Plans commitments by default in the way EC2 is. While AWS offers a Compute Savings Plan that does cover Lambda, many enterprise negotiators focus their EDP and commitment structures primarily on EC2 and RDS — leaving Lambda spend at full on-demand rates. At significant Lambda volumes, this oversight is costly.
AWS Lambda Pricing: The Full Cost Structure
Overpaying for AWS? We handle AWS EDP, Reserved Instance and savings plan negotiation on a 25% gainshare basis — you keep 75% of every dollar saved. No retainer. No risk.
Get a free AWS savings estimate →Lambda billing has four distinct components, each of which requires separate optimisation attention:
| Pricing Component | Price | Free Tier | Enterprise Relevance |
|---|---|---|---|
| Requests | $0.20 per 1M requests | 1M requests/month | High-volume APIs can generate billions of requests monthly — $200+ per billion |
| Duration (GB-seconds) | $0.0000166667 per GB-second | 400,000 GB-seconds/month | Primary cost driver — memory × execution time × invocations compounds rapidly |
| Provisioned Concurrency | $0.0000097 per GB-second allocated | None | Billed continuously whether invoked or not — easy to over-provision |
| Data Transfer | $0.09/GB out to internet (standard AWS rates) | 1GB/month out | Often overlooked in Lambda cost analysis — relevant for APIs with large response payloads |
In most enterprise Lambda deployments, duration billing (GB-seconds) accounts for 70–80% of the total Lambda cost. Requests account for most of the remainder. Data transfer and provisioned concurrency are significant for specific workload patterns.
Duration Billing: Where Enterprise Lambda Costs Really Live
Lambda duration billing charges for the amount of memory allocated to a function multiplied by the execution duration in milliseconds, rounded up to the nearest 1ms. This means:
- A 1GB-memory function running for 100ms costs 0.1 GB-seconds
- Running 1 billion such invocations per month = 100 million GB-seconds = $1,667/month in duration charges alone
- A 512MB-memory function running the same workload = $833/month — 50% savings just from right-sizing memory
The critical insight: Lambda memory allocation affects both duration cost and execution speed. AWS Lambda runs on proportional CPU — more memory means more vCPU allocated. A function allocated 1GB of memory gets twice the CPU of a 512MB function and may execute in half the time. The optimal memory setting for cost is not the minimum — it's the configuration where price × performance is minimised.
AWS's Lambda Power Tuning tool (an open-source AWS Step Functions application) automates this analysis. Running it across your top 20–30 Lambda functions by cost often identifies 15–30% duration savings from memory right-sizing alone, before any code changes.
💡 Graviton2 Lambda: 20% Cost Reduction, Easy Migration
AWS Lambda functions running on Graviton2 (arm64 architecture) are priced 20% lower than x86 Lambda for equivalent GB-seconds. For workloads written in Python, Node.js, Java, Go, or Ruby, migration to arm64 Lambda typically requires changing a single configuration parameter. This is the single highest-ROI optimisation available to most enterprise Lambda users and should be evaluated before any EDP negotiation.
Provisioned Concurrency: The Always-On Cost Trap
Lambda's default execution model scales from zero — the first request to a cold Lambda function incurs a "cold start" latency penalty of 100ms to several seconds depending on runtime and package size. For latency-sensitive applications (customer-facing APIs, real-time event processing), cold starts are unacceptable.
Provisioned Concurrency pre-warms Lambda function instances to eliminate cold starts — but it is billed continuously for all allocated concurrency, whether or not any invocations occur. At $0.0000097 per GB-second of allocated concurrency:
- 10 provisioned concurrency units × 1GB memory × 730 hours/month × 3,600 seconds/hour = $255/month
- 100 provisioned concurrency units × 2GB × same calculation = $5,103/month
- 500 provisioned concurrency units across a large microservice fleet: $25K+/month in provisioned concurrency alone
Common enterprise misconfiguration: provisioned concurrency is set for 24/7 at peak-load levels when the actual peak period is only 8–12 hours per day. Implementing Application Auto Scaling for provisioned concurrency — scaling it up before business hours, down overnight — reduces provisioned concurrency costs by 40–60% for typical B2B workloads without any latency impact during business hours.
AWS cost exceeding plan? We identify and negotiate the savings.
Our AWS cloud cost negotiation service covers the full AWS stack — Lambda, EC2, RDS, S3, data transfer, and EDP structure. We identify engineering optimisations and negotiate commercial discounts. We work on a 25% gainshare basis — no savings, no fee.
Get Your Free AWS Cost Assessment →Lambda vs. Alternative Compute: When Serverless Costs More
Lambda is not always the most cost-effective compute option at enterprise scale. Understanding when to shift workloads away from Lambda to containers or reserved EC2 is an important part of cloud cost optimisation strategy:
| Workload Pattern | Lambda Cost Efficiency | Better Alternative |
|---|---|---|
| Sporadic/event-driven (< 100K invocations/day) | Excellent — true pay-per-use | Lambda is the right choice |
| High-volume API (10M+ invocations/day) | Moderate — compare to ECS Fargate or EC2 t-series | ECS/Fargate with Compute Savings Plan |
| Long-running batch processing (>15 min) | Poor — Lambda has 15-minute max execution limit | AWS Batch, ECS, or Fargate Spot |
| Always-on background workers | Poor — provisioned concurrency cost exceeds EC2 | ECS Fargate or EC2 Reserved Instances |
| ML inference at low volume | Good — avoid idle GPU costs | Lambda with container images for larger models |
| ML inference at high volume | Poor — per-invocation cost exceeds dedicated endpoint | SageMaker Inference Endpoints with Reserved Capacity |
Including Lambda in Your AWS EDP Negotiation
The AWS Enterprise Discount Program (EDP) provides rate card discounts applied across all AWS services based on a committed annual spend. Lambda is included in the EDP rate card — but the way you structure your EDP commitment significantly affects how much discount Lambda receives.
Ensure Lambda Is Explicitly Included in Your EDP Scope
Some EDP agreements specify discounted service categories explicitly. Ensure your EDP contract includes Lambda in the discount scope — both for on-demand Lambda charges and provisioned concurrency. If Lambda is your fastest-growing compute cost, it needs explicit inclusion rather than assumption.
Use Lambda Spend as Leverage for a Larger EDP Commitment
If your Lambda spend is growing 50–100%+ year-over-year (common in organisations undergoing serverless-first migrations), use your projected Lambda growth to justify a larger overall EDP commitment. A larger commitment unlocks higher discount tiers across all services, and the Lambda growth trajectory supports the increased commitment size.
Negotiate Compute Savings Plan as an EDP Component
AWS's Compute Savings Plan covers Lambda in addition to EC2 and Fargate. Unlike Lambda On-Demand, Compute Savings Plans provide a discount (up to 17% on Lambda duration) in exchange for a spend commitment. For predictable Lambda workloads — production APIs, scheduled jobs — structuring a Compute Savings Plan commitment as part of your EDP negotiation optimises both the plan discount and the EDP overlay discount.
Challenge Lambda Data Transfer Charges
Lambda-to-API-Gateway and Lambda-to-services-within-the-same-region data transfer charges are often overlooked in EDP negotiations. At significant Lambda volumes, internal data transfer charges can add $5K–$50K+/month. In an EDP negotiation, push for data transfer waiver or reduced rates on intra-region Lambda traffic, particularly for high-volume event-driven architectures.
The 10-Point Lambda Cost Optimisation Checklist
Before negotiating AWS EDP pricing, implement these Lambda engineering optimisations to reduce your baseline cost — each point of waste you eliminate means more savings available through EDP:
- Run Lambda Power Tuning on your top functions to find optimal memory configurations
- Migrate Python, Node.js, Java, Go functions to arm64/Graviton2 for an immediate 20% duration cost reduction
- Implement Application Auto Scaling for provisioned concurrency to eliminate overnight and weekend idle costs
- Audit Lambda timeouts — functions timing out at 15 minutes burn full duration charges; reduce timeouts to match actual expected execution time
- Enable Lambda SnapStart for Java functions to reduce cold starts without provisioned concurrency
- Review Lambda concurrency limits — reserved concurrency that limits invocations below your actual needs causes throttling and retries, inflating request count
- Audit zombie functions — functions that haven't been invoked in 90+ days but are still deployed and potentially incurring provisioned concurrency charges
- Consolidate small functions — very small functions (sub-millisecond execution) are relatively expensive per unit of work due to per-request billing overhead
- Implement connection pooling for database-connected Lambda functions using RDS Proxy to reduce cold-start latency without provisioned concurrency
- Review data transfer patterns — reduce cross-AZ data transfer by co-locating Lambda functions with their downstream services
Lambda Costs in the Context of Your Total AWS Estate
Lambda is rarely the largest single line item in an enterprise AWS bill — EC2, RDS, S3, and data transfer typically dominate. But Lambda is often the fastest-growing cost category as organisations build more event-driven and API-centric architectures, and it's the most likely to be unoptimised because engineering teams don't apply the same cost-consciousness to serverless as they do to provisioned infrastructure.
A comprehensive AWS cost optimisation engagement covers Lambda alongside EC2 right-sizing, Reserved Instance and Savings Plan strategy, data transfer optimisation, and EDP structure. For most enterprise AWS customers, the combination of engineering optimisation and EDP renegotiation delivers 25–40% total cloud cost reduction. We work on a 25% gainshare model — you pay nothing unless we reduce your AWS bill.
See our related guides on AWS EDP negotiation strategy, Spot vs Reserved vs Savings Plans, AWS Graviton migration ROI, and our comprehensive AWS EDP Negotiation Handbook. For a broader cloud cost perspective, see our guide on enterprise FinOps strategy and our cloud cost negotiation service.
Further Reading
- AWS Pricing Calculator ↗
- AWS Cost Optimization Hub ↗
- Gartner Magic Quadrant for Cloud Infrastructure ↗
The NoSaveNoPay AWS Guarantee
We optimise your AWS estate and renegotiate your EDP. Our fee: 25% of verified annual savings. If we don't reduce your AWS bill, you pay nothing. That's not marketing — it's contractual. Average AWS client savings: 30%. See how it works.
Start Your Risk-Free AWS Engagement →