AWS API Gateway - The API Service That Actually Works

What is AWS API Gateway

API Gateway is the thing AWS puts between your users and your actual services. It's basically a bouncer that checks IDs and deals with rate limiting so you don't have to write that tedious shit yourself. Works great until you get your first $3K AWS bill.

You get three flavors: REST APIs with all the bells and whistles, HTTP APIs that are faster and cheaper but with fewer features, and WebSocket APIs for when you need real-time communication. Pick your poison based on what you actually need, not what sounds impressive.

How It Actually Works

API Gateway runs on AWS's edge locations (CloudFront's network) when you pick "edge-optimized," or stays in one region if you choose "regional." Edge-optimized is marketing bullshit for "double your CloudFront bill" - learned this when our API Gateway costs were $200/month but CloudFront hit $800. Those execute-api URLs look like https://abc123.execute-api.us-east-1.amazonaws.com/prod and you can't change the ugly bastards without custom domains.

It plays nice with AWS Lambda (probably what you'll use most), Amazon Cognito for user management, and IAM for permissions. CORS configuration will make you question your life choices - the error messages are complete garbage like "CORS policy: Cross origin requests are only supported for protocol schemes: http, data, chrome, chrome-extension, https." Deployment stages confuse the shit out of everyone: "prod" and "production" are different things to API Gateway because fuck consistency. CloudWatch handles monitoring, assuming you want to pay $0.50/GB for detailed logs.

Performance Reality Check

API Gateway scales automatically, but your backend better keep up. The default throttle is 10,000 requests per second per account per region. That sounds generous until you do the math: sustained for a month, that's roughly 26 billion requests, or about $90K on REST APIs at the $3.50 per million list rate (volume tiers bring it down, but it's still a scary number). You can request limit increases through AWS Support, but expect a 20-question Spanish Inquisition about your traffic patterns.

Since the 2019 HTTP API launch, nobody should use REST APIs unless they need the features. HTTP APIs are actually 71% cheaper than REST APIs and faster too, but you lose request validation and caching. Lambda cold starts can add 500ms+ to your first request - especially brutal for those "just checking if the API is up" health checks. Edge-optimized might help global latency but adds complexity and CloudFront costs.

Who Actually Uses This Thing

TiVo uses API Gateway for streaming because it auto-scales during peak viewing times (think Sunday night HBO premieres) without them having to provision servers. WirelessCar needed sub-100ms response times for connected cars - edge-optimized endpoints solved their latency problem.

Most people use it for microservices APIs, mobile app backends, or modernizing legacy systems. It's particularly good when you need something up fast and don't want to manage load balancers, SSL certificates, and all that infrastructure nonsense. Just remember: it's another layer that can break, so plan accordingly.

API Gateway Types - Choose Your Poison

Feature	REST API	HTTP API	WebSocket API
Pricing	$3.50 per million requests 💸	$0.90-$1.00 per million requests 💰	$1.00 per million messages
Use Case	Feature-rich APIs (if you need the features)	Fast, cheap APIs (if you can live without validation)	Real-time stuff (chat, gaming)
Authentication	IAM, Cognito, API Keys, Lambda authorizers	IAM, Cognito, JWT, Lambda authorizers	IAM, Lambda authorizers
Caching	✅ Built-in (billed hourly by cache size, from $0.02/hour for 0.5 GB)	❌ No caching (handle it yourself)	❌ No caching
Request Validation	✅ Built-in validation	❌ Validate in your Lambda instead	❌ No validation
Custom Domain	✅ Works (ACM certificate required)	✅ Works (ACM certificate required)	✅ Works (ACM certificate required)
AWS WAF Integration	✅ Protection against attacks	❌ You're on your own	❌ You're on your own
Private Endpoints	✅ VPC endpoints (complex setup)	❌ Internet-only	❌ Internet-only
API Keys & Usage Plans	✅ Built-in monetization	❌ Roll your own	❌ Roll your own
Request/Response Transformation	✅ Full VTL mapping templates	✅ Basic parameter mapping only	✅ Route-based transformation
Performance	Standard latency (cold starts hurt)	Up to 71% faster (still cold starts)	Persistent connection (connection limits)
Monitoring	CloudWatch + X-Ray (extra cost)	CloudWatch only	CloudWatch only
Best For	Enterprise APIs that need every feature	Simple proxy APIs, microservices	Chat apps, live dashboards, multiplayer games

Questions Engineers Actually Ask at 3AM

Why does my API randomly return 502 errors?

Usually Lambda cold starts, backend timeouts, or your Lambda function shitting itself.

Check CloudWatch logs first - look for "Task timed out after 30.00 seconds" or "RequestId: abc123 Process exited before completing request" or my personal favorite "Runtime exited with error: exit status 1".

A 502 usually means Lambda crashed or returned something that isn't a valid proxy response. If the backend is just slow, API Gateway gives up at the integration timeout (29 seconds by default on REST APIs, 30 on HTTP APIs) and hands back an equally useless 5xx (504 on REST APIs). If it's cold starts, consider provisioned concurrency (roughly $10.80/month per GB kept warm, but that beats explaining to your CEO why the API is down).
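One way to rule out the "malformed response" flavor of 502 is to make sure the function always returns the shape API Gateway expects. Here is a minimal sketch, assuming a Lambda proxy integration; the echo logic and error messages are made up for illustration:

```python
import json


def handler(event, context):
    """Lambda proxy handler that always returns a well-formed response."""
    try:
        # With proxy integration the body arrives as a string (or None).
        payload = json.loads(event.get("body") or "{}")
        result = {"echo": payload}  # placeholder business logic (hypothetical)
        status = 200
    except json.JSONDecodeError:
        result = {"error": "request body is not valid JSON"}
        status = 400
    except Exception:
        # Catch-all so a bug becomes a 500 with a body instead of a bare 502.
        result = {"error": "internal error"}
        status = 500

    return {
        "statusCode": status,  # must be an integer
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(result),  # must be a string, not a dict
    }
```

Returning a raw dict or list instead of this envelope is one of the quieter ways to earn a 502.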

Why is my AWS bill so high when I'm only making a few API calls?

REST APIs cost $3.50 per million requests, HTTP APIs cost $0.90-$1.00 per million. Sounds cheap until you scale: 10M requests/month is $35 on REST APIs. Plus data transfer costs (first GB free, then $0.09/GB). If you enabled caching, you pay by the hour for the cache size you picked (from $0.02/hour for 0.5 GB) whether you use it or not. Edge-optimized? CloudFront costs might exceed your API Gateway costs.

How do I fix CORS errors that make no sense?

CORS preflight requests fail silently and you'll spend 2 hours debugging why OPTIONS returns 200 but your POST still gets blocked.

CORS errors in API Gateway are designed by sadists - the browser shows "Access to fetch at 'https://api.example.com' from origin 'https://example.com' has been blocked by CORS policy" but gives no hint what's actually wrong.

Make sure you enabled CORS on your resource AND methods (both boxes checked in the console), set Access-Control-Allow-Origin to your domain (not * if you're using credentials), and include Access-Control-Allow-Credentials: true if needed.

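For preflight requests, you need to handle the OPTIONS method separately, and with Lambda proxy integrations your function has to return the CORS headers on the real response too (that's the usual reason OPTIONS returns 200 but the POST still gets blocked). VPC cold starts can take 15+ seconds if your ENIs aren't warm - learned this during a live demo that went to shit. This GitHub issue thread has more useful info than AWS's entire CORS documentation.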
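The checklist above can be sketched as a Lambda handler that answers the preflight and also puts the headers on the real response. This assumes a REST API with Lambda proxy integration (the event carries `httpMethod`); the allow-list and the methods and headers it advertises are placeholders, not anything API Gateway mandates:

```python
import json

# Hypothetical allow-list; replace with your real front-end origins.
ALLOWED_ORIGINS = {"https://example.com"}


def cors_headers(origin):
    """Return CORS headers for an allowed origin, or {} for anything else."""
    if origin not in ALLOWED_ORIGINS:
        return {}
    return {
        # Echo the specific origin; "*" is rejected when credentials are sent.
        "Access-Control-Allow-Origin": origin,
        "Access-Control-Allow-Credentials": "true",
        "Access-Control-Allow-Methods": "GET,POST,OPTIONS",
        "Access-Control-Allow-Headers": "Content-Type,Authorization",
        "Vary": "Origin",
    }


def handler(event, context):
    # Header names can arrive in any case, so normalize before lookup.
    incoming = {k.lower(): v for k, v in (event.get("headers") or {}).items()}
    headers = cors_headers(incoming.get("origin"))

    if event.get("httpMethod") == "OPTIONS":
        # Preflight: headers only, no body.
        return {"statusCode": 204, "headers": headers, "body": ""}

    # The actual response needs the same headers or the browser blocks it.
    return {"statusCode": 200, "headers": headers, "body": json.dumps({"ok": True})}
```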

Can I connect API Gateway to my private services?

Yes, but it's complicated. VPC Links let you connect to services in your VPC. On REST APIs they have traditionally required a Network Load Balancer (extra cost and complexity). HTTP APIs do private integrations too, through their own flavor of VPC Link that can target an ALB, NLB, or Cloud Map service. HTTP integrations work for any public endpoint though.

Why does my API return "Missing Authentication Token" when I think auth is working?

That "Missing Authentication Token" error is AWS-speak for "your URL is probably wrong, genius." This cryptic bullshit error usually means:

Your URL is wrong (/prod/users vs /users - yes, the stage matters)
The HTTP method isn't deployed (GET /users works but POST /users returns 403)
You forgot to deploy after making changes (classic mistake)
Custom domain base path mapping is fucked

Check that you're hitting the right stage URL - /prod, /dev, /staging, whatever. If you're using custom domains, make sure the base path mapping actually points to something. IAM, Cognito, and Lambda authorizers work fine when configured correctly (emphasis on "correctly"). And while you're poking around: the REST API integration timeout defaults to 29 seconds, not 30, so don't trust round numbers.
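Since a missing stage is the classic cause, it can help to build the URL in one place instead of by hand. This sketch assumes a REST API on the default execute-api hostname; `invoke_url` is a hypothetical helper, not an AWS SDK function:

```python
def invoke_url(api_id, region, stage, path):
    """Build the default execute-api URL for a deployed REST API stage."""
    if not stage:
        # Leaving the stage out is what produces "Missing Authentication Token".
        raise ValueError("stage is required, e.g. 'prod'")
    return f"https://{api_id}.execute-api.{region}.amazonaws.com/{stage}/{path.lstrip('/')}"


# Example with the placeholder API ID used earlier in this article.
url = invoke_url("abc123", "us-east-1", "prod", "/users")
```

Custom domains with base path mappings change the shape of the URL, so this only applies to the raw execute-api endpoint.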

How do I debug why my API is slow?

CloudWatch logs take 5-10 minutes to show up, so grab coffee while you wait for your debug info.

Enable X-Ray tracing (REST APIs only) to see where time is spent.

Lambda cold starts are usually the culprit - first request after idle can take 500ms+. API Gateway throttling is per-account, so one bad API can kill your others. CloudWatch's Latency and IntegrationLatency metrics show the breakdown: compare your backend response time against total API Gateway latency to isolate the bottleneck.
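The comparison boils down to one subtraction. A small sketch, assuming you've already pulled the `Latency` and `IntegrationLatency` averages out of CloudWatch; the sample numbers are invented:

```python
def latency_breakdown(latency_ms, integration_latency_ms):
    """Split total API Gateway latency into backend time and gateway overhead."""
    overhead = latency_ms - integration_latency_ms
    share = round(integration_latency_ms / latency_ms, 2) if latency_ms else 0.0
    return {
        "backend_ms": integration_latency_ms,  # time spent waiting on the integration
        "gateway_overhead_ms": overhead,       # auth, mapping, routing, etc.
        "backend_share": share,                # fraction of total spent in the backend
    }


# Hypothetical averages: 850 ms total, 800 ms of it inside the integration.
sample = latency_breakdown(850, 800)
```

If `backend_share` is close to 1, tuning API Gateway settings won't help much and the time is better spent on the backend.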

What's the maximum request size and other gotchas?

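The payload limit is 10 MB at API Gateway, but only 6 MB once the request lands in a synchronous Lambda invocation, and that one will bite you if you're passing large datasets. The integration timeout is 29 seconds by default on REST APIs and 30 on HTTP APIs (use async if you need longer).

Throttling limits are per account - one bad API can break your others. The 10,000 requests/second default sounds generous until you price out actually sustaining it: about $90K/month on REST APIs at the $3.50 per million rate.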
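A cheap guard is to measure the serialized payload before sending it. This sketch assumes a Lambda-backed API where the 6 MB synchronous invocation limit is the one you hit first; the exact byte value is an approximation of that quota and `fits_lambda_payload` is a hypothetical helper:

```python
import json

# Assumed ceiling for a synchronous Lambda invocation payload (about 6 MB).
LAMBDA_SYNC_LIMIT_BYTES = 6 * 1024 * 1024


def fits_lambda_payload(obj, limit=LAMBDA_SYNC_LIMIT_BYTES):
    """Return (ok, size_in_bytes) for the JSON-serialized object."""
    size = len(json.dumps(obj).encode("utf-8"))
    return size <= limit, size


ok_small, size_small = fits_lambda_payload({"a": "x"})
ok_big, size_big = fits_lambda_payload({"blob": "x" * (7 * 1024 * 1024)})
```

Anything that fails the check is a candidate for an S3 upload with a pre-signed URL rather than a bigger request body.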

Should I use API Gateway or just put ALB in front of my containers?

Depends. API Gateway gives you auth, rate limiting, and AWS integrations built-in. ALB is cheaper for high-volume traffic but you build auth/throttling yourself. If you're already on containers and don't need API Gateway's features, ALB + Kong/Envoy might be better. For serverless (Lambda), API Gateway makes more sense.

Can I use custom domains without the ugly AWS URLs?

Yes, custom domains work with ACM certificates. You'll need to create the domain, grab the target domain name API Gateway gives you (a CloudFront domain for edge-optimized, a regional execute-api domain for regional), and point your DNS at it. Takes about 20 minutes if everything goes smoothly.

What happens when API Gateway is down?

99.95% SLA sounds good until you realize that's 22 minutes of downtime per month. It runs across multiple AZs but regional outages happen. Edge-optimized APIs use CloudFront's global network for better resilience. Keep circuit breakers and timeouts in your clients.
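For the client side, a circuit breaker doesn't need to be fancy. Below is a minimal sketch of the pattern, not a production library; the thresholds and the injectable `clock` parameter are choices made for illustration:

```python
import time


class CircuitBreaker:
    """Open after `max_failures` consecutive errors; retry after `reset_after` seconds."""

    def __init__(self, max_failures=3, reset_after=30.0, clock=time.monotonic):
        self.max_failures = max_failures
        self.reset_after = reset_after
        self.clock = clock  # injectable so the breaker can be tested without sleeping
        self.failures = 0
        self.opened_at = None

    def call(self, fn, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_after:
                # Still cooling down: fail fast instead of hammering the API.
                raise RuntimeError("circuit open: skipping call")
            # Cool-down elapsed: allow one trial call (half-open state).
            self.opened_at = None
            self.failures = self.max_failures - 1
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = self.clock()
            raise
        self.failures = 0  # any success closes the circuit again
        return result
```

Wrap each outbound request in `breaker.call(...)` and pair it with an explicit per-request timeout, since the breaker only counts failures and won't cut off a call that hangs.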

Real-World Usage (What Actually Works)

Serverless APIs That Don't Suck

API Gateway + Lambda is probably what you'll use. It's the easiest way to build APIs without managing servers, until you discover Lambda cold starts during your first production demo. Cold starts are your biggest enemy - we've seen 2-second delays on first requests after idle periods, and 8+ seconds for Java Spring Boot apps. Provisioned concurrency runs about $0.0000042 per GB-second (roughly $10.80/month for a 1GB function kept warm 24/7) and adds up fast, but fixes the problem if your budget can handle it.

Sync requests work for web APIs, async for long-running stuff. The hourly caching fee (billed by cache size, from $0.02/hour for 0.5 GB) adds up fast - we hit $400/month caching data we barely used. Response caching beats hitting your Lambda repeatedly, but cache invalidation is manual and takes 5-10 minutes to propagate.

Microservices Frontend (Single Point of Failure)

API Gateway works as a front door for microservices, but remember it's another layer that can break. It handles auth, rate limiting, and routing so your services don't have to. Just be aware that now everything depends on API Gateway being up.

You can integrate directly with DynamoDB, S3, and SNS without Lambda functions - saves latency and cost but adds complexity. Great for simple CRUD operations, terrible for anything requiring logic.

Security That Doesn't Get in Your Way

AWS WAF integration (REST APIs only) blocks common attacks like SQL injection. Worth it if you're getting hit by bots. Private APIs stay inside your VPC - great for internal services that shouldn't touch the internet.

Cognito handles OAuth/SAML if you need enterprise SSO. Resource policies let you restrict by IP or VPC. The security is solid, but each feature adds complexity - only enable what you actually need.

Making It Actually Fast

Caching works great if your data doesn't change often. It's billed hourly by cache size though (from $0.02/hour for 0.5 GB), and cache misses still hit your backend. TTL settings are crucial - too short and you're wasting money, too long and you serve stale data.

Regional endpoints are faster and simpler. Edge-optimized sounds fancy but adds CloudFront costs and complexity. Pick regional unless your users are actually global. Lambda cold starts are your biggest enemy - keep functions warm with periodic pings if budget allows.
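If you go the periodic-ping route, the function should recognize the ping and return before doing any real work. This sketch assumes the ping comes from an EventBridge scheduled rule, whose events carry `"source": "aws.events"`; the response bodies are placeholders:

```python
import json


def is_warmup_ping(event):
    """True when the invocation looks like an EventBridge scheduled event."""
    return isinstance(event, dict) and event.get("source") == "aws.events"


def handler(event, context):
    if is_warmup_ping(event):
        # Skip database calls and business logic; the goal is only to stay warm.
        return {"warmed": True}

    # Normal API Gateway proxy request path (placeholder response).
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"ok": True}),
    }
```

A single ping keeps roughly one execution environment warm, so this helps low-traffic APIs far more than anything with real concurrency.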

The Bill Reality Check

HTTP APIs are 71% cheaper than REST APIs - $1/million vs $3.50/million requests. Sounds great until you hit real scale: 100M requests/month is $100 (HTTP) vs $350 (REST). Add data transfer costs, caching fees, and CloudFront charges for edge-optimized APIs.
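The request-charge arithmetic is easy to sanity-check yourself. This sketch uses the flat per-million rates quoted in this article and ignores volume tiers, data transfer, caching, and CloudFront, so treat the output as a floor rather than a forecast:

```python
# Flat list rates quoted in this article (USD per million requests).
REST_PER_MILLION = 3.50
HTTP_PER_MILLION = 1.00


def monthly_request_cost(requests, per_million):
    """Request charges only, at a single flat rate."""
    return requests / 1_000_000 * per_million


rest = monthly_request_cost(100_000_000, REST_PER_MILLION)
http = monthly_request_cost(100_000_000, HTTP_PER_MILLION)
savings = 1 - http / rest  # fraction saved by choosing HTTP APIs

print(f"REST: ${rest:,.2f}  HTTP: ${http:,.2f}  savings: {savings:.0%}")
```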

Usage plans let you charge customers for API access and throttle them when they go over. They're great until you realize customers will hit your rate limits in creative ways you didn't expect - like making 1000 requests in the first second of each minute to game your per-minute limits. CloudWatch metrics show where your money goes - sort by Count metric to find your most expensive APIs. Set billing alarms at $100, $500, and $1000 - we learned this the hard way when a traffic spike from a Reddit post mentioning our API cost us $2K overnight (mostly Lambda invocations, not API Gateway charges).
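API Gateway documents its throttling as a token bucket (a steady rate plus a burst allowance), which is what makes the front-loaded pattern above less effective than it would be against a plain per-minute counter. Here is a rough sketch of the idea, not AWS's implementation; the rate, burst, and injectable `clock` are illustrative:

```python
import time


class TokenBucket:
    """Allow `burst` requests at once, then refill at `rate` tokens per second."""

    def __init__(self, rate, burst, clock=time.monotonic):
        self.rate = rate
        self.burst = burst
        self.clock = clock  # injectable for testing
        self.tokens = float(burst)
        self.last = clock()

    def allow(self):
        now = self.clock()
        # Refill based on elapsed time, never exceeding the burst capacity.
        self.tokens = min(self.burst, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

With `rate=10` and `burst=20`, a client firing 1000 requests in the same instant gets 20 through and then has to wait for the refill. A fixed per-minute counter would have let the whole minute's allowance through at once.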
