Amazon DynamoDB is a fully managed, serverless NoSQL key-value and document database. It delivers single-digit millisecond latency at any scale, with built-in horizontal sharding, multi-AZ replication, and optional global replication — and no servers, patches, or storage capacity to manage.
Prefer the batch APIs (BatchGetItem, BatchWriteItem) over per-item calls, and Query over Scan; for large result sets, follow LastEvaluatedKey to paginate.
import boto3
from decimal import Decimal
from boto3.dynamodb.conditions import Key

table = boto3.resource("dynamodb", region_name="us-west-2").Table("Orders")

# Write one item under a composite key. Note: the resource API rejects
# Python floats, so numeric attributes must be Decimal.
table.put_item(Item={
    "pk": "CUSTOMER#1042",
    "sk": "ORDER#2026-04-21#A-482",
    "total": Decimal("129.95"),
    "status": "SHIPPED",
})

# One Query returns all of this customer's April 2026 orders, in sort-key order.
resp = table.query(
    KeyConditionExpression=Key("pk").eq("CUSTOMER#1042")
    & Key("sk").begins_with("ORDER#2026-04"),
)
for item in resp["Items"]:
    print(item["sk"], item["status"], item["total"])
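When a Query or Scan result exceeds 1 MB, the response carries a LastEvaluatedKey that must be fed back as ExclusiveStartKey. A minimal pagination helper, written generically so it wraps any boto3 query/scan callable (the helper name is illustrative, not a boto3 API):

```python
from typing import Callable, Iterator


def paginate(query_fn: Callable[..., dict], **kwargs) -> Iterator[dict]:
    """Yield every item across pages by following LastEvaluatedKey."""
    while True:
        resp = query_fn(**kwargs)
        yield from resp.get("Items", [])
        last_key = resp.get("LastEvaluatedKey")
        if last_key is None:
            return
        # Resume the next page where the previous one stopped.
        kwargs["ExclusiveStartKey"] = last_key
```

Usage: `items = list(paginate(table.query, KeyConditionExpression=...))`.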
client = boto3.client("dynamodb", region_name="us-west-2")

# Atomically move 100 from account A to account B. The whole transaction
# fails if A's balance would go negative (the ConditionExpression on A).
client.transact_write_items(TransactItems=[
    {"Update": {
        "TableName": "Accounts",
        "Key": {"pk": {"S": "ACCT#A"}},
        "UpdateExpression": "SET balance = balance - :amt",
        "ConditionExpression": "balance >= :amt",
        "ExpressionAttributeValues": {":amt": {"N": "100"}},
    }},
    {"Update": {
        "TableName": "Accounts",
        "Key": {"pk": {"S": "ACCT#B"}},
        "UpdateExpression": "SET balance = balance + :amt",
        "ExpressionAttributeValues": {":amt": {"N": "100"}},
    }},
])
LSI (Local Secondary Index) shares the partition key with the base table but has a different sort key, must be defined at table creation, and shares throughput with the base table — limited to 10 GB per partition key. GSI (Global Secondary Index) has independent partition and sort keys, can be added or dropped anytime, has its own throughput, and is eventually consistent. Use GSIs almost always.
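A sketch of adding a GSI to an existing table. The index name, table, and attributes below are illustrative; the params dict is the shape boto3's real `client.update_table(**params)` expects (no ProvisionedThroughput block needed on an on-demand table):

```python
# Hypothetical "status-index" GSI so Orders can be queried by status.
# This dict would be passed to client.update_table(**params).
params = {
    "TableName": "Orders",
    "AttributeDefinitions": [
        {"AttributeName": "status", "AttributeType": "S"},
        {"AttributeName": "sk", "AttributeType": "S"},
    ],
    "GlobalSecondaryIndexUpdates": [{
        "Create": {
            "IndexName": "status-index",
            "KeySchema": [
                {"AttributeName": "status", "KeyType": "HASH"},
                {"AttributeName": "sk", "KeyType": "RANGE"},
            ],
            # Copy every attribute into the index (vs KEYS_ONLY / INCLUDE).
            "Projection": {"ProjectionType": "ALL"},
        }
    }],
}
```

Once the index is active, read it with `table.query(IndexName="status-index", KeyConditionExpression=Key("status").eq("SHIPPED"))`; remember GSI reads are eventually consistent.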
Choose high-cardinality partition keys (user ID, not country code), avoid time-only keys for writes (shard with random suffix or hash prefix), and rely on adaptive capacity to handle short bursts. For genuinely uneven access patterns, write-shard the key (USER#123#0..15) and fan-out reads.
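Write-sharding can be sketched as pure key construction; the shard count and helper names are assumptions to tune per workload:

```python
import hashlib

NUM_SHARDS = 16  # assumption: 16 write shards; size to your peak write rate


def sharded_pk(base: str, shard_of: str) -> str:
    """Deterministically spread writes for a hot key across NUM_SHARDS
    partitions, e.g. USER#123 -> USER#123#7."""
    shard = int(hashlib.md5(shard_of.encode()).hexdigest(), 16) % NUM_SHARDS
    return f"{base}#{shard}"


def all_shards(base: str) -> list[str]:
    """Keys a read must fan out over: one Query per shard, results merged
    client-side."""
    return [f"{base}#{n}" for n in range(NUM_SHARDS)]
```

Writes go to `sharded_pk("USER#123", order_id)`; a full read issues one Query per key in `all_shards("USER#123")` and merges.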
When you have known, bounded access patterns and want one network round-trip per query. Multiple entity types share one table with composite keys (pk: USER#123, sk: ORDER#...) so a single Query returns related items. Adds modeling complexity; not worth it for simple CRUD.
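The composite-key convention above can be captured in small item-builder helpers (entity names and attributes here are illustrative, not from any library):

```python
def user_item(user_id: str, **attrs) -> dict:
    """Profile item: fixed sort key so it sorts ahead of the user's orders."""
    return {"pk": f"USER#{user_id}", "sk": "PROFILE", **attrs}


def order_item(user_id: str, order_date: str, order_id: str, **attrs) -> dict:
    """Order item: date-prefixed sort key keeps orders in chronological order
    under the same partition key as the profile."""
    return {"pk": f"USER#{user_id}", "sk": f"ORDER#{order_date}#{order_id}", **attrs}
```

Because both entity types share `pk`, a single Query on `USER#123` returns the profile plus all orders in one round trip; `begins_with("ORDER#")` narrows to orders only.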
On-demand: zero capacity planning, instant scaling, pay per request — best for unpredictable, spiky, or new workloads. Provisioned: cheaper at steady traffic, supports reserved capacity discounts, requires auto-scaling configuration. Switch from on-demand to provisioned once traffic is predictable.
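The switch itself is a single UpdateTable call; the table name and capacity numbers below are illustrative, and the dict is the shape boto3's real `client.update_table(**params)` takes:

```python
# Move a table from on-demand to provisioned once traffic is predictable.
# AWS rate-limits billing-mode switches (roughly once per 24 hours per table).
params = {
    "TableName": "Orders",
    "BillingMode": "PROVISIONED",
    "ProvisionedThroughput": {
        "ReadCapacityUnits": 200,   # illustrative steady-state read rate
        "WriteCapacityUnits": 50,   # illustrative steady-state write rate
    },
}
```

Pair this with an auto-scaling policy (Application Auto Scaling) so provisioned capacity tracks daily traffic curves.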
Ordered change log per partition of all item writes, retained for 24 hours. Consumed by Lambda for event-driven processing: denormalize into search indexes, fan out to other services, or trigger workflows. Whether a record carries the old image, the new image, or both depends on the table's stream view type (e.g. NEW_AND_OLD_IMAGES).
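A minimal Lambda consumer sketch, assuming a stream configured with NEW_AND_OLD_IMAGES so each record carries both images (the status-transition logic is an illustrative example, not a fixed pattern):

```python
def handler(event, context):
    """React only to items whose status just transitioned to SHIPPED."""
    shipped = []
    for record in event["Records"]:
        # INSERT / MODIFY / REMOVE; only MODIFY has both images.
        if record["eventName"] != "MODIFY":
            continue
        old = record["dynamodb"].get("OldImage", {})
        new = record["dynamodb"].get("NewImage", {})
        # Stream images use the low-level attribute format, e.g. {"S": "..."}.
        if (old.get("status", {}).get("S") != "SHIPPED"
                and new.get("status", {}).get("S") == "SHIPPED"):
            shipped.append(new["sk"]["S"])
    return shipped
```

Wire it up with an event source mapping from the stream ARN to the function; Lambda then polls shards and invokes the handler with batches of records in order per shard.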
Last-writer-wins based on the latest write timestamp at the originating region. There's no application-level conflict resolution hook. Design partition keys so concurrent writes to the same item from multiple regions are rare (e.g., region-affinitized customers).
For read-heavy workloads where p99 latency matters more than cost (microseconds vs. single-digit milliseconds), and where the working set fits in DAX cluster memory. DAX is a write-through cache; writes go to DynamoDB first, then update the cache. Doesn't help write-heavy workloads.
DynamoDB is the default choice on AWS when you need predictable performance at any scale and your access patterns can be modeled as key-value or document lookups.