System design learning note 1: Design a system that scales to millions of users on AWS

Qi Hu
3 min read · Dec 24, 2021

This is a learning note from this link.

System Design Steps

EC2 + MySQL DB

Vertical scaling

Basic monitoring: CPU, memory, I/O, network

EC2 with public static IP (AWS Elastic IP)

DNS (Route 53) to map the domain to the public IP
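The Route 53 mapping boils down to an A record pointing the domain at the Elastic IP. A minimal sketch of the change request, assuming a placeholder domain and IP; in practice this dict is what gets passed to boto3's `route53.change_resource_record_sets(HostedZoneId=..., ChangeBatch=...)`:

```python
# Build the ChangeBatch that maps a domain to an Elastic IP via an A record.
# "example.com." and the IP below are placeholders.

def a_record_change(domain: str, elastic_ip: str, ttl: int = 300) -> dict:
    """UPSERT an A record so `domain` resolves to `elastic_ip`."""
    return {
        "Changes": [
            {
                "Action": "UPSERT",  # create the record, or update if it exists
                "ResourceRecordSet": {
                    "Name": domain,
                    "Type": "A",
                    "TTL": ttl,
                    "ResourceRecords": [{"Value": elastic_ip}],
                },
            }
        ]
    }

batch = a_record_change("example.com.", "203.0.113.10")
print(batch["Changes"][0]["ResourceRecordSet"]["Type"])  # A
```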

Security

Allow incoming requests only on:

  1. 80 for HTTP
  2. 443 for HTTPS
  3. 22 for SSH (ideally restricted to trusted IPs)

Block outbound connections that are not needed.
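A sketch of the ingress rules for the ports above, in the shape boto3's `ec2.authorize_security_group_ingress(..., IpPermissions=...)` accepts. The SSH CIDR here is a placeholder for a trusted range:

```python
# Security group ingress rules: HTTP/HTTPS open to the world, SSH
# restricted. The CIDR "203.0.113.0/24" is a placeholder trusted range.

def ingress_rules(ssh_cidr: str = "203.0.113.0/24") -> list:
    def rule(port: int, cidr: str) -> dict:
        return {
            "IpProtocol": "tcp",
            "FromPort": port,
            "ToPort": port,
            "IpRanges": [{"CidrIp": cidr}],
        }
    return [
        rule(80, "0.0.0.0/0"),   # HTTP from anywhere
        rule(443, "0.0.0.0/0"),  # HTTPS from anywhere
        rule(22, ssh_cidr),      # SSH only from a trusted range
    ]

print([r["FromPort"] for r in ingress_rules()])  # [80, 443, 22]
```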

DNS -> Web Server -> MySQL + Object Store

Add an Object Store (e.g. S3) to store static content

DNS, CDN, Load Balancer -> Web servers, Application Servers -> MySQL (master-slave), Object Store

Horizontal scaling

  1. Multiple Servers across multiple AZs
  2. Multiple DBs in master-slave failover mode

Load Balancer

  1. AWS ELB is highly available
  2. Terminate SSL on the LB to reduce the pressure on backend servers

Application Servers separate from Web Servers

  1. web servers can run as a reverse proxy
  2. some app servers process write APIs, some process read APIs
  3. they scale independently

Add CDN such as CloudFront

DNS, CDN, Load Balancer -> Web servers, Application Servers -> MySQL (master-slave), MySQL Read replicas, Memory Cache, Object Store

First, configure the MySQL DB cache to see if it is sufficient; if not, use a memory cache (e.g. Memcached or Redis) to store:

  1. frequently accessed content from mysql
  2. session data
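The usual pattern here is cache-aside: check the memory cache first and fall back to MySQL on a miss. A minimal sketch, where a dict stands in for Memcached/Redis and `query_mysql` is a stand-in for the real DB call:

```python
# Cache-aside: read from the memory cache first, fall back to MySQL on
# a miss, and populate the cache on the way out.

cache: dict = {}

def query_mysql(user_id: int) -> dict:
    # Placeholder for a real SELECT against MySQL.
    return {"id": user_id, "name": f"user-{user_id}"}

def get_user(user_id: int) -> dict:
    key = f"user:{user_id}"
    if key in cache:            # cache hit: skip the DB entirely
        return cache[key]
    row = query_mysql(user_id)  # cache miss: read from MySQL...
    cache[key] = row            # ...and fill the cache for next time
    return row

get_user(42)                    # miss, fills the cache
get_user(42)                    # hit, served from memory
```

The same shape works for session data, with a TTL on each entry in a real cache.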

Add read replicas for mysql to reduce load on write master

  1. add an LB in front of the read replicas
  2. most services are read-heavy rather than write-heavy
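With replicas in place, the application has to route reads and writes to different endpoints. A sketch with placeholder hostnames, using round-robin where a real LB would sit in front of the replicas:

```python
# Route SELECTs to read replicas and everything else to the write
# master. Hostnames are placeholders for the real DB endpoints.

import itertools

MASTER = "mysql-master.internal"
REPLICAS = ["mysql-replica-1.internal", "mysql-replica-2.internal"]
_replica_cycle = itertools.cycle(REPLICAS)  # stand-in for an LB

def endpoint_for(sql: str) -> str:
    """Send reads to a replica, writes to the master."""
    if sql.lstrip().upper().startswith("SELECT"):
        return next(_replica_cycle)
    return MASTER

assert endpoint_for("SELECT * FROM users") in REPLICAS
assert endpoint_for("UPDATE users SET name = 'x'") == MASTER
```

One caveat worth remembering: replication lag means a read replica may briefly return stale data after a write.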

More server instances

DNS, CDN, Load Balancer -> Web servers, Application Servers -> MySQL (master-slave), MySQL Read replicas, Memory Cache, Object Store

Add auto-scaling

  1. AWS AutoScaling
  2. one group per app server type/web server type; place each group in multiple AZs
  3. set up min/max number of instances
  4. scale up/down through CloudWatch alarms, using metrics like CPU, latency, network traffic, or a custom metric
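For the CPU case above, a target-tracking policy is the simplest setup. A sketch of the policy in the shape boto3's `autoscaling.put_scaling_policy` accepts; the group name and target value are placeholder choices:

```python
# Target-tracking scaling policy: keep the group's average CPU near a
# target; the ASG scales out above it and in below it.

def cpu_target_policy(asg_name: str, target_cpu: float = 50.0) -> dict:
    return {
        "AutoScalingGroupName": asg_name,
        "PolicyName": f"{asg_name}-cpu-target",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingConfiguration": {
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "ASGAverageCPUUtilization",
            },
            "TargetValue": target_cpu,
        },
    }

policy = cpu_target_policy("web-servers-asg")
print(policy["PolicyType"])  # TargetTrackingScaling
```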

Automate DevOps

  1. Chef
  2. Puppet
  3. Ansible

Monitor metrics

  1. host level: single EC2 instance metrics
  2. aggregate level: LB stats
  3. log analysis: Splunk, CloudWatch, CloudTrail
  4. external site performance: New Relic
  5. incidents: PagerDuty
  6. error reporting: Sentry

DNS, CDN, Load Balancer -> Web servers, Application Servers -> MySQL (master-slave), MySQL Read replicas, Memory Cache, Object Store, NoSQL

Consider using a data warehouse to store long-lived data if the DB grows too large.

  1. Redshift can comfortably handle a load of 1 TB of new content per month

Scale memory cache if we reach 40k reads/s
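One common way to scale a cache tier past a single node is to shard keys across several cache nodes by hash. A sketch with placeholder node names; note that plain modulo remaps most keys when a node is added or removed, which is why consistent hashing is often used instead:

```python
# Shard cache keys across nodes by hashing the key. Node names are
# placeholders for real Memcached/Redis endpoints.

import hashlib

CACHE_NODES = ["cache-1", "cache-2", "cache-3"]

def node_for_key(key: str) -> str:
    # md5 spreads keys evenly; any stable hash works here.
    digest = hashlib.md5(key.encode()).hexdigest()
    return CACHE_NODES[int(digest, 16) % len(CACHE_NODES)]

# Every key deterministically maps to exactly one node.
assert node_for_key("user:42") == node_for_key("user:42")
assert node_for_key("user:42") in CACHE_NODES
```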

Think about other scaling patterns for DBs

  1. federation
  2. sharding
  3. denormalization
  4. SQL tuning
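Sharding, for example, spreads rows across several MySQL instances by a partition key. A minimal sketch assuming hash-by-user-id and placeholder shard hostnames (federation, by contrast, would split by function: a users DB, a products DB, and so on):

```python
# Hash-based sharding: each user id maps to exactly one MySQL shard.
# Hostnames are placeholders for the real shard endpoints.

SHARDS = [
    "mysql-shard-0.internal",
    "mysql-shard-1.internal",
    "mysql-shard-2.internal",
    "mysql-shard-3.internal",
]

def shard_for_user(user_id: int) -> str:
    """Pick the shard that owns this user's rows."""
    return SHARDS[user_id % len(SHARDS)]

assert shard_for_user(7) == "mysql-shard-3.internal"
assert shard_for_user(8) == "mysql-shard-0.internal"
```

The trade-off: queries touching a single user stay fast, but cross-shard joins and re-sharding become application-level problems.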

Some data can be moved to a NoSQL DB such as DynamoDB

Processes that do not need to happen in real time can be handled asynchronously with queues and workers

  1. SQS + Lambda
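The pattern: the request path enqueues a job and returns immediately, and a worker drains the queue later. A sketch where the stdlib queue stands in for SQS and the worker loop for a Lambda consumer:

```python
# Queue-and-worker decoupling: enqueue on the request path, process
# later. queue.Queue stands in for SQS; worker_drain for a Lambda.

import queue

jobs: "queue.Queue[dict]" = queue.Queue()
processed = []

def handle_request(user_id: int) -> str:
    # e.g. resizing an uploaded image need not block the HTTP response
    jobs.put({"task": "resize_image", "user_id": user_id})
    return "accepted"           # respond before the work is done

def worker_drain() -> None:
    while not jobs.empty():
        processed.append(jobs.get())  # a real worker would do the task

handle_request(1)
handle_request(2)
worker_drain()
assert len(processed) == 2
```

With SQS, a real consumer would also delete each message after processing and rely on the visibility timeout for retries.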
