Skip to main content

Designing Memcached or an in-memory KV store

Requirements

High-performance, distributed key-value store

Why distributed?
- Answer: to hold a larger size of data

For in-memory storage of small data objects
Simple server (pushing complexity to the client) and hence reliable and easy to deploy

Architecture

Big Picture: Client-server

client
given a list of Memcached servers
chooses a server based on the key
server
store KVs into the internal hash table
LRU eviction

The Key-value server consists of a fixed-size hash table + single-threaded handler + coarse locking

hash table

How to handle collisions? Mostly three ways to resolve:

Separate chaining: the collided bucket chains a list of entries with the same index, and you can always append the newly collided key-value pair to the list.
open addressing: if there is a collision, go to the next index until finding an available bucket.
dynamic resizing: resize the hash table and allocate more spaces; hence, collisions will happen less frequently.

How does the client determine which server to query?

See Data Partition and Routing

How to use cache?

See Key value cache

How to further optimize?

See How Facebook Scale its Social Graph Store? TAO

References:

Want to keep learning more?

Twitter LinkedIn Telegram Discord 小红书

Requirements
Architecture
How does the client determine which server to query?
How to use cache?
How to further optimize?

About Tian Pan

I'm Tian Pan, an engineer-founder focused on agentic engineering — building autonomous AI systems and scaling engineering teams. I write practical guides on system design, technical leadership, and shipping with AI agents. Previously an early engineer at Uber, Brex, and IoTeX.