
System Design & Distributed Systems: The Complete Guide

Consensus algorithms, caching strategies, database internals, networking protocols, and architecture patterns for large-scale systems.

17 Articles · 148 min total read time

1

Networking & Protocols

How data moves across the wire — from TCP/UDP to HTTP evolution and DNS.

2

Data Infrastructure

Database internals, indexing, caching, and migrations at scale.

3

Distributed Systems Patterns

Consensus, rate limiting, messaging, idempotency, and microservice architecture.

10
7 min read · Hard

Understanding Raft Consensus: The Algorithm That Keeps Your Database Honest

How Raft works in production: leader election, log replication, safety guarantees, and failure modes in etcd and CockroachDB.

11
9 min read · Medium

Consistent Hashing: The Algorithm Behind Every Scalable Distributed System

Adding one cache server shouldn't invalidate every key. Consistent hashing with virtual nodes and bounded loads — full Go and Java implementations.

12
7 min read · Medium

Rate Limiter Algorithms: Token Bucket vs Sliding Window

Five rate limiting algorithms, their trade-offs, how to distribute them across a fleet, and client-side backoff that works.

13
6 min read · Hard

Distributed Rate Limiting at Scale: The Probabilistic Drop Architecture

Probabilistic drop rate limiting: enforcement without cross-node coordination, bypassing Redis to handle 1M+ RPS.

14
9 min read · Medium

Idempotency Patterns: Building Retry-Safe Distributed Systems

Why exactly-once is a myth, and how idempotency keys, database constraints, and the outbox pattern make retries safe in Go and Java.

15
10 min read · Hard

Kafka vs RabbitMQ vs NATS vs SQS: Choosing the Right Message Broker

Delivery semantics, ordering, throughput, ops complexity, and a decision framework with Go code.

16
13 min read · Hard

Microservices Architecture: From Monolith to Production-Ready Services

When to decompose a monolith, how to define service boundaries, and the patterns that work: API gateways, sagas, and event-driven communication.

17
9 min read · Hard

REST vs gRPC vs GraphQL: A Production Decision Guide

How a team serving mobile, microservices, and third-party integrations ended up running REST, gRPC, and GraphQL together.