Companion YouTube Channel
https://www.youtube.com/@CSAI-TTL
Please note that this website is a work in progress; only linked content is currently available for viewing.
Domain 1: Foundations and Core Concepts
- What is System Design? Goals, constraints, and trade-offs
- Latency vs. Throughput
- Bandwidth and Capacity Planning
- SLA, SLO, SLI: Definitions and implementation
- CAP Theorem
- PACELC Theorem
- ACID vs. BASE
- Consistency Models: Strong, eventual, causal
- Stateful vs. Stateless Architecture
- Fault Tolerance and Graceful Degradation
- Back-of-the-Envelope Estimation (QPS, storage, bandwidth)
- 12-Factor App Principles
Domain 2: Networking and Communication
- OSI Model and TCP/IP Fundamentals
- DNS Resolution: TTL, Anycast
- HTTP Evolution: HTTP/1.1 vs. HTTP/2 vs. HTTP/3 (QUIC)
- TLS and mTLS Fundamentals
- WebSockets vs. SSE vs. Long Polling
- gRPC Design and Streaming Modes
- GraphQL Architecture: Federation, N+1 mitigation
- REST API Design Patterns: Versioning, pagination
- API Gateway Patterns: Routing, auth, throttling
- Service Mesh Concepts: Istio, Linkerd
- CDN Architecture and Cache Invalidation
- Load Balancing: L4 vs. L7, routing algorithms
- Reverse Proxy vs. Forward Proxy
- P2P Networking and WebRTC
- Network Partition and Split-Brain Handling
Domain 3: Data Storage and Databases
- Relational Database Schema Design and Indexing
- Query Optimization and Execution Plans
- Transactions and Isolation Levels
- Distributed SQL Systems
- NoSQL Categories: Document, key-value, wide-column, graph
- Document Stores and Schema Patterns
- Key-Value Stores and TTL/Eviction Strategies
- Wide-Column Stores: Partitioning and compaction
- Graph Databases and Query Models
- Time-Series Database Design
- Search Engines and Relevance Architecture
- Vector Databases and ANN Indexes
- NewSQL Approaches
- Replication Models: Sync/async, leader-follower, multi-master
- Sharding Strategy and Hotspot Mitigation
- Read Replicas and Query Routing
- Partitioning: Range, hash, list, composite
- Object Storage Architecture and Access Patterns
- Data Archival and Storage Tiering
- Zero-Downtime Database Migration
- Polyglot Persistence
- Connection Pooling and DB Proxy Patterns
Domain 4: Caching
- Cache Fundamentals and Hit Ratio Optimization
- Cache Placement Layers: Client/CDN/App/DB
- Caching Strategies: Cache-aside, read-through, write-through, write-behind
- Redis Patterns: Data structures, streams, pub/sub
- Distributed Cache Topology and Hashing
- Cache Stampede Mitigation
- Cache Invalidation Strategies
- Probabilistic Data Structures: Bloom filters, HyperLogLog
- Hot-Key Mitigation
Domain 5: Scalability
- Vertical vs. Horizontal Scaling
- Auto-Scaling: Reactive, predictive, scheduled
- Stateless Scaling and Session Externalization
- Read Scaling via Replicas and Caching
- Write Scaling via CQRS and Event Sourcing
- Consistent Hashing with Virtual Nodes
- Cell-Based Architecture
- Bulkhead Isolation and Blast-Radius Control
- Backpressure and Flow Control
- Data Locality and Affinity Routing
- Multi-Region and Geo-Distributed Architecture
- Elasticity vs. Scalability
Domain 6: Reliability and Availability
- High Availability Patterns: Active-active, active-passive
- Failover Mechanisms and Trade-offs
- Circuit Breaker Pattern
- Retry Strategy with Exponential Backoff and Jitter
- Timeouts and Deadline Propagation
- Liveness/Readiness/Startup Checks
- Chaos Engineering Principles and Tooling
- Disaster Recovery: RTO/RPO planning
- Idempotency in Distributed Workflows
- Exactly-Once Semantics Trade-offs
- Redundancy and Replication for Resilience
- Runbooks and Game-Day Operations
- Degraded-Mode Operation Design
Domain 7: Messaging and Event-Driven Architecture
- Synchronous vs. Asynchronous Communication
- Queueing Systems (Point-to-Point)
- Pub/Sub Systems and Fanout
- Kafka Architecture: Partitions, offsets, consumer groups
- Event-Driven Architecture Patterns
- Event Sourcing
- CQRS
- Event Mesh and Event Streaming Platforms
- Outbox Pattern
- Saga Pattern: Orchestration vs. choreography
- Dead-Letter Queue Strategy
- Stream vs. Batch Processing
- Schema Evolution and Compatibility
- Delivery Guarantees and Exactly-Once Processing
Domain 8: Microservices and Service Architecture
- Monolith vs. Microservices Decision Framework
- Service Decomposition Strategies
- Inter-Service Communication Patterns
- API Gateway and Backend-for-Frontend (BFF)
- Service Discovery Patterns
- Sidecar and Ambassador Patterns
- Distributed Tracing in Service Ecosystems
- Modular Monolith Architecture
- Strangler Fig Migration Pattern
- Service Versioning and Compatibility
- Contract Testing
- Micro-Frontend Architecture
- Shared Library Governance in Distributed Teams
Domain 9: Security in System Design
- Authentication Patterns and Credential Lifecycle
- OAuth 2.0 and OpenID Connect
- Authorization Models: RBAC, ABAC, ReBAC
- Enterprise SSO with SAML
- Zero Trust Architecture
- Encryption in Transit and at Rest
- Secrets Management Patterns
- API Security Hardening
- DDoS Mitigation Architecture
- Software Supply-Chain Security (SBOM, signing)
- Data Privacy and Compliance Architecture
- Audit Logging and Tamper Evidence
- Threat Modeling (STRIDE, PASTA)
- WAF and IDS/IPS Integration
Domain 10: Distributed Systems Theory
- Consensus Algorithms: Paxos and Raft
- Leader Election Strategies
- Distributed Locks and Coordination Services
- Logical Clocks and Causality
- Lamport Timestamps
- Byzantine Fault Tolerance
- Two-Phase and Three-Phase Commit
- Distributed Transaction Models
- CRDTs for Conflict-Free Replication
- Gossip Protocols
- Quorum-Based Read/Write Systems
- Linearizability vs. Serializability
- Fallacies of Distributed Computing
Domain 11: Cloud and Infrastructure
- Cloud Service Models: IaaS, PaaS, SaaS, FaaS
- Shared Responsibility Model
- VPC/Subnet/Routing/Security Architecture
- Multi-Cloud and Hybrid Patterns
- Infrastructure as Code (IaC)
- Serverless Architecture and Trade-offs
- Container Fundamentals
- Kubernetes Architecture and Workloads
- Kubernetes Advanced Patterns: Operators, CRDs
- Service Mesh on Kubernetes
- Cloud Cost Optimization Design
- Landing Zone Architecture
- FinOps Operating Model
- Spot/Preemptible Instance Strategies
Domain 12: Data Engineering and Analytics
- Data Warehouse vs. Data Lake vs. Lakehouse
- ETL vs. ELT
- Batch Processing Architecture
- Stream Processing Architecture
- Lambda Architecture
- Kappa Architecture
- Data Mesh Principles
- Metadata, Catalog, and Governance
- CDC (Change Data Capture) Patterns
- Data Quality and Lineage
- Schema Registry and Data Contracts
- Columnar Formats: Parquet, ORC, Arrow
- OLAP Engines and Analytical Serving
- Feature Stores for ML Systems
Domain 13: Observability and SRE
- Logs, Metrics, and Traces Fundamentals
- Structured Logging Architecture
- Metrics Collection and Dashboard Strategy
- Distributed Tracing and Context Propagation
- Alerting Strategy and On-Call Design
- OpenTelemetry Deep Dive
- SRE: Error budgets and toil reduction
- Synthetic Monitoring and Canary Checks
- Continuous Profiling
- AIOps for Anomaly Detection and Auto-Remediation
Domain 14: CI/CD and Delivery
- Deployment Strategies: Rolling, canary, blue-green, shadow
- CI/CD Pipeline Architecture
- Feature Flag Architecture
- Branching Strategies: Trunk-based vs. Gitflow
- GitOps Delivery Model
- Supply Chain Controls in CI/CD
- Test Strategy Portfolio
- Database Migration in Release Pipelines
- Artifact Versioning and Promotion
Domain 15: Real-Time and Streaming Systems
- Real-Time vs. Near-Real-Time Trade-offs
- Chat System Architecture
- Notification Architecture: Multi-channel fanout
- Presence and Typing Indicators at Scale
- Live Video Streaming Design
- Collaborative Editing (OT vs. CRDT)
- Geospatial / Ride-Hailing Systems
- Low-Latency Trading System Design
- IoT Ingestion and Edge Processing
Domain 16: AI and ML Systems
- ML Lifecycle Architecture: Train, evaluate, serve
- Inference Patterns: Online, batch, stream
- Feature Stores and Consistency
- MLOps: Model registry, lineage, drift monitoring
- Experimentation and A/B Testing Platforms
- Shadow Deployments and Canaries for ML
- LLM Inference Infrastructure
- Retrieval-Augmented Generation (RAG) Architecture
- Agentic System Orchestration and Tool Use
- Vector Search at Scale
- Responsible AI Controls
- Multimodal System Architecture
Domain 17: API and Developer Experience
- REST Resource Modeling and HTTP Semantics
- API Versioning Lifecycle
- Pagination Patterns: Cursor vs. offset
- GraphQL Federation
- Webhook Platform Design
- SDK Design and Dev Portal Architecture
- OpenAPI/AsyncAPI Contract-First Design
- Rate Limiting Algorithms
- API Monetization and Quota Systems
- API Governance: Internal vs. external
Domain 18: Frontend and Mobile Architecture
- SPA vs. SSR vs. SSG vs. ISR
- Micro-Frontend Patterns
- State Management Strategies
- PWA and Offline-First Architecture
- Web Performance Architecture (Core Web Vitals)
- Mobile Architecture Patterns
- Cross-Platform vs. Native Trade-offs
- Offline Sync and Conflict Resolution
- Frontend Observability and RUM
Domain 19: Enterprise Architecture Patterns
- Enterprise Integration Patterns
- Domain-Driven Design (DDD)
- Hexagonal Architecture
- Clean Architecture
- TOGAF Overview and Adaptation
- Architecture Decision Records (ADRs)
- Architecture Fitness Functions
- Platform Engineering and Internal Developer Platforms (IDPs)
- Technical Debt Management Frameworks
- API-First Enterprise Strategy
- Team Topologies and Conway’s Law
- Evolutionary Architecture
Domain 20: Classic System Design Case Studies
- URL Shortener Platform
- Web Crawler Platform
- Search Engine Architecture
- Social Feed/Timeline System
- Ride-Hailing Platform (Uber/Lyft)
- Video Streaming Platform (Netflix/YouTube)
- Messaging/Chat Platform (WhatsApp/Slack)
- Payments and Ledger System
- File Storage and Sync Platform (Dropbox/Google Drive)
- E-commerce Platform (Amazon)
- Recommendation Engine
- Notification Platform
- Distributed Rate Limiter
- Ad Serving Platform
- Distributed Message Queue
- Distributed Key-Value Store
- Global CDN Architecture
- Code Collaboration Platform (GitHub)
- Booking and Inventory Platform (Airbnb/Expedia)
- Email Platform
Domain 21: Cutting-Edge and Emerging Topics
- eBPF for Networking and Observability
- Server-Side WebAssembly (Wasm) and Edge Compute
- Confidential Computing and Trusted Execution Environments (TEEs)
- Quantum-Resistant Cryptography
- AI-Native Infrastructure Design
- Self-Healing / Autonomic Systems
- Unified Data and AI Platform Patterns
- Multi-Agent AI Orchestration Platforms
- Event-Driven Serverless at Edge Scale
- Programmable Data Planes and SmartNICs
- Sustainable and Carbon-Aware Architecture
- Decentralized Identity and Verifiable Credentials