System Design Tutorial Summary

Introduction

Coverage of core concepts for system design interviews: scalability, reliability, data handling, and high-level architecture.
Focus on how to "glue" a system together rather than coding.

Layered System in Computers
- Functions through binary (0s and 1s).
- Data Units:
  - Bit: smallest unit (0 or 1)
  - Byte: 8 bits; represents a single character.
  - Larger units: Kilobyte, Megabyte, Gigabyte, Terabyte.
- Disk Storage:
  - Types: HDD (slower, cheaper) vs. SSD (faster, more expensive).
- RAM (Random Access Memory):
  - Primary active data holder (volatile memory).
  - Speed: 5,000 MB/sec and above.
- Cache:
  - Smaller than RAM (measured in MB), faster access times.
  - Levels: L1, L2, L3 caches.
- CPU (Central Processing Unit):
  - Processes instructions; high-level code compiled into machine code.
- Motherboard:
  - Connects all components and facilitates data flow.

CI/CD Pipeline:
- Automates deployment processes (e.g., Jenkins, GitHub Actions).
Load Balancers & Reverse Proxies:
- Distribute user requests across multiple servers.
External Storage:
- Stores data not on the same server, accessed over the network.
Logging & Monitoring:
- Tools: PM2 for backend; Sentry for frontend.
- Alerting services integrated with platforms like Slack.
Debugging Process:
- Identify issues through logs, replicate in staging, apply hotfixes.

Key Principles:
- Scalability: system growth with user base.
- Maintainability: ease of understanding for future developers.
- Efficiency: optimal use of resources.
Core Elements:
- Moving Data: seamless data flow.
- Storing Data: considerations for SQL vs. NoSQL.
- Transforming Data: turning raw data into meaningful information.

Components:
- Consistency: all nodes have the same data.
- Availability: system is always operational.
- Partition Tolerance: system functions during network partitions.
Only two out of three can be guaranteed in distributed systems.

Availability Metrics:
- Measured in percentages (e.g., 99.9% allows for 8.76 hours of downtime per year).
SLAs & SLOs:
- SLAs: formal contracts with customers.
- SLOs: internal performance goals.
Resilience Measures:
- Redundant systems, graceful degradation, reliability, and fault tolerance.
Performance Metrics:
- Throughput: amount of data handled over time (RPS, QPS, BPS).
- Latency: time to handle a single request.

IP Addresses:
- IPv4 vs. IPv6 for unique device identification.
Data Communication:
- Governed by Internet Protocol, includes TCP (reliable) and UDP (faster, less reliable).
DNS (Domain Name System):
- Translates domain names to IP addresses.
Proxy Servers:
- Forward and reverse proxies serve different purposes like caching and load balancing.

HTTP:
- Request-response protocol, stateless.
- HTTP methods: GET, POST, PUT, PATCH, DELETE.
WebSockets:
- Allows real-time, two-way communication.
Email Protocols:
- SMTP for sending, IMAP/POP3 for retrieving emails.
File Transfer Protocols:
- FTP for transferring files, SSH for secure remote operations.
RPC (Remote Procedure Call):
- Invokes code execution on remote servers.

CRUD Operations:
- Define inputs and outputs for Create, Read, Update, and Delete actions.
Communication Protocols:
- REST, GraphQL, gRPC for data transport.
Versioning:
- Maintaining backward compatibility when modifying endpoints.
Rate Limiting & CORS:
- Prevents abuse and defines access to APIs.

Caching Techniques:
- Browser, server-side, database caching to improve speed and efficiency.
CDNs (Content Delivery Networks):
- Distribute static content closer to users to reduce latency.

Purpose:
- Distributes traffic to prevent server overload.
Algorithms:
- Round Robin, Least Connections, IP Hashing, Weighted Algorithms.
Health Checks:
- Ensures traffic only directed to responsive servers.
Redundancy:
- Implementing multiple load balancers to avoid single points of failure.

Types of Databases:
- Relational Databases (SQL): ACID compliant; examples include Postgres, MySQL.
- NoSQL Databases: Flexible schema; examples include MongoDB, Redis.
- In-Memory Databases: Fast access for caching; e.g., Redis.
Scaling Databases:
- Vertical Scaling: Enhancing a single server.
- Horizontal Scaling: Distributing data across multiple servers (sharding, replication).
Performance Techniques:
- Caching, indexing, query optimization.

Importance of designing robust systems with a focus on scalability, reliability, and efficient data handling.
Understanding trade-offs based on specific use cases is crucial for successful system design.