
10 - Caching Strategies

Why Cache?

Database queries are expensive. Caching stores frequently accessed data in faster storage (memory) to reduce database load and improve response times.


┌─────────────────────────────────────────────────────────────┐
│              Without Cache                                   │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  Client ──► App ──► Database ──► Disk I/O ──► CPU/Memory    │
│         50ms      200ms       10ms         50ms              │
│                    ▲                                        │
│                    │                                        │
│         Every request hits database                         │
│         Even for same query!                                │
│                                                              │
│  Total: ~310ms per request                                 │
│                                                              │
└─────────────────────────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────┐
│              With Cache                                      │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  Client ──► App ──► Cache (Redis/Memcached)                 │
│         50ms      1-5ms                                    │
│                    │                                        │
│                    │ Cache Miss                              │
│                    ▼                                        │
│              Database (only on miss)                        │
│                                                              │
│  Cache Hit: ~55ms (95% of requests)                         │
│  Cache Miss: ~310ms (5% of requests)                        │
│  Effective average: ~68ms                                   │
│                                                              │
│  Plus: Database load reduced by 95%                         │
│                                                              │
└─────────────────────────────────────────────────────────────┘
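The read path in the diagram (check the cache, go to the database only on a miss) is often called cache-aside. The sketch below is a minimal illustration under stated assumptions: a plain in-process dict stands in for Redis/Memcached, and `CacheAside`, `loader`, and `effective_latency_ms` are hypothetical names, not part of any library. The helper at the end reproduces the diagram's weighted average.

```python
import time
from typing import Callable, Dict, Tuple


class CacheAside:
    """Check the cache first; call the loader (the "database") only on a miss."""

    def __init__(
        self,
        loader: Callable[[str], str],
        ttl_seconds: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._loader = loader          # hypothetical stand-in for a database query
        self._ttl = ttl_seconds
        self._clock = clock
        self._store: Dict[str, Tuple[float, str]] = {}  # key -> (expires_at, value)
        self.hits = 0
        self.misses = 0

    def get(self, key: str) -> str:
        now = self._clock()
        entry = self._store.get(key)
        if entry is not None and entry[0] > now:
            self.hits += 1             # cache hit: no database work
            return entry[1]
        self.misses += 1               # cache miss: load, then populate the cache
        value = self._loader(key)
        self._store[key] = (now + self._ttl, value)
        return value


def effective_latency_ms(hit_rate: float, hit_ms: float, miss_ms: float) -> float:
    """Expected latency as the hit/miss weighted average."""
    return hit_rate * hit_ms + (1.0 - hit_rate) * miss_ms


# Figures from the diagram: 95% hits at ~55ms, 5% misses at ~310ms.
print(round(effective_latency_ms(0.95, 55, 310), 2))  # 67.75
```

The same formula shows how sensitive the average is to the hit rate: at 80% hits the expected latency with these figures rises to 0.80 × 55 + 0.20 × 310 = 106ms.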


import uuid

import redis

# decode_responses=True returns str instead of bytes, which the
# lock-ownership comparison in release_lock() relies on.
r = redis.Redis(decode_responses=True)

# Names such as user, user_id, user_ids, score, follower_id, user_a and
# user_b are assumed to be defined by the caller.

# 1. Hash for structured data
# Store user object as hash instead of JSON string
r.hset(f"user:{user_id}", mapping={
    "name": user.name,
    "email": user.email,
    "age": user.age,
})

# Get single field (efficient!)
email = r.hget(f"user:{user_id}", "email")

# Get all fields
user_data = r.hgetall(f"user:{user_id}")

# 2. Pipeline for batch operations
pipe = r.pipeline()
for user_id in user_ids:
    pipe.get(f"user:{user_id}")
results = pipe.execute()  # Single round-trip

# 3. Sorted Sets for leaderboards/rankings
# Add score
r.zadd("leaderboard", {user_id: score})

# Get top 10
r.zrevrange("leaderboard", 0, 9, withscores=True)

# Get rank of user (0-based, highest score first)
rank = r.zrevrank("leaderboard", user_id)

# 4. Sets for relationships
# Followers
r.sadd(f"followers:{user_id}", follower_id)
r.srem(f"followers:{user_id}", follower_id)
followers = r.smembers(f"followers:{user_id}")

# Check if following
is_following = r.sismember(f"followers:{user_id}", follower_id)

# Common followers (intersection)
common = r.sinter(f"followers:{user_a}", f"followers:{user_b}")

# 5. Rate Limiting (fixed-window counter)
# Note: this counts requests per window; a true sliding window needs
# per-request timestamps.
def check_rate_limit(user_id, max_requests=100, window=60):
    key = f"rate_limit:{user_id}"
    # INCR is atomic, so concurrent requests cannot slip past the limit
    # between a separate read and write.
    count = r.incr(key)
    if count == 1:
        # Start the window on the first request only; refreshing the TTL
        # on every request would keep the counter alive indefinitely.
        r.expire(key, window)
    return count <= max_requests

# 6. Distributed Lock
def acquire_lock(lock_name, acquire_time=10):
    identifier = str(uuid.uuid4())
    lock_key = f"lock:{lock_name}"
    # NX = Only if Not eXists; EX = expire so a crashed holder cannot block forever
    if r.set(lock_key, identifier, nx=True, ex=acquire_time):
        return identifier
    return None

def release_lock(lock_name, identifier):
    lock_key = f"lock:{lock_name}"
    # Only release if we own it
    with r.pipeline() as pipe:
        while True:
            try:
                # WATCH aborts the transaction if the key changes before EXEC
                pipe.watch(lock_key)
                if pipe.get(lock_key) == identifier:
                    pipe.multi()
                    pipe.delete(lock_key)
                    pipe.execute()
                    return True
                pipe.unwatch()
                break
            except redis.WatchError:
                # Key changed between WATCH and EXEC; retry the check
                continue
    return False
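The rate limiter above keeps a single counter per key with a TTL. A sliding-window log takes a different approach: it records a timestamp per request and counts only those inside the last `window` seconds. The sketch below illustrates that idea in process memory so it runs without a Redis server; `SlidingWindowLimiter` and its `clock` parameter are hypothetical names introduced for illustration. With Redis, the same log is commonly kept in a sorted set scored by timestamp (ZADD, ZREMRANGEBYSCORE, ZCARD).

```python
import time
from collections import defaultdict, deque
from typing import Callable, Deque, Dict


class SlidingWindowLimiter:
    """Allow at most max_requests per user within any trailing window."""

    def __init__(
        self,
        max_requests: int = 100,
        window: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.max_requests = max_requests
        self.window = window
        self._clock = clock  # injectable so the behaviour can be tested
        self._log: Dict[str, Deque[float]] = defaultdict(deque)

    def allow(self, user_id: str) -> bool:
        now = self._clock()
        stamps = self._log[user_id]
        # Drop timestamps that have slid out of the trailing window.
        while stamps and stamps[0] <= now - self.window:
            stamps.popleft()
        if len(stamps) >= self.max_requests:
            return False  # rejected requests are not recorded
        stamps.append(now)
        return True


limiter = SlidingWindowLimiter(max_requests=100, window=60.0)
if limiter.allow("user-42"):
    pass  # handle the request
```

The trade-off is memory: the log stores one timestamp per accepted request, whereas a fixed-window counter stores a single integer per key.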