Rob Zienert
1 min readMar 24, 2018

--

Bobby Tables, that’s right, Elasticache would be a much better option. At Netflix, we don’t treat any of the Redis instances as something we can just lose, but we do lose them & can recover: Every Redis has a replica that we can promote to master as well as S3 backups. Here’s a couple things to keep in mind on Redis usage:

  • Igor: Losing the index will re-notify all builds (there are guards against mass notifications, though).
  • Clouddriver: Losing Redis will cause any in-flight atomic operations to fail. Atomic operations typically don’t last longer than 30–60s. Everything else (the cache) will rebuild.
  • Orca: All running executions will fail and become orphaned. Execution history will be lost.
  • Gate: Users will need to login again.
  • Echo: We don’t yet use Redis for Echo (but will when we adopt Pubsub triggers internally). From the sounds of it, losing Redis will cause pubsub events may cause redelivery of events.
  • Front50: Never use Redis for Front50. :)

I’m not particularly keen on how much we use Redis. It’s great to develop against, but has some operational challenges around HA.

--

--