Build Resilient Microservices Using Spring Retry and Circuit Breaker Pattern

Image for post
Image for post

“Everything fails all the time” — Werner Vogels

This is sad but true, everything fails specially in Microservice architecture with many external dependencies. Modern applications have tens of microservices which communicate with each other over REST. Anytime any microservice may go down causing entire operation to fail. At a broad level we can classify these failures in two categories

Transient — where application will heal itself in a matter of seconds such as network glitch.

Non-Transient — where application suffer for a longer period, minutes or hours such as database connection, unavailability due to high traffic or throttling limit.

To improve the resilience of our microservice architecture we should consider following two patterns.

  1. Retry
  2. Circuit Breaker

For transient failures, we don’t want to fail the request immediately rather would prefer to retry few times. There may a temporary network glitch and next attempt may be successful. While implementing Retry Pattern you should be careful how many retries you want. May be you can limit to 3 retries for each REST call as an example. But if the failure is not transient and you keep on doing 3 retries for each REST call, pretty soon you will make further damage to the microservice which is already suffering. You should stop sending further request to the service after certain number of failures and resume sending requests after a while. You are right, I am talking about Circuit Breaker Pattern.

I have been after this for a while and recently implemented these two patterns in Spring boot microservice using Spring-Retry.

Concept is very simple, microservice A will make REST call to microservice B. If it fails, it will automatically retry 3 times. If Service B is still unable to process, fallback method will be called. After certain number of fallback method is execute in a given time frame, circuit will be opened. As a result Service A will not make any further REST call to Service B, reducing cloud resource usage, network bandwidth usage. Once reset time is over, circuit will be closed automatically allowing REST calls to Service B again.

Image for post
Image for post
Image for post
Image for post

Above log indicates for each request, our service retried 3 times (“called ShakyExternalService api/customer/name”) before executing the fallback method ( “returning name from fallback method”). Once fallback method is called 3 times in a period of 15 seconds, circuit was opened and further request to the api was served directly from fallback without trying to make API call. In the log you can see last 3 lines where fallback was executed directly.

You can checkout the source code in Github. Once you have cloned the repository issue below commands to build and start the microservice

gradle clean buildgradle bootRun

Once your app is booted, test the API by using CURL

curl http://localhost:8080/client/customer/name

Spring Boot Code

package com.example.demo;import java.util.Random;import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.context.annotation.Bean;
import org.springframework.core.ParameterizedTypeReference;
import org.springframework.http.HttpMethod;
import org.springframework.http.ResponseEntity;
import org.springframework.retry.RecoveryCallback;
import org.springframework.retry.RetryCallback;
import org.springframework.retry.RetryContext;
import org.springframework.retry.annotation.CircuitBreaker;
import org.springframework.retry.annotation.EnableRetry;
import org.springframework.retry.annotation.Recover;
import org.springframework.retry.backoff.FixedBackOffPolicy;
import org.springframework.retry.policy.SimpleRetryPolicy;
import org.springframework.stereotype.Service;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RequestMapping;
import org.springframework.web.bind.annotation.RestController;
import org.springframework.web.client.RestTemplate;
public class SpringRetryCircuitBreakerApplication {
public static void main(String[] args) {, args);
public RetryTemplate retryTemplate() {
RetryTemplate retryTemplate = new RetryTemplate();
FixedBackOffPolicy fixedBackOffPolicy = new FixedBackOffPolicy();
SimpleRetryPolicy retryPolicy = new SimpleRetryPolicy();
return retryTemplate;
class RobustClientController {
private RobustService robustService;
public String customerName() throws RuntimeException {
try {
//return robustService.getExternalCustomerName();
return robustService.resilientCustomerName();
} catch (RuntimeException e) {
throw new RuntimeException("ShakyExternalService is down");
class RobustService {
private static Logger logger = LoggerFactory.getLogger(RobustService.class);

@Autowired RetryTemplate retryTemplate;

public String getExternalCustomerName() {
ResponseEntity<String> exchange = new RestTemplate().exchange("http://localhost:8080/api/customer/name",
HttpMethod.GET, null, new ParameterizedTypeReference<String>() {
return exchange.getBody();

@CircuitBreaker(maxAttempts=3,openTimeout=15000l, resetTimeout=30000l)
public String resilientCustomerName() {
return retryTemplate.execute(new RetryCallback<String, RuntimeException>() {
public String doWithRetry(RetryContext context) {"Retry count %d", context.getRetryCount()));
return getExternalCustomerName();


public String fallback(Throwable e) {"returning name from fallback method");
return "Mini";

class ShakyExternalService {
private static Logger logger = LoggerFactory.getLogger(ShakyExternalService.class);@GetMapping("/customer/name")
public String customerName() {"called ShakyExternalService api/customer/name");
Random randomGenerator = new Random();
int randomInt = randomGenerator.nextInt(2);
if (randomInt < 2) {
throw new ShakyServiceException("Service is unavailable");
return "Mickey";
class ShakyServiceException extends RuntimeException {

public ShakyServiceException(String message) {
public synchronized Throwable fillInStackTrace() {
return this;

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store