JaM2in - Medium

Kubernetes 환경에서 ARCUS 캐시 사용하기

Namjae Kim — Tue, 23 Apr 2024 12:06:26 GMT

이전 포스팅에서는 Docker 컨테이너를 활용해서 하나의 장비에 ARCUS 캐시 클러스터를 구성해 보았습니다. 이번 포스팅에서는 쿠버네티스(Kubernetes)를 활용하여 다수의 호스트 장비에 ARCUS 캐시 클러스터를 구성하는 방법을 소개하겠습니다.

Kubernetes

쿠버네티스(k8s 또는 kube라고도 함)는 컨테이너화된 워크로드와 서비스를 관리하기 위한 오픈소스 플랫폼입니다. 여러 호스트 장비를 하나의 쿠버네티스 클러스터로 묶어 관리할 수 있고, 이렇게 구성된 쿠버네티스 클러스터에 서비스를 배포하면 적절한 노드(호스트 장비)에 컨테이너를 분산해서 배치하거나, 문제가 생긴 컨테이너를 교체하거나, 컨테이너 동작에 필요한 설정 관리 등의 동작을 수행합니다.

ARCUS 캐시 클러스터는 가용성과 확장성을 위해 다수의 캐시 노드를 여러 장비에 분산 배치하기 때문에 운영자는 다수의 장비에 프로세스를 적절히 구동하고 설정/관리해 주어야 합니다. 쿠버네티스 환경에서 운영자가 ARCUS 캐시 노드의 수와 분산 배치 전략을 명시하면 쿠버네티스가 그에 맞게 자동으로 분산 배치된 캐시 노드를 구동해주므로 서비스 운영에 드는 노력을 효과적으로 절감할 수 있습니다.

본 포스팅에서는 다수의 장비에 쿠버네티스 클러스터를 구성하는 대신 minikube를 활용하여 가상의 worker node를 구성하고 실습을 진행해 보겠습니다. 만약 이미 구성된 쿠버네티스 클러스터에서 실습을 진행하려는 경우 minikube 파트는 건너뛰어도 됩니다. 원활한 실습을 위해서는 컨트롤 플레인 노드(마스터 노드) 외에 3개 이상의 워커 노드가 있어야 한다는 점을 유의하세요.

minikube는 개발 및 학습이 주 목적이기 때문에, 컨트롤 플레인 노드에도 리소스가 스케줄되도록 기본 설정되어 있으니 참고 바랍니다.

Minikube

실제 운영 환경에서는 다수의 호스트 장비에 걸친 쿠버네티스 클러스터를 구성하고, 각 리소스는 여러 노드에 분산 배치됩니다. 만약 로컬 장비에서 간단한 실습을 진행하려는 경우에는 minikube를 사용하여 다수의 가상 장비를 기반으로 하는 쿠버네티스 클러스터를 구성할 수 있습니다. 실습 환경은 Linux이므로 아래와 같은 명령으로 minikube를 설치할 수 있고, 그 외 환경에서의 설치 방법은 minikube Get Started 문서를 참고해 주세요.

curl -LO https://storage.googleapis.com/minikube/releases/latest/minikube-linux-amd64
sudo install minikube-linux-amd64 /usr/local/bin/minikube && rm minikube-linux-amd64

설치가 완료된 후에는 minikube start 명령을 통해 minikube 클러스터를 시작할 수 있습니다. 본 포스팅에서는 ARCUS의 각 구성 요소가 분산 배치되는 것을 확인하기 위해, 총 4개 노드로 구성된 minikube 클러스터를 사용하겠습니다.

$ minikube start --nodes=4

구성이 완료된 뒤 kubectl get nodes 명령을 사용하면 1개의 control-plane node와 3개의 worker node를 확인하실 수 있습니다. control plane node는 쿠버네티스 클러스터를 관리하는 노드이고, worker node는 사용자가 배포한 서비스를 실행하기 위한 노드입니다.

$ kubectl get nodes
NAME           STATUS   ROLES           AGE   VERSION
minikube       Ready    control-plane   12m   v1.28.3
minikube-m02   Ready              11m   v1.28.3
minikube-m03   Ready              11m   v1.28.3
minikube-m04   Ready              10m   v1.28.3

ZooKeeper ensemble 구성

쿠버네티스 클러스터를 준비한 뒤에는 쿠버네티스 환경에서 ARCUS 캐시의 메타 정보를 관리하는 ZooKeeper ensemble을 먼저 구축합니다. 쿠버네티스 튜토리얼인 Running ZooKeeper, A Distributed System Coordinator에서 제공하는 manifest 파일을 사용하여 3개 서버로 구성되고 각각은 서로 다른 워커 노드에 배치되는 ZooKeeper ensemble을 구축합니다. 이와 같은 ZooKeeper 서버의 분산 배치를 위하여 쿠버네티스 클러스터에는 3개 이상의 워커 노드가 존재해야 합니다.

$ kubectl apply -f https://k8s.io/examples/application/zookeeper/zookeeper.yaml
service/zk-hs created
service/zk-cs created
poddisruptionbudget.policy/zk-pdb created
statefulset.apps/zk created

manifest 파일에 의해 생성되는 리소스를 살펴보면 다음과 같습니다.

zk-hs: ZooKeeper 서버 간 통신을 위한 service
zk-cs: 클라이언트 요청을 처리하기 위한 service
zk-pdb: 동시에 여러 서버가 Unavailable 상태가 되는 것을 방지하는 PodDisruptionBudget
zk: Pod를 관리하는 Statefulset

모든 리소스가 구동되고 나면 kubectl get pods 명령으로 상태를 확인할 수 있습니다. 정상 구동되었다면 3개 Pod가 Ready(1/1), Running 상태일 것입니다. 만약 그 외의 상태에서 장시간 진행되지 않는 Pod가 있다면, kubectl describe pod 명령을 통해 문제 원인을 파악하고 조치를 취해야 합니다.

$ kubectl get pods -o wide
NAME   READY   STATUS    RESTARTS   AGE     IP           NODE           NOMINATED NODE   READINESS GATES
zk-0   1/1     Running   0          2m43s   10.244.0.6   minikube                  
zk-1   1/1     Running   0          2m23s   10.244.3.6   minikube-m04              
zk-2   1/1     Running   0          2m1s    10.244.1.5   minikube-m02

ARCUS cluster 구성

ZooKeeper ensemble이 정상 동작하는 상태에서, 다음과 같은 yaml 파일을 적용하여 현재 namespace에 3개 노드로 구성된 ARCUS 캐시 클러스터를 구성할 수 있습니다. 아래 파일은 default namespace를 기준으로 작성되었으며, 다른 namespace를 사용하는 경우 zk conn str과 server fqdn의 default.svc 부분을 .svc와 같이 변경해 주어야 합니다.

apiVersion: v1
kind: Service
metadata:
  name: arcus-mc
  labels:
    app: arcus-memcached
    service-code: test
spec:
  ports:
  - port: 11211
    name: arcus-mc
  selector:
    app: arcus-memcached
    service-code: test
---
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: arcus-mc
spec:
  selector:
    matchLabels:
      app: arcus-memcached
      service-code: test
  serviceName: arcus-mc
  replicas: 3
  template:
    metadata:
      name: arcus-memcached
      labels:
        app: arcus-memcached
        service-code: test
    spec:
      affinity:
        podAntiAffinity:
          preferredDuringSchedulingIgnoredDuringExecution:
          - weight: 100
            podAffinityTerm:
              labelSelector:
                matchExpressions:
                - key: "service-code"
                  operator: In
                  values:
                  - test
              topologyKey: "kubernetes.io/hostname"
      containers:
      - name: memcached
        image: jam2in/arcus-memcached:1.13.5
        args:
        - "-v"
        - "-m"
        - "100"
        - "-z"
        - "zk-cs.default.svc.cluster.local:2181" # zk conn str
      initContainers:
      - name: arcus-tool
        image: jam2in/zkcli:python
        args:
          - "arcus.memcached"
          - "add"
          - "zk-cs.default.svc.cluster.local:2181" # zk conn str
          - "$(POD_NAME).mc.default.svc.cluster.local:11211" # server fqdn
          - "test" # service-code
        env:
        - name: POD_NAME
          valueFrom:
            fieldRef:
              fieldPath: metadata.name

주요 설정은 다음과 같습니다.

replicas: 구동하려는 Pod 수 입니다.
podAntiAffinity: 각 Pod가 가능한 서로 다른 장비에 배치되도록 합니다.
initContainer: arcus-memcached가 실행되기 전에 ZooKeeper에 Znode를 생성하는 작업을 수행합니다.

ZooKeeper 프로세스를 PM 또는 VM 환경에 직접 구동하고 ARCUS 캐시 서버만 Kubernetes 환경에서 동작시키는 형태의 구성도 가능합니다. 이렇게 구성하려면 yaml 내용 중 zk conn str 부분에 해당 ZooKeeper 주소를 입력하면 됩니다.

이렇게 yaml 파일을 작성한 뒤에는 kubectl apply 명령으로 현재 Kubernetes context에 적용할 수 있습니다.

$ kubectl apply -f 
service/mc created
statefulset.apps/mc created

ZooKeeper와 마찬가지로, 각 Pod가 모두 구동되고 나면 kubectl get pods 명령으로 상태를 확인할 수 있습니다.

$ kubectl get pods -o wide
NAME   READY   STATUS    RESTARTS   AGE     IP           NODE           NOMINATED NODE   READINESS GATES
mc-0   1/1     Running   0          2m36s   10.244.2.5   minikube-m03              
mc-1   1/1     Running   0          2m33s   10.244.1.6   minikube-m02              
mc-2   1/1     Running   0          69s     10.244.3.7   minikube-m04              
zk-0   1/1     Running   0          23m     10.244.0.6   minikube                  
zk-1   1/1     Running   0          23m     10.244.3.6   minikube-m04              
zk-2   1/1     Running   0          23m     10.244.1.5   minikube-m02

busybox

이제 총 3개 Pod로 구성된 ARCUS 캐시 클러스터가 구동되었습니다. 이번에는 각 프로세스가 잘 동작하고 있는지 확인해 보겠습니다. Kubernetes 환경에서 실행 중인 서비스 디버깅을 위해서 busybox를 사용할 수 있습니다. 자세한 내용은 Debug Services 문서를 참고 바랍니다.

kubectl run -it --rm --restart=Never busybox --image=gcr.io/google-containers/busybox sh

busybox 컨테이너에서 nc, telnet등의 명령을 통해 각 프로세스에 명령을 전송하고 응답을 확인할 수 있습니다. 아래 예시에서 IP 주소 대신 Pod의 FQDN을 사용해도 됩니다.

/ # echo srvr | nc 10.244.0.6 2181
Zookeeper version: 3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f, built on 03/23/2017 10:13 GMT
Latency min/avg/max: 0/0/39
Received: 150651
Sent: 150650
Connections: 2
Outstanding: 0
Zxid: 0x200000025
Mode: follower
Node count: 23

/ # echo version | nc 10.244.2.5 11211
VERSION 1.13.5

마치며

지금까지 쿠버네티스 환경에서 ARCUS 캐시 클러스터를 구동하는 방법에 대해 간단히 알아보았습니다. 쿠버네티스는 컨테이너를 활용한 배포 시 사실상 표준(de facto)으로 사용되고 있습니다. 본 포스팅에서 안내드린 내용만으로도 기본적인 클러스터 구성 및 사용이 가능하지만, ARCUS 서버 재구동 없이 설정을 변경하거나 다양한 이벤트 발생 시 유연한 처리가 어렵습니다. ARCUS Enterprise Edition을 구독한 고객에게는 최상의 안정성과 고가용성을 보장하기 위해 ARCUS Operator를 제공하고 있습니다. 본 포스트 또는 ARCUS Operator에 대한 문의 사항이 있다면 contact@jam2in.com 으로 연락 바랍니다.

Kubernetes 환경에서 ARCUS 캐시 사용하기 was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

Docker 컨테이너 환경에서 ARCUS 캐시 사용하기

Imoliviarla — Thu, 29 Feb 2024 11:46:09 GMT

ARCUS 캐시를 직접 장비에 설치하여 사용하기 위해서는 의존성을 설치하고 컴파일한 후 실행해야 하는 복잡함이 존재합니다. 이번 포스팅에서는 ARCUS를 간편하게 Docker 환경에서 사용하는 방법을 살펴봅니다. Docker에 대한 개념을 먼저 간단히 설명 드리고, 단일 캐시 노드를 구동하는 방법부터 Docker Compose를 활용해 캐시 클러스터를 구동하는 방법까지 차근차근 나아가 보겠습니다.

Docker란?

Docker는 컨테이너 기술을 이용해 리눅스 프로세스를 격리된 환경에서 실행하는 가상화 플랫폼입니다. 여러 개의 프로세스를 동일한 환경에서 실행할 수 있다는 점에서 가상머신과 비슷하지만, Guest OS를 요구하지 않아 가상머신보다 성능 상 이점이 큽니다. 또한 Docker Hub를 통해 이미지를 공유하면 누구나 접근할 수 있고, Docker가 설치되어 있다면 바로 로컬 환경에서 사용해 볼 수 있습니다. Docker Compose 기능을 활용하면 여러 Docker 컨테이너를 하나의 어플리케이션으로 동작할 수 있도록 묶어 한 번에 띄울 수 있기 때문에 클러스터링에 용이합니다.

이번 포스팅에서는 Docker Hub에서 잼투인이 제공하는 arcus-memcached 이미지를 사용해 ARCUS 캐시 서버를 간편하게 구동하고 동작을 확인해보겠습니다. 그리고 Docker Compose를 사용하여 캐시 서버 3개로 구성된 ARCUS 캐시 클러스터를 구동해 보겠습니다.

단일 ARCUS 캐시 서버 실행

먼저 arcus-memcached 이미지를 사용해 하나의 ARCUS 캐시 서버를 실행해보겠습니다. 사용한 Docker 버전은 24.0.5 입니다.

1) Docker 이미지 가져오기

docker pull명령어를 사용하면 Docker Hub로부터 arcus-memcached 이미지를 pull 할 수 있습니다. 참고로 Docker 이미지를 가져오지 않은 상태여도 다음 단계의 컨테이너 실행 명령어를 수행할 때 자동으로 pull을 하기 때문에 이 과정은 생략해도 됩니다.

docker pull jam2in/arcus-memcached

2) Docker container 실행하기

docker run 명령어를 이용해 arcus-memcached 이미지를 기반으로 컨테이너를 생성하고 실행합니다. 아래 명령어를 실행하면 11211번 포트를 listen하는 ARCUS 캐시 서버 한 개가 컨테이너 상에서 실행됩니다.

docker run --name arcus_memcached -p 11211:11211 -d jam2in/arcus-memcached

컨테이너를 실행할 때 docker run 명령의 실행 옵션과 ARCUS 캐시 구동 옵션을 사용할 수 있습니다. docker run 명령에 사용된 옵션을 간단히 설명드리겠습니다.

name: 실행할 컨테이너의 이름을 지정합니다. 값을 주지 않을 경우 무작위로 이름이 부여됩니다.
p: host와 컨테이너의 port를 매핑하는 옵션입니다. host-port:container-port 형식으로 작성합니다.
d: 옵션을 부여하면 백그라운드에서 실행합니다.
자세한 docker run 옵션은 링크를 참고 바랍니다.

ARCUS 캐시의 구동 옵션은 아래와 같이 이미지 이름 다음에 지정해야 하며, 별도로 지정하지 않으면 기본값이 사용됩니다.

docker run --name arcus_memcached -p 11212:11212 -d jam2in/arcus-memcached -p 11212 -m 100 -c 100 -v

ARCUS 캐시의 구동 옵션을 간단히 설명드리겠습니다.

p: listen할 port를 지정합니다. 이 옵션을 지정한 후, 앞서 docker run 옵션에서 명시한 container port도 동일하게 설정해주어야 합니다.
m: 캐시에서 사용할 최대 메모리(MB)를 지정합니다.
c: 캐시 서버에 연결 가능한 최대 커넥션 개수를 지정할 수 있습니다.
v: 캐시 서버 로그를 확인하기 위한 옵션입니다. 이 옵션을 지정하지 않으면 WARN 레벨의 로그만 볼 수 있으며, -v 지정 시 INFO 레벨의 로그를, -vv 지정 시 DEBUG 레벨의 로그를, -vvv 지정 시 DETAIL 레벨의 로그를 확인할 수 있습니다.
z: 캐시 클러스터로 구동 시 필요한 옵션으로, 캐시 서버에서 연결할 zookeeper 앙상블의 주소를 입력합니다.
자세한 ARCUS 캐시 구동 옵션 설명과 기본값은 링크를 참고바랍니다.

3) ARCUS 동작 확인

ARCUS 캐시 서버를 컨테이너를 실행한 후에는 nc 명령어를 이용해 정상적으로 구동되었는지 확인할 수 있습니다.

echo stats | nc localhost 11211

컨테이너가 정상적으로 실행되었다면 다음과 같은 응답을 받을 수 있습니다.

STAT pid 1
STAT uptime 606
STAT time 1701941770
STAT version 1.13.4
STAT libevent 2.1.12-stable
STAT pointer_size 64
STAT hb_count 201
STAT hb_latency 339
....

Mac Sonoma OS 등 nc 명령어가 동작하지 않는 환경이 있다면, telnet 명령어를 사용해 확인해주세요.

4) ARCUS 캐시 중지

구동중인 ARCUS 캐시 서버를 중지하고자 한다면, 아래와 같이 docker stop container-name명령어를 사용해 컨테이너를 중지해야 합니다.

docker stop arcus_memcached

ARCUS 캐시 클러스터 구성 및 실행

ARCUS는 ZooKeeper를 이용한 클러스터링 방식을 지원합니다. 이번에는 아래와 같이 ZooKeeper 서버 3개로 구성된 앙상블과 ARCUS 캐시 서버 3개로 구성된 클러스터를 Docker Compose를 이용하여 단일 장비에서 구동해보겠습니다.

service code: test
+--------------------+   +--------------------+   +--------------------+
| zoo1               |   | zoo2               |   | zoo3               |
+--------------------+   +--------------------+   +--------------------+
|                    |   |                    |   |                    |
|container-port:2181 |   |container-port:2181 |   |container-port:2181 |
|host-port:2181      |   |host-port:2182      |   |host-port:2183      |
|                    |   |                    |   |                    |
+--------------------+   +--------------------+   +--------------------+
+--------------------+   +--------------------+   +--------------------+
| cache1             |   | cache2             |   | cache3             |
+--------------------+   +--------------------+   +--------------------+
|                    |   |                    |   |                    |
|container-port:11211|   |container-port:11211|   |container-port:11211|
|host-port:11211     |   |host-port:11212     |   |host-port:11213     |
|                    |   |                    |   |                    |
+--------------------+   +--------------------+   +--------------------+

1) docker-compose.yml 파일 작성

아래와 같은 과정을 거쳐 Docker Compose로 ARCUS 캐시 클러스터를 구동할 수 있습니다.

3개의 host로 구성된 Zookeeper 컨테이너를 띄워 앙상블을 구성합니다.
ZkCli 컨테이너를 구동하여 ARCUS 캐시 서버가 Zookeeper에 등록될 수 있도록 ZNode를 생성합니다. 생성이 완료되면 컨테이너는 종료됩니다.
3개의 ARCUS 캐시 컨테이너를 띄워 캐시 서버가 클러스터 형태로 실행되도록 합니다.

다음은 위 컨테이너들을 구동하는 docker-compose.yml 파일입니다.

version: "3.1"

services:
  zoo1:
    image: zookeeper:3.5.9
    hostname: zoo1
    ports:
    - 2181:2181
    environment:
      ZOO_MY_ID: 1
      ZOO_SERVERS: server.1=zoo1:2888:3888;2181 server.2=zoo2:2888:3888;2181 server.3=zoo3:2888:3888;2181
    restart: always

  zoo2:
    image: zookeeper:3.5.9
    hostname: zoo2
    ports:
    - 2182:2181
    environment:
      ZOO_MY_ID: 2
      ZOO_SERVERS: server.1=zoo1:2888:3888;2181 server.2=zoo2:2888:3888;2181 server.3=zoo3:2888:3888;2181
    restart: always

  zoo3:
    image: zookeeper:3.5.9
    hostname: zoo3
    ports:
    - 2183:2181
    environment:
      ZOO_MY_ID: 3
      ZOO_SERVERS: server.1=zoo1:2888:3888;2181 server.2=zoo2:2888:3888;2181 server.3=zoo3:2888:3888;2181
    restart: always

  register:
    depends_on:
    - zoo1
    - zoo2
    - zoo3
    image: jam2in/zkcli:3.5.9
    environment:
      ZK_ENSEMBLE: zoo1:2181,zoo2:2181,zoo3:2181
      SERVICE_CODE: test
      CACHENODES: cache1:11211,cache2:11212,cache3:11213
    restart: on-failure

  cache1:
    depends_on:
      register:
        condition: service_completed_successfully
    image: jam2in/arcus-memcached
    command: -m 100 -p 11211 -z zoo1:2181,zoo2:2181,zoo3:2181
    hostname: cache1
    ports:
    - 11211:11211
    environment:
      ARCUS_CACHE_PUBLIC_IP: 127.0.0.1
    restart: always

  cache2:
    depends_on:
      register:
        condition: service_completed_successfully
    image: jam2in/arcus-memcached
    command: -m 100 -p 11212 -z zoo1:2181,zoo2:2181,zoo3:2181
    hostname: cache2
    ports:
    - 11212:11212
    environment:
      ARCUS_CACHE_PUBLIC_IP: 127.0.0.1
    restart: always

  cache3:
    depends_on:
      register:
        condition: service_completed_successfully
    image: jam2in/arcus-memcached
    command: -m 100 -p 11213 -z zoo1:2181,zoo2:2181,zoo3:2181
    hostname: cache3
    ports:
    - 11213:11213
    environment:
      ARCUS_CACHE_PUBLIC_IP: 127.0.0.1
    restart: always

yml 파일에 작성한 옵션들 중 중요한 옵션만 간단히 살펴보겠습니다. 먼저 ZooKeeper 컨테이너 구동 시 사용한 옵션은 아래와 같습니다.

ZOO_MY_ID: ZooKeeper 서버의 myid 값을 지정합니다. 각 컨테이너마다 다르게 설정해주어야 하나의 앙상블을 구성할 수 있습니다. ID는 앙상블 내에서 고유해야 하며 1에서 255 사이의 값을 가져야 합니다.
ZOO_SERVERS: ZooKeeper 앙상블 구성을 위한 ZooKeeper 주소 목록을 server.myid=host:server-port:election-port;client-port 형태로 입력합니다.

다음으로 zkcli 컨테이너 구동 시 사용한 옵션은 아래와 같습니다. 참고로 zkcli 컨테이너는 ZooKeeper 앙상블에 접속하여 캐시 클러스터 정보를 ZNode로 저장하는 역할을 합니다. 이 정보는 클러스터 운영 시 사용됩니다.

depends_on: ZooKeeper 컨테이너들이 모두 구동되어 앙상블을 형성한 후에 zkcli 컨테이너가 시작되도록 합니다.
ZK_ENSEMBLE: 구성된 ZooKeeper 앙상블 주소를 입력하여 zkcli 컨테이너가 ZNode를 생성할 수 있도록 합니다.
SERVICE_CODE: ARCUS 캐시 클러스터의 서비스 코드를 지정합니다.
CACHENODES: ARCUS 캐시 클러스터의 모든 캐시 서버 주소를 입력하여 ZooKeeper 앙상블에서 관리되도록 합니다.

마지막으로 ARCUS 캐시 컨테이너 구동 시 사용한 옵션은 아래와 같습니다.

command: 단일 캐시 서버 구동 부분에서 설명드린 캐시 구동 옵션을 지정합니다. 여기서는 memory는 최대 100MB 사용하도록 하고, port는 각 서버마다 11211부터 11213까지 사용하고, -z 옵션으로 zookeeper 앙상블과 연결을 맺도록 했습니다.
ARCUS_CACHE_PUBLIC_IP: 도커 컨테이너 내부가 아닌 외부 환경에서 ZooKeeper 앙상블로부터 ARCUS 캐시 서버에 접근할 수 있도록 IP를 입력해줍니다.

기존에 다른 프로세스가 2181~2183 포트와 11211~11213 포트를 사용하고 있지 않는지 확인하여 만약 해당 포트를 사용중인 프로세스가 존재한다면 ZooKeeper 앙상블이나 캐시 서버가 다른 포트를 사용하도록 설정 파일을 수정해주세요.

2) docker compose 실행

docker-compose.yml 파일이 저장된 경로에서 다음 명령어를 실행하면 ZooKeeper 앙상블과 캐시 서버가 백그라운드로 실행됩니다.

docker compose up -d

3) ARCUS 동작 확인

Zookeeper 서버에 srvr 명령을 보내 서버가 정상적으로 실행되었는지 확인할 수 있습니다.

echo srvr | nc localhost 2181
echo srvr | nc localhost 2182
echo srvr | nc localhost 2183

마찬가지로 ARCUS 캐시 서버에 stats 명령을 보내 정상 실행 여부를 확인할 수 있습니다.

echo stats | nc localhost 11211
echo stats | nc localhost 11212
echo stats | nc localhost 11213

이외에도 telnet 명령어를 통해 ARCUS 캐시와 연결해 다양한 ARCUS 캐시 명령어를 사용해 볼 수 있습니다.

마치며

지금까지 Docker를 사용한 ARCUS 캐시 서버 및 캐시 클러스터 구동 방법에 대해 간단히 알아보았습니다. 컨테이너 기술은 쿠버네티스나 서버리스 기반 서비스를 개발하는데 근간이 되는 기술로, 수많은 서비스들을 빠른 시간내에 배포하는 작업이 빈번히 요구되는 운영 환경 조건들을 충족해줄 수 있어 업계에서 굉장히 많이 사용되고 있습니다.

이번 포스팅에서 다룬 Docker와 Docker Compose는 한 개의 호스트에서만 ARCUS 캐시 클러스터를 구동할 수 있기 때문에 실서비스에 적용하기 어렵다는 단점이 존재합니다. 다음 포스팅에서는 여러 호스트에서 컨테이너들을 효과적으로 관리할 수 있는 쿠버네티스를 이용해 ARCUS 캐시 클러스터를 구동하는 방법을 소개하겠습니다.

Docker 컨테이너 환경에서 ARCUS 캐시 사용하기 was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

Arcus 캐시에서 MaxBkeyRange 이용하여 최근 내역 자동 관리 방안

moonseop kim — Fri, 25 Feb 2022 08:03:18 GMT

SNS 혹은 쇼핑몰에서 사용자들에게 편리한 서비스 제공을 위해 사용자의 최근 내역(활동 내역, 조회한 상품 내역, 장바구니)들을 저장하여 제공하고 있습니다. 저장된 데이터는 영구적으로 저장하지 않고 최근 N일 내의 데이터만을 유지하며 사용자에게 제공하는 것이 일반적입니다.

대부분의 최근 내역은 DB에 저장하여 관리 합니다. 최근 내역의 관리를 위해서 주기적으로 스케줄링 작업을 통해 오래된 내역을 삭제해야 합니다. 뿐만 아니라 최근 내역은 사용자가 빈번하게 요청하는 데이터이기 때문에 반복적으로 DB 조회가 요청될 수 있고, 데이터 제공 시에 상품 등과 join 질의가 수행된다면 DB에 더 큰 부담이 됩니다. 이러한 스케줄링 작업 및 빈번한 조회 요청으로 부터 DB 부하를 줄이기 위해 최근 내역 데이터를 캐시에 저장해 제공할 수 있습니다.

하지만, 캐시에서도 최근 내역 데이터를 저장하여 관리하는 것은 복잡한 작업입니다. 예를 들어 TreeMap형태로 사용자의 내역 데이터를 저장한다고 가정하면, 오래된 내역을 제거하기 위해 아래와 같은 작업을 주기적으로 수행해야 합니다.

현재일자와 비교하여 제거할 내역의 시간 범위 계산
계산된 범위에 속한 오래된 내역 일괄 제거

주기적으로 DB에서 사용자의 전체 목록을 조회하여 캐시에 저장된 사용자들의 오래된 내역 데이터를 삭제하는 것은 DB뿐만 아니라 캐시 성능에도 영향을 미치어 캐시 사용 효율이 떨어지게 됩니다.

만약 이런 최근 내역 데이터를 캐시에서 설정한 일자로부터 경과한 데이터에 대해 스케줄 작업없이 자동으로 삭제해 준다면, 시스템 전체 성능 향상 뿐만아니라 응용 개발자의 입장에서 얼마나 편리할까요?

지금부터 이 모든 기능이 수행 가능한 Arcus의 기능을 소개하도록 하겠습니다.

Arcus에서 지원하는 아이템 유형은 key-value 유형과 list, set, map, b+tree 형태의 collection 유형이 있으며, 본 글에서는 b+tree 유형의 속성 정보 중 하나인 maxbkeyrange에 대해서 설명하고 사용 예시를 통해 최근 내역 데이터를 관리하는 방법을 소개하도록 하겠습니다.

B+tree

b+tree는 Arcus에서 지원하는 Collection 유형 중 하나로, leaf 노드에 구조의 elements를 정렬하여 저장하는 자료구조를 가지며, 아래 그림과 같습니다.

MaxBkeyRange

maxbkeyrange는 b+tree 유형에만 제공되는 b+tree only 속성 정보이며, Max(최대) Bkey(bkey) Range(범위) 말 그대로 bkey들의 최대 범위를 지정하는 속성 정보입니다.

더욱 자세하게 설명하면, maxbkeyrange는 b+tree내에 저장할 수 있는 제일 작은 bkey(smallest bkey)와 제일 큰 (largest bkey)의 사이의 최대 범위를 나타냅니다. b+tree에 maxbkeyrange를 설정하고 새로운 element를 추가할 때 maxbkeyrange 범위를 벗어나면 b+tree의 overflowaction 정책에 의해 기존 element가 제거되거나(smallest_trim) 새로운 element가 추가되지 않게(largest_trim) 됩니다.

아래 그림은 maxbkeyrange가 10인 b+tree에 bkey가 11인 새로운 element가 삽입된다고 했을 때의 모습입니다. maxbkeyrange 조건을 위배하지 않았으므로, 새로운 element가 정상적으로 삽입됩니다.

다음으로 bkey가 12인 element를 삽입합니다. 맨 앞 element의 bkey값은 1이고 추가되는 element의 bkey값은 12로 둘의 차이는 11이 되며 maxbkeyrange의 범위를 초과하게 됩니다. 이 경우, maxbkeyrange 설정 준수를 위해 현재 b+tree의 overflowaction 정책을 수행하게 됩니다.

현재 b+tree의 overflowaction 정책이 “smallest_trim”(최소 bkey를 가진 element 삭제)이라고 한다면, bkey가 1인 맨 앞의 element는 삭제되고 새로 추가되는 element가 b+tree에 삽입되게 됩니다. overflowaction 정책이 수행되고 새로운 element 삽입에 관한 결과는 아래의 그림과 같습니다.

코드로 살펴보기

다음은 위의 예시를 arcus-java-client 이용하여 응용 코드로 살펴 보겠습니다. maxbkeyrange를 10으로 overflowaction은 smallest_trim으로 설정한 뒤 15개의 elements를 삽입 했을 때 수행 결과를 살펴 볼 수 있습니다.

https://medium.com/media/ac3dc0971e443224833f29aed4e6b450/href

실행 결과, 맨 앞 element의 bkey와 맨 뒤 element의 bkey의 차이가 10이하(bkey 1 ~ bkey 11) 인 경우에는 element의 삭제가 일어나지 않고, 모든 element의 삽입이 이루어졌으며, 10을 초과시(bkey 1 ~ bkey 12) overflowaction 정책에 따라 맨 앞 element가 삭제되고 새로운 element가 삽입됨을 볼 수 있습니다.

b+tree 생성결과 :CREATED
1번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 1
2번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 2
3번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 3
4번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 4
5번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 5
6번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 6
7번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 7
8번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 8
9번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 9
10번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 10
11번째 element 삽입 결과 :STORED
맨 앞 element bkey : 1, 맨 뒤 element bkey : 11
12번째 element 삽입 결과 :STORED
맨 앞 element bkey : 2, 맨 뒤 element bkey : 12
13번째 element 삽입 결과 :STORED
맨 앞 element bkey : 3, 맨 뒤 element bkey : 13
14번째 element 삽입 결과 :STORED
맨 앞 element bkey : 4, 맨 뒤 element bkey : 14
15번째 element 삽입 결과 :STORED
맨 앞 element bkey : 5, 맨 뒤 element bkey : 15

attribute 정보
type=btree
expiretime=10
count=11
overflowaction=smallest_trim
maxbkeyrange=10
minbkey=5
maxbkey=15

지금까지 가장 작은 bkey와 가장 큰 bkey의 범위를 설정하고 그 범위가 초과하면, b+tree의 smallest_trim overflowaction 정책에 따라 동작이 수행되는 maxbkeyrange 속성에 대해 알아보고 해당 특성을 코드로 살펴보았습니다.

MaxBkeyRange 활용

다음으로는 maxbkeyrange를 활용하여 응용에서 적용해 볼 수 있는 예제에 대해서 알아보도록 하겠습니다.

이번 예제에서는 어떤 응용이 b+tree에 데이터를 캐싱하고 최근 5일 치의 데이터만을 유지한다고 가정합니다. 초 단위의 시간 값을 bkey로 사용한다면 maxbkeyrange는 5일 치에 해당하는 값인 432000(5 * 24 * 60 * 60)으로 지정합니다. 그리고 최근 값만을 유지하기 위해 overflowaction 정책을 “smallest_trim”으로 설정합니다. 따라서, 새로운 아이템이 추가될 시에 가장 오래된 아이템과 새로 추가된 아이템이 5일 차이가 난다면 가장 오래된 아이템을 캐시에서 자동으로 삭제하게 됩니다.

실제 응용 코드 구현

맨 처음 말씀드렸던 SNS 혹은 쇼핑몰에서 나의 최근 내역(활동 내역, 조회한 상품 내역, 장바구니)를 시간순으로 캐싱하는 요구사항이 있습니다. 이 데이터는 영구적으로 캐싱 되지 않고 최근 데이터를 기준으로 5일만 캐싱 된다고 합니다. 해당 요구 사항을 위 예제 설명을 이용해 코드로 구현해 보겠습니다.

코드 구현은 java를 이용하여 구현하도록 하겠습니다. 먼저 해당 요구사항에 맞게 코드를 구현할 MyService 클래스를 생성해 줍니다.

https://medium.com/media/5e8f4a0bdc4c5714e41d3df8c7160e22/href

MyService를 수행할 Test class를 생성하거나, Spring의 경우에는 Controller Layer를 통해 각 기능(생성 / 삽입 / 조회)별로 API 요청을 받아 해당 예제를 수행할 수 있습니다.

https://medium.com/media/27888a3d230be4935c8e4a2ecdf2e445/href

코드를 실행한 결과는 아래와 같습니다. 최근 내역부터 데이터가 출력됩니다.

1일차 데이터
testValue1
2일차 데이터
testValue2
testValue1
3일차 데이터
testValue3
testValue2
testValue1
4일차 데이터
testValue4
testValue3
testValue2
testValue1
5일차 데이터
testValue5
testValue4
testValue3
testValue2
testValue1
6일차 데이터
testValue6
testValue5
testValue4
testValue3
testValue2
testValue1
7일차 데이터
testValue7
testValue6
testValue5
testValue4
testValue3
testValue2
8일차 데이터
testValue8
testValue7
testValue6
testValue5
testValue4
testValue3
9일차 데이터
testValue9
testValue8
testValue7
testValue6
testValue5
testValue4
10일차 데이터
testValue10
testValue9
testValue8
testValue7
testValue6
testValue5

결과를 살펴보면, 가장 최근 데이터를 기준으로 5일(예제 코드에서는 5초) 내 데이터를 조회할 수 있습니다.

사용자가 직접 스케줄링 작업을 통해 유저들의 최근 데이터를 삭제를 수행하지 않고, maxbkeyrange 설정을 통해 캐시에서 자동으로 편리하게 최근 데이터를 유지할 수 있습니다.

마무리

지금까지 Arcus에서 제공하는 b+tree 유형에 대한 maxbkeyrange 속성을 알아보고 응용 사례까지 코드로 구현해 보았습니다. DB의 부담으로 최근 내역 데이터를 캐시에 저장하여 사용할 때, 최근 내역 데이터를 주기적으로 관리하기 위해서는 관리 대상인 아이템의 key 목록이 필요합니다. 하지만 maxbkeyrange를 설정하여 b+tree에 최근 내역 데이터를 저장하여 관리한다면, 관리 대상 아이템의 key 목록 없이 자동으로 최근 내역을 유지할 수 있습니다. 이는 key 목록 관리 및 스케줄링 구현이 필요한 복잡한 응용 개발의 부담을 줄일 수 있습니다. 그러므로 현재 최근 내역을 사용하거나, 사용 예정이거나 혹은 시스템 부담으로 인해 서비스를 제공하기 부담스러웠던 개발자분들에게 maxbkeyrange를 이용한 최근 내역 기능을 구현해 보실 것을 추천드립니다.

오늘 설명한 maxbkeyrange는 Arcus에서 제공하는 b+tree의 일부 기능입니다. b+tree 외에도 list, set, map과 같은 collection 유형이 존재하며 이를 다양하게 이용할 수 있습니다. 간단하게 key-value 형태의 저장 / 조회 기능만 이용하여 복잡한 응용 코드를 생산하기보다 Arcus에서 제공하는 고급진 기능들을 통해 간결하게 작성할 수 있습니다.

앞으로도 블로그를 통해 Arcus에서 제공하는 편리한 기능들을 소개하는 시간을 갖도록 하겠습니다. arcus-java-client의 추가적인 기능에 대해 궁금하신 분은 arcus-java-client-doc에서 더 많은 기능들을 살펴보실 수 있습니다. 감사합니다.

Arcus 캐시에서 MaxBkeyRange 이용하여 최근 내역 자동 관리 방안 was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

Sustainable Caching Method in ARCUS and How To Apply It

N.M.G. — Mon, 01 Nov 2021 06:17:59 GMT

In large-scale applications for general users, when retrieving data (especially hot data), a high volume of requests towards the DB will result in a high load. Therefore a general method to reduce the load on the DB side and to provide a fast response is storing the frequently retrieved data in the distributed cache of an application.

When it comes to applying cache to the application, Demand-fill caching pattern is the most commonly used method. When an application requests data, first it will be checked in the cache-store, if data exists in the cache, retrieve data from the cache, otherwise, data will be retrieved from the database, stored into the cache, and after that, it will be returned. Hence a fast response can only be provided if data exists in the cache. Please check the ARCUS Common Cache Module Use with Basic Pattern Caching in Java Environment for more details on the demand-fill method.

However, the problem with this method is that when data in the database has been modified, this update won’t be reflected on the data in the cache. Therefore to reduce the data mismatch between DB and cache, Expire Time or Time-To-Live(TTL) is set. Because of this characteristic following issues may occur.

The difference in response latency to the request, in the cases of caching and expired caching.
DB load with a high volume of requests, right after the cache data expire until the data is cached again.

These are the problems that occur when the cached data has expired and all requests go to the database, especially the second issue is referred to as cache stampede.

Cache Stampede

remember lion king?

A stampede is a situation in which a group of large animals suddenly start running in the same direction in a sudden panicked rush. The same name and concept apply to a cache stampede problem when a popular cache item expires and is led by multiple requests for the item seeing a cache miss, re-requesting the same item from the database which causes high load and high response latency both at the same time. This can cause the following problems.

Delay of application response time
Duplicated writes: when retrieving the expired cache item from DB and caching the same data again and again

Cache Stampede Mitigation

Multiple approaches have been proposed to mitigate the cache stampede problem, one of the well-known approaches is “Optimal Probabilistic Cache Stampede Prevention” which has been published at the International Conference on Very Large Data Bases(VLDB). Here are some of the introduced/proposed methods from that paper.

External re-computation
○ Periodically regenerates cache items in the background process to prevent cache misses.
○ Cons.: burden of maintenance and periodically regenerated item list (key list) is required
Locking
○ Upon a cache miss, grant a lock to the first request to prevent duplicated DB retrievals and renew a cache.
○ Cons.: only duplicated retrievals and duplicated write issues caused by cache miss can be solved this way, but other response latency issues continue when the cache expires.
Probabilistic Early Expiration (PER)
○ Renew the cache with the calculated probability before the cache data expires using a stochastic algorithm.

Among the aforementioned methods, the main approach presented in detail in the paper is PER. To simply describe PER, it would be easy to explain it with the following images.

State of Basic Caching

The blue section is where the caching is applied, and the white part is where the cache got expired. Since a cache miss occurs only in the white part, it becomes a section where a lot of retrieval requests burden the database.

State of Caching with PER

This is a state in which caching is maintained without expiration by reaching before the cache item expires. Compare to the above-mentioned basic caching the recaching occurs before expiration time thus preventing cache item expiration.

Every time when a thread that processes a request retrieves the cache data, PER algorithm performs recaching with the random probability compared to the remaining expiration time, and as the expiration time is approaching closer, the probability of recaching also increases. More about the probabilistic algorithm will be discussed later.

Now, let’s find out through a simple test how the algorithm actually works.

Cache Stampede Mitigation: Test

Let's compare the performance outcomes of a cache expiration time for both basic caching and caching with PER by establishing the following test environments.

Test Conditions

Jmeter Settings

Measurement Factors
○ number of times cache miss occurred
○ response latency due to cache stampede
○ number of duplicated writes due to cache stampede

The results of the test of basic caching that doesn’t have any applied solution and caching with PER solution are as follows.

Caching (cache miss: 4-times, response latency: 500ms, duplicated writes: 242-times)

PER (cache miss: 0-times, response latency: 100~200ms, duplicated writes: 43-times)

Looking at the results, in the basic caching cache miss occurred due to the expiration, following many requests run to DB at the same time, thus resulting in cache stampede. Therefore there was a delay with responses at the DB side. Furthermore, due to the cache miss, all requests made for the cache item caused duplicated DB retrievals and cache writings.

On the other side, because PER algorithm is applied, recaching has been performed in advance and there was no cache miss in all parts that were scheduled to expire. Only recaching performers sent requests to DB and the response time and the number of duplicated writes were significantly reduced. Consequently, this confirms that PER is effective in preventing cache stampedes.

Practical Application Environment Test

In practical real applications, most DB retrieval times take from several to tens of milliseconds(ms). In this test, we have set the expiration time of cache items from dozens of seconds ~ to several minutes. Now let’s test how does PER actually works in real-world applications.

Test conditions

Jmeter Settings

Measurement Factors
○ number of times cache miss occurred
○ response latency due to cache stampede
○ number of duplicated writes due to cache stampede

The results of the test, composed for practical applications are as follows.

100tps (cache miss: 5-times, response latency: 0~100ms, duplicated writes: 95-times)

10tps (cache miss: 10-times, response latency: 100ms, duplicated writes: 20-times)

Obviously, when testing on the actual application environment, the results differ from the initial one.

Compared to the initial test, unlike before in 100tps where only DB retrieval time has changed, some of them are recached, some had cache misses, and even resulting in cache stampede(response latency and duplicate DB retrievals and cache write) repeatedly. And in the 10tps there was no recaching in all expiration periods, causing a cache miss. Now the question is why did we get different results?

Problems in Practical Application Environment Test

ARCUS Cache measures expiration time in seconds and does not expire precisely at the actual time.
○ A restriction on the application of the PER algorithm, which determines whether to recache in actual ms units.

In ARCUS, system calculations for item expiration are performed in seconds. I will use the above image to elaborate on my explanations. If the expiration time of different cache items is set to 3 seconds at different points between 0 and 1 second, then actual expiration will be performed simultaneously. For example, as shown in the picture, if you set expire time to 3 seconds at the point of 0th second, then it will expire at 3rd seconds, if you set expire time to 3 seconds at the point of 0.5th seconds, then it will also expire at 3rd seconds, not 3.5th seconds. The reason is the cache item expires with the ARCUS’s internal timer that works in seconds.

Even if the application measures the expiration time in detail with ms unit, because ARCUS processes expiration time in seconds, between application and ARCUS there is a gap of milliseconds, which affects the gap when determining whether or not to recache the item. Now that difference has been clarified, we can say that the reason why the repetitive cache misses occurred in the above test of 100tps is due to this problem.

2. Requests with low tps have a low re-caching probability.

Cache data with about 100 requests per second in practical applications corresponds to hot data. However, most of the data cached in the application will not/cannot be the hot data. If it’s a rare request several times per second or less, as shown on the right image above, there was no request for determining whether or not to recache, thus resulting in cache miss without recaching. The results can be checked in the graph of 10tps of the above test.

3. Shorter the DB retrieval time(computation time), the smaller the recaching determination interval, the lower the recaching probability.

Therefore, a clear understanding of the algorithm that determines to recache in PER is required. I will elaborate my explanation with the below-shown image.

First of all, the algorithm that determines recaching is the same as the above equation. Actual DB retrieval time(computation time) is multiplied by a probabilistic random value(-log(random()) and determines whether to recache or not, in comparison to the remaining expiration time(TTL).

For example, in the case of DB retrieval time of 500 ms, the probability of recaching is 10% when 1.0 ~ 0.9 seconds are left to the expiration time. As shown in the graph above, when the random value is 0.1,
500ms * 2.3(-log(0.1)) = 1150ms is calculated, and since it is larger than the actual TTL (900 ~1000 ms), it will be recached. Using the rest of the remaining random values(table 0.2 ~ 0.9) since the calculation result is smaller than the remaining expiration time, it won’t be recached. In the blue array(right) you can see the distribution range of random values that enable recaching at each time and check the recaching probability accordingly.

In practical real applications, since most DB retrieval times take from several to tens of milliseconds, if the corresponding equation is applied, the recaching determination interval is significantly reduced, thus it is less likely to recache even if there are many received requests. Of course, using a constant called beta it is possible to increase the range, however, it is very inconvenient for a user to apply constants differently considering the DB retrieval time for each API to which the algorithm is applied.

Summary of Results and Requirements

Now that we have understood the concept, let's define the constraints for PER algorithm in order to apply it in a real working environment.

Cache expiration at the exact time.
High tps requests (at minimum higher than 50tps).
Longer the DB retrieval time, the more effective it is.

Due to the above constraints, there are many problems when it comes to applying PER to ARCUS Common Module that actually operates in the real-world application environment. Therefore, it is necessary to define the requirements of JaM2in to apply to the ARCUS Common Module and attain a suitable algorithm.

Recaching is possible even if there is a cache expiration error.
Recaching is possible even at the low tps.
Recaching is possible even for cache data with a short DB retrieval time (recache regardless of DB retrieval time)
Fast response speed within several to tens of milliseconds in all sections, DB retrieval and write requests without duplication.
○ After determining recaching on the request thread, it requests recache on the background thread.
○ In the case of PER, recaching is requested when the remaining expiration time is very short. Hence, if it is processed in the background thread, recaching may not be completed before expiration.

Through these requirements, from the perspective of implementation, we designed and applied the new algorithm that can prevent cache stampede.

Sustainable Caching

Sustainable recaching(SUS) is a method designed by applying the PER algorithm. This method uses 10% of the cache expiration time as computation time instead of the actual DB retrieval time. Hence, the determination interval of recaching gets longer and allows recaching to take place even at low requests.

Instead of simply expanding the range using a constant(beta), it uses 1/10 of the expiration time by specifying it as the DB retrieval time(computation time). Therefore, this is the form of using cached items as much as 90% of their expiration time when their TTL remains 10% and performing recaching through a calculated probability based on that remaining 10%.

For example, if the expiration setting time is more than 10 seconds, the minimum time corresponding to 1/10 is 1 second, and since the recaching takes place within the range, recaching will be done even if a request is received with a small request amount corresponding to 1tps.

In addition, in order to reduce response delays, we did not directly perform caching logic when requesting recache, but rather asked to perform recaching operations in the background thread with sufficient time, and added redundant retrieval and write prevention.

A short summary of the types and properties of recaching is as follows.

PER
○ Prevents cache misses before the cache expires by recaching.
○ Uses DB retrieval time (computation time) for recaching.
○ Recaching proceeds immediately from the request processing thread as it approaches the expiration time.
SUS
○ PER and reaching determination algorithm logic are the same.
○ Recache using the expire time ratio when DB retrieval time (computation time) is less than 10% of expiration time.
○ Recaching proceeds in the background thread to prevent duplicated DB retrievals and cache writes.
Properties

The comparison of the corresponding algorithms through the example is as follows (DB retrieval time: 100ms, expire time: 30s)

Recaching range of the example is shown in the graph as follows.

SUS Algorithm in Application Environment: Test

In order to compare SUS algorithm performance, the test was conducted under the same environment and conditions as PER.

100tps (cache miss: 0-times, response latency: NONE, duplicated writes: 1-times)

10tps (cache miss: 0-times, response latency: NONE, duplicated writes: 1-times)

As it is clear from the graph, even when the algorithm is applied according to the actual requirements, it has a great effect on the results.

Since cached data is constantly retrieved without delay in response, it can have a similar response effect to receiving a response by caching the items as a sticky item. In addition, when recache is requested, by preventing duplicated retrievals and writings, we were able to reduce the load of DB as well.

Conclusion

In conclusion, we have checked the possible problems that may arise from the expiration time setting of the cache item. Among them, I have introduced you to the problem called a cache stampede that occurs when a lot of requests are received at the same time and how to solve it. We have implemented tests in order to apply to applications and designed a sustainable caching method that is applicable to ARCUS Common Module. SUS algorithm can give a fast response to the users not only for a large but also for a small number of requests regardless of the DB’s retrieval time. The future plan is to further enhance the SUS algorithm, with the additional features listed below.

Simultaneous request test by applying it to multiple APIs.
In order to perform reaching in the background, derivate the number of threads that are required, and the right size of the queue according to the number of application requests.
Comparison and quantification of the difference in performance(WAS throughput, DB load level) from the previous one when the Sustainable caching method was introduced.

Improvement of the algorithm is also needed.

When the expiration time is less than 1 second (in case of 1 second, DB retrieval time is set to 100ms), improve the algorithm that becomes the same as PER

As we have already mentioned Sustainable caching is an algorithm that reflects the requirements to solve the cache stampede problem, and there is still some room for improvements to be made. In the future, even after SUS is applied to the application, through many tests we will improve and modify the algorithm. Stay with us for more updates.

Reference:

▪☞ Optimal Probabilistic Cache Stampede Prevention

Sustainable Caching Method in ARCUS and How To Apply It was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

ARCUS에서 지속 가능한 캐싱 적용 방안

moonseop kim — Thu, 23 Sep 2021 08:36:22 GMT

일반 사용자를 대상으로 하는 대규모 응용에서는 데이터(특히, hot data) 조회 시에 DB로 많은 요청이 몰려 높은 부하가 발생합니다. 이러한 DB 단의 부하를 줄이기 위해, 자주 조회되는 데이터를 응용의 분산 캐시에 저장하여 신속하게 응답을 제공하도록 구현하는 것이 일반적인 방법입니다.

캐시를 응용에 적용할 때는 Demand-fill 방식을 많이 활용합니다. 데이터 조회 요청이 들어올 때 캐시에 데이터가 있다면 캐시의 데이터를 응답해주고, 없다면 DB 조회한 후 캐시에 데이터를 저장하고 응답해주게 됩니다. 따라서 캐시 데이터가 존재하는 동안은 빠른 응답을 제공해 줄 수 있습니다.

이 방식은 DB의 데이터가 수정될 때 캐시의 데이터와 불일치한다는 문제점이 있습니다. 그래서 DB와 캐시의 데이터 불일치 현상을 줄이기 위해 Expire time or Time-To-Live(TTL)를 설정합니다. (Demand-fill 방식 및 캐시를 응용에 적용하는 자세한 방법은 ARCUS 공통 모듈을 참고해주세요) 이러한 특징으로 인해 아래와 같은 문제가 생길 수 있습니다.

캐싱된 경우와 캐싱 만료된 경우에 요청에 대한 응답속도 차이 존재
캐시 데이터 만료 직후부터 데이터가 다시 캐싱될 때까지 수많은 조회 요청이 DB로 몰리는 현상

해당 문제는 캐시가 만료되어 DB 조회를 하면서 발생하는 문제이며 특히 2번의 문제를 cache stampede라고 지칭합니다.

Cache stampede

stampede란 많은 동물이 갑자기 빠르게 같은 방향으로 돌진하는 현상을 말합니다. cache stampede는 캐시 만료로 인해 많은 데이터 조회 요청이 DB로 갑자기 몰려 DB 부하 증가 및 그로 인한 조회 속도가 저하되는 현상으로 아래와 같은 문제를 일으킬 수 있습니다.

응용의 응답시간 지연
만료된 캐시 데이터를 다시 DB에서 조회하여 캐싱하는 작업이 중복하여 발생 즉, 캐시에 중복 쓰기(duplicate write) 발생

Cache stampede 방지대책

Cache stampede 문제를 해결하기 위한 논문이 VLDB라는 국제 학술대회에서 발표되었고, 그 논문에서 소개된 몇 가지 해결법은 아래와 같습니다.

External re-computation
○ 주기적으로 백그라운드 프로세스에서 캐시 아이템을 재생성하여 cache miss를 방지
○ 단점 : 주기적으로 재생성하는 아이템 목록(key 목록)이 필요
Locking
○ 중복으로 DB 조회 및 캐시 쓰기가 일어나지 않도록 cache miss가 발생한 처음의 요청에 Lock을 부여해 캐시를 갱신
○ 단점 : Cache miss 발생에 따른 중복 조회 및 쓰기의 문제만 해결 가능, 캐시 만료 시 응답지연 문제는 지속
Probabilistic Early Expiration(PER)
○ 확률적 알고리즘을 이용해 캐시 데이터가 만료되기 전 계산된 확률로 미리 캐시를 갱신

이 중 논문에서 자세하게 제시된 해결법은 PER입니다. PER을 그림으로 설명하면 아래와 같습니다.

기존의 캐싱 형태

파란 구간은 캐싱이 적용되어있는 구간이고, 하얀 구간이 캐시가 만료(expired)된 구간입니다. 하얀 구간에서 cache miss가 발생하므로 DB로 조회 요청이 몰리는 구간이 됩니다.

PER을 적용한 경우의 형태

캐시 아이템이 만료되기 전 재캐싱을 하여 만료 없이 캐싱이 계속 유지되는 형태입니다. 위의 그림(기본 캐싱)과 비교하여 보면 만료보다 앞서서 재캐싱이 일어나 만료가 발생하지 않는 것을 볼 수 있습니다.

PER은 요청을 처리하는 스레드가 캐시 데이터를 조회할 때마다 남은 만료 시간 대비 임의(random) 확률로 재캐싱을 수행하는 방안으로, 만료시간이 다가올수록 재캐싱 확률이 증가합니다. (확률을 이용한 알고리즘의 자세한 설명은 뒤에서 계속됩니다.)

그렇다면 실질적으로 해당 알고리즘이 어떠한 효과를 얻을 수 있는지 간단한 테스트를 통해 알아보겠습니다.

Cache stampede 해결방안 테스트

기본 캐싱과 PER 적용 경우에 캐시 만료 시의 효과를 비교해 보도록 하겠습니다. 테스트를 위한 환경은 아래와 같습니다.

테스트 조건

Jmeter 설정

측정 요소
○ cache miss 발생 횟수
○ cache stampede로 인한 응답 지연 속도
○ cache stampede로 인한 중복 쓰기 횟수

해당 기준으로 아무것도 적용하지 않는 기본 캐싱과 cache stampede 방지를 위해 PER을 적용하여 테스트한 결과는 아래와 같습니다.

캐싱 (cache miss : 4회, 응답지연 : 500ms, 중복쓰기 : 242회)

PER (cache miss : 0회, 응답지연 : 100~200ms, 중복쓰기 : 43회)

결과를 살펴보면 기본 캐싱에서는 캐시가 만료되어 cache miss가 발생했고, 많은 요청이 한꺼번에 DB에 몰려 cache stampede 현상이 발생했습니다. 그래서 DB단에서 응답지연 현상이 발생했습니다. 그뿐만 아니라 cache miss로 인하여 모든 요청이 아이템을 캐싱하기 위해 중복으로 DB 조회 및 캐시 쓰기를 하는 현상이 발생하였습니다.

반면에 PER을 적용했을 경우는 미리 재캐싱이 이루어졌기 때문에 만료가 예정된 모든 구간에서 cache miss가 발생하지 않았고, 재캐싱 수행자들만 DB를 조회하여 응답 시간과 중복쓰기 횟수가 현저히 줄어들었음을 알 수 있습니다. 이를 통해 PER이 cache stampede 방지에 효과가 있음을 확인할 수 있었습니다.

실제 응용 환경 적용을 위한 테스트

실제 응용에서는 DB 조회 시간이 대부분 수~ 수십ms 정도이며, 캐시 아이템의 만료시간도 짧게는 수십초 ~수분까지 설정하여 캐시를 사용하고 있습니다. PER이 실제 응용 환경에 적용이 가능한지 알아보기 위해 추가로 테스트를 진행해 보았습니다.

테스트 조건

Jmeter 설정

측정 요소
○ cache miss 발생 횟수
○ cache stampede로 인한 응답 지연 속도
○ cache stampede로 인한 중복 쓰기 횟수

운영 환경에 맞춘 테스트의 결과는 아래와 같습니다.

100tps (cache miss : 5회, 응답지연 : 0 ~ 100ms, 중복쓰기 : 95회)

10tps (cache miss : 10회, 응답지연 : 100ms, 중복쓰기 : 20회)

실제 응용 환경에 맞추어 테스트 했을 경우 처음 테스트 결과와는 다른 결과가 나오게 됩니다.

처음 테스트와 비교하여 DB조회 시간만 달라진 100tps에서는 이전과 다르게 일부는 재캐싱, 일부는 cache miss 및 그로 인한 cache stampede 현상(응답지연과 중복 DB 조회 및 캐시 쓰기)이 반복적으로 발생하였습니다. 그리고 10tps에서는 모든 만료 구간에서 재캐싱이 일어나지 않고 cache miss가 발생했습니다. 왜 이런 결과가 발생한 것 일까요? 그 이유는 아래와 같습니다.

응용 환경 적용 테스트를 통한 문제점

ARCUS cache는 초 단위의 만료시간 측정으로 정밀하게 실제 시간에 만료되지 않음
○ 실제 ms 단위로 재캐싱 여부를 결정하는 PER 알고리즘의 적용에 제한

ARCUS 내부에서는 아이템 만료를 위한 시스템 계산이 초 단위 형태로 수행됩니다. 위의 그림을 통해 설명하겠습니다. 서로 다른 캐시 아이템을 0~1초 사이의 다른 지점에서 만료시간(expire time)을 3초로 설정하였을 경우 실제 만료는 동시에 수행됩니다. 예를 들어 위 그림처럼 0초 지점에서 3초를 설정했다면 3초에 만료가 되며, 0.5초 지점에서 3초를 설정하였다면 3.5초의 지점이 아닌 3초 지점에서 만료가 됩니다.그 이유는 ARCUS 내부에서는 초 단위의 타이머(timer)를 가지고 캐시 아이템의 만료 처리를 하기 때문입니다.

응용에서 만료시간을 ms 단위까지 자세하게 측정하더라도 ARCUS에서는 초 단위로 만료를 처리하기 때문에 응용과 ARCUS 사이에 ms 만큼오차가 발생하며, 이는 재캐싱 여부를 판별할 때에도 오차로 인한 영향을 미치게 됩니다. 위 100tps 테스트에서 번갈아서 반복 형태로 cache miss가 일어난 이유의 원인이 해당 문제로 인해 발생한 것입니다.

2. tps가 낮은 요청은 재캐싱 확률이 낮음

실제 응용에서 초당 100회 정도의 요청이 있는 캐시 데이터는 hot data에 해당됩니다. 하지만 응용에서 캐싱하는 데이터는 대부분 hot data가 아닐것 입니다. 초당 수회 또는 그 이하의 드문 요청이면, 위의 오른쪽 그림과 같이 재캐싱 여부를 판별하는 구간에 요청이 들어오지 않아서 재캐싱은 이루어지지 않고 cache miss가 발생합니다. 그 결과는 위 테스트 10tps의 그래프를 통해 보실 수 있습니다.

3. DB 조회 시간(computation time)이 짧을 수록 재캐싱 판별구간이 줄어들어 재캐싱 확률이 감소

해당 이유에 대해서는 PER에서 재캐싱을 판별하는 알고리즘에 대한 지식이 필요합니다. 쉽게 그림으로 설명해 보겠습니다.

일단 재캐싱을 판별하는 알고리즘은 위의 수식과 같습니다. 실제 DB 조회 시간(computation time)에 확률적인 랜덤 값(-log(random())을 곱하여 잔여 만료 시간(TTL)과 비교해 재캐싱 여부를 결정합니다.

예를 들어 500ms의 DB 조회 시간의 경우 만료시간이 1.0 ~ 0.9초가 남은 시점에 재캐싱이 이루어질 확률은 1/10입니다. 위 그래프에서 보시는 것처럼 random 값이 0.1일때 500ms * 2.3(-log(0.1)) = 1150ms가 계산되어 실제 TTL(900 ~ 1000ms) 보다 크므로 재캐싱이 이루어지고 나머지의 랜덤 값(표 0.2 ~ 0.9)을 사용하여 계산한 결과는 잔여 만료 시간보다 작기 때문에 재캐싱이 이루어지지 않습니다. (노란색 그래프 안에 각 시간 별 random 값의 분포를 보실 수 있으며 그래프를 통해서 시간별 재캐싱 확률을 계산해 보실 수 있습니다.)

실제 응용에서는 DB 조회 시간이 수~ 수십ms 정도이기 때문에 해당 수식을 적용한다면 재캐싱 판별 구간이 상당히 줄어 많은 요청이 들어오더라도 재캐싱 요청을 할 가능성이 낮아집니다. 물론 범위를 늘려주기 위해 beta라는 상수를 이용해 설정할 수 있지만, 알고리즘을 적용한 API마다 DB조회 시간을 고려하여 상수를 다르게 적용하는 일은 사용자로서는 대단히 번거롭습니다.

결과 요약 및 요구사항

해당 내용을 정리하여 PER이 실제 환경에서 적용되기 위한 제약 조건을 정의하면 아래와 같습니다.

정확한 만료시간에 캐시 만료
높은 tps의 요청량 (최소 50tps 이상)
DB 조회 시간이 클수록 효과적

해당 조건은 실제 서비스를 하는 ARCUS 공통모듈에 적용하기에는 제약사항이 많습니다. 그래서 공통모듈에 적용하기 위해 필요한 잼투인(JaM2in)만의 요구사항을 정의하고 그에 맞는 방안 도출이 필요했습니다.

캐시 만료 오차 범위 내에서 재캐싱
낮은 tps에서도 재캐싱 가능
DB 조회 시간이 짧은 캐시 데이터에 대해서도 재캐싱 가능 효과 (DB 조회 시간과 무관하게 재캐싱)
지연시간 없이 모든 구간에서 수 ~ 수십ms 이내의 빠른 응답속도, 중복없는 DB조회 및 쓰기 요청
○ 요청 스레드에서 재캐싱 판별 후 백그라운드 스레드에 재캐싱 요청
○ PER의 경우 만료시간이 아주 짧게 남았을 때재캐싱 요청을 하므로 백그라운드 스레드에서 재캐싱 수행시 짧은 만료 시간으로 인해 만료전에 캐싱을 완료하지 못하고 캐시 만료가 일어날 수 있음

위와 같은 요구사항을 통해 조금 더 구현 관점에서 cache stampede를 방지 할 수 있는 알고리즘을 고안하여 적용하게 되었습니다.

지속가능 캐싱 (sustainable caching)

지속가능 재캐싱 방법은 PER 알고리즘을 응용하여 고안된 방법입니다. 해당 방법은 DB 조회 시간 대신에 캐시 만료 시간의 10% 기준으로 PER 알고리즘을 적용하여 재캐싱 여부를 판별하는 방법으로, 낮은 요청에도 재캐싱이 이루어지게 합니다.

단순히 상수(beta)를 이용해 범위를 확장하지 않고, 만료 시간(expire time)의 1/10을 DB 조회 시간(computation time)으로 지정하여 사용합니다. 이것은 실제 응용 서비스에서 캐시 아이템의 TTL이 10% 남았을 때 재캐싱을 요청하는 형태로 캐싱 된 아이템을 만료시간의 90%만큼 사용하고 나머지 10%의 지점에서 계산을 통해 재캐싱을 수행합니다. 또한 DB 조회 시간에 관계없이 만료 시간을 통한 캐싱이 이루어지므로 재캐싱 범위에 영향이 없게 됩니다.

예를들어 만료설정 시간이 10초 이상일 경우 1/10에 해당하는 최소 시간은 1초로 해당 범위에서 재캐싱이 이루어지기 때문에 1tps에 해당하는 작은 요청량으로 요청이 들어와도 재캐싱이 이루어 지게 됩니다.

추가적으로 응답 지연을 제거하기 위해 재캐싱 요청 시 직접 캐싱 로직 수행을 하는 것이 아니라 충분한 시간을 가지고 백그라운드에서 재캐싱 작업이 수행되도록 요청을 하며, 중복 조회 및 쓰기 방지 기능도 추가하였습니다.

간단하게 재캐싱 유형 및 속성을 간단하게 정리하여 소개하면 아래와 같습니다.

PER
○ 캐시 만료가 이루어지기 전 미리 재캐싱으로 miss 방지
○ DB 조회시간(computation time)을 활용한 재캐싱
○ 재캐싱 시 만료시간에 근접하여 재캐싱 진행하므로 해당 스레드에서 바로 재캐싱
SUS
○ PER과 재캐싱 판별 알고리즘 로직 동일
○ DB조회 시간(computation time)이 만료시간(expire time)의 10%미만 일 경우 expire time 비율을 활용해 재캐싱
○ 재캐싱 시 백그라운드 프로세스에서 진행하며 중복 DB 조회 및 캐시 쓰기 방지
속성

해당 알고리즘을 예제를 통해 비교 하면 아래와 같습니다 (DB조회 시간 : 100ms, 만료시간 : 30s)

그래프로 예제의 재캐싱 범위를 확인하면 아래와 같습니다.

응용 환경에서의 SUS 방법 테스트

SUS 적용 성능 비교를 위하여 PER과 동일한 환경과 조건에서 테스트를 진행했습니다.

100tps (cache miss : 0회, 응답지연 : NONE, 중복쓰기 : 1회)

10tps (cache miss : 0회, 응답지연 : NONE, 중복쓰기 : 1회)

결과는 그래프로도 한눈에 볼수 있듯이, 실제 요구사항대로 알고리즘이 적용되어 큰 효과를 볼 수 있었습니다.

응답 지연없이 계속해서 캐싱된 데이터가 조회 되므로 아이템을 sticky하게 캐싱하여 응답받는 것과 비슷한 응답 효과를 낼 수 있습니다. 또한 재캐싱 요청시에도 중복 조회 및 쓰기 방지를 통해 DB의 부하도 경감할 수 있었습니다.

마치며

캐시 아이템의 만료시간 설정으로 인해 발생할 수 있는 문제를 살펴보았습니다. 그 중 많은 요청이 동시에 들어왔을 때 발생하는 cache stampede에 대한 문제와 그 문제를 해결하는 방법에 대해 알아보고, 응용에 적용하기 위한 테스트와 ARCUS 공통 모듈에 적용하는 sustainable caching 방법까지 함께 소개해 드렸습니다. 해당 방법은 많은 요청량 뿐만 아니라 적은 요청량에서도 DB의 조회시간에 관계없이 사용자에게 일정하게 빠른 시간내에 응답을 줄 수 있는 방법입니다. 앞으로 응용에 적용하기 위해 추가적인 준비 사항은 아래와 같습니다.

여러개의 API에 적용하여 동시 요청 테스트
백그라운드에서 재캐싱을 수행하기 위해 응용의 요청 수 별로 필요한 Thread 개수와 Queue의 적정 크기 도출
지속가능 캐싱 방법 도입시 이전과의 성능상 차이점 (WAS 처리량, DB 부하정도) 비교 및 수치화

또한 해당 알고리즘의 개선도 필요합니다.

expire time이 1초 미만일 경우 (1초일 경우 DB조회 시간은 100ms로 설정됨) PER과 동일해 지는 알고리즘 개선

위와 같이 sustainable caching은 cache stampede를 해결하기 위해 요구 사항을 반영한 하나의 방안이며 개선점과 테스트할 사항이 존재합니다. 이후에도 응용에 적용하여 많은 테스트를 통해 수정과 개선이 이루어 질 것입니다. 이후의 포스팅도 기대해 주시기 바랍니다.

참고

ARCUS에서 지속 가능한 캐싱 적용 방안 was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

Introducing Front Cache of ARCUS Spring

N.M.G. — Fri, 27 Aug 2021 09:19:50 GMT

Front Cache

Remote cache solutions that perform caching on separate servers such as ARCUS Cache have advantages in sharing data with each other that has been cached on multiple applications. However, on the remote cache, data response time sensitively gets affected depending on the occurrence of a large amount of network traffic or lack of resources due to a large number of temporary requests on the software. These kinds of problems can be solved by scaling a cluster or upgrading the system’s specifications, but in that case, there is a burden of increasing operation costs. On the other hand, there is also a software-wise solution that is relatively less expensive, and that is, caching data in the front of the remote cache using the local memory of the application. This method is called Front Cache, also known as local caching because it is cached in local memory. Caching data to the local memory is much faster than caching it to the remote server over the network. One of the advantages of Front Cache is its being not affected by network traffic, and let’s say if you are managing traffic load data, such as events, announcements, trend news then by applying Front Cache you can guarantee fast response time and high throughput for your application.

In order to be able to do that, we specifically implemented Spring Cache of Spring Framework for ARCUS and added the interface of Front Cache to the latest version of the ARCUS Spring(1.13.3)’s library. In this article, I will introduce you to the Front Cache features of ARCUS Spring, its behaviors, usage method, and precautions to be aware of.

How Front Cache works

The Front Cache provided by ARCUS Spring performs caching before the ARCUS. The organized behavioral structure based on the request types is as follows.

Retrieval of Cache Data

First, in order to retrieve data, application calls get API of Spring Cache interface.
Check if the data exists in the Front Cache. If it is, then we return the requested data.
If data does not exist in the Front Cache, check if the data exists in the ARCUS Cache. If it is, we store the data in the Front Cache and then return the requested data.

Store Cache Data

In order to save data, application calls put API of Spring Cache.
Then request storing data into ARCUS Cache.

3–1. (Option 1) Only when the data storing request to ARCUS is successfully completed, then store the data to the Front Cache.

3–2. (Option 2) Regardless of the result of data storing request to ARCUS, store the data to the Front Cache.

Removal of Cache Data

In order to delete data, application calls evict/clear API of Spring Cache.
Then request data removal from ARCUS Cache.

3–1. (Option-1) Only when the data removal request from ARCUS is successfully completed, remove the data from the Front Cache.

3–2. (Option-2) Regardless of the data removal request from ARCUS, remove the data from the Front Cache.

In the cases of storing and removing data, Option 1 is set as default, but you can change it with forceFrontCaching. If the ARCUS Cache is not available due to a network issue, Option 2 can still maintain the Front Cache function. If your data changes over time then Option 1, if it isn’t then Option 2 is recommended.

How to Use

ArcusCache class is specifically implemented for ARCUS from Spring Cache interface and relies on the abstracted interface below for front caching. It provides flexibility and allows to implement the interface directly to use other remote or local caches as front caching for ARCUS as shown below.

public interface ArcusFrontCache {

  Object get(String key);
  void set(String key, Object value, int expireTime);
  void delete(String key);
  void clear();
}

Although you don’t need to implement the interface by yourself, you can always use the DefaultArcusFrontCache class built into ARCUS Spring. This class caches the data into the application’s local memory and uses the EhCache library internally. Now we will try to apply this class to Spring-based applications to perform Front Caching before ARCUS Cache.

First, we create the DefaultArcusFrontCache class then set the dependencies on the ArcusCache class. For the purpose of the testing, we will set TTL(TimeToLive) of the Front Cache to 10 seconds, 20 seconds shorter than the ARCUS cache.

public class ArcusCacheConfiguration {

  @Bean
  public ArcusCache testCache() {
      ArcusCache arcusCache = new ArcusCache();
      arcusCache.setName("test");
      arcusCache.setServiceId("TEST-");
      arcusCache.setPrefix("TEST");
      arcusCache.setTimeoutMilliSeconds(800);
      arcusCache.setArcusClient(arcusClient());

      // Setting TTL of ARCUS Cache item
      arcusCache.setExpireSeconds(30);

      // Setting Front Cache Instance
      arcusCache.setArcusFrontCache(testArcusFrontCache());

      // Setting TTL of Front Cache item
      arcusCache.setFrontExpireSeconds(10);

     // Even if Store/Removal requests of ARCUS fail 
     // Set to perform Store/Removal requests of Front Cache
      arcusCache.setForceFrontCaching(true);
      return arcusCache;
  }

  @Bean
  public ArcusFrontCache testArcusFrontCache() {
    return new DefaultArcusFrontCache(

      // Cache name, for each instance names must be unique
      "test", /* name */
      
      // Maximum number of cache items to be stored
      // If max is exceeded, existing item will be removed by LRU
      10000, /* maxEntries */

      // when retrieving item instance from the Front Cache
      // Setting for retrieve the reference of item or copy of it
      // Retrieves the item's reference when set to false
      false, /* copyOnRead */

      // when storing item instance into the Front Cache
      // Setting for storing the reference of item or copy of it
      // Stores the item's reference when set to false
      false /* copyOnWrite */
    );
  }

  @Bean
  public ArcusClientPool arcusClient() {
    ArcusClientFactoryBean arcusClientFactoryBean = new ArcusClientFactoryBean();
    arcusClientFactoryBean.setUrl("1.2.3.4:1234");
    arcusClientFactoryBean.setServiceCode("test");
    arcusClientFactoryBean.setPoolSize(8);
    return arcusClientFactoryBean.getObject();
  }
}

Add the ArcusCache instance that you just created to the cache list of CacheManager.

@EnableCaching
@Configuration
public class CacheConfiguration implements CachingConfigurer {

  @Autowired
  private ArcusCache testCache;

  @Bean
  @Override
  public CacheManager cacheManager() {
    SimpleCacheManager arcusCacheManager =
        new SimpleCacheManager();
    arcusCacheManager.setCaches(
      List.of(testCache)
    );
    return arcusCacheManager;
  }

  @Override
  public KeyGenerator keyGenerator() {
    return new StringKeyGenerator();
  }

  @Override
  public CacheResolver cacheResolver() {
    return null;
  }

  @Override
  public CacheErrorHandler errorHandler() {
    return null;
  }
}

We have now completed the Front Cache configuration. Now let's apply @Cacheable Annotation to the service that you want to apply cache and then check if Front Cache has been applied correctly.

@Service
public class ProductService {

  @Autowired
  private ProductRepository productRepository;

  @Cacheable(value = "test", key="#product.id")
  public Product get(Product product) {
    return productRepository.select(productDto.getId());
  }
}

Every time when you send the request to the service that you have added caching feature, log below from the ArcusCache class will be printed.

(1) Initially there is no data in ARCUS Cache and Front Cache, therefore the first thing to do is to store data in both.
(2) Retrieve and return the data from Front Cache. Since we have set TTL of Front Cache for 10 seconds, ARCUS Cache will not look for it until data is expired in the Front Cache.
(3) When the data stored in Front Cache is expired after 10 seconds, lookup for data from Front Cache will be failed, thus the data will be retrieved from ARCUS Cache, and stored in the Front Cache.
(4) Same procedure in step (2) will be performed.

DEBUG 21-07-16 17:42:05 [ArcusCache:448] - getting value by key: TEST-PRODUCT:1266
DEBUG 21-07-16 17:42:05 [ArcusCache:480] - trying to put key: TEST-PRODUCT:1266, value: com.jam2in.arcus.Product ... (1) Stores in ARCUS, Front Cache
DEBUG 21-07-16 17:42:07 [ArcusCache:448] - getting value by key: TEST-PRODUCT:1266 
DEBUG 21-07-16 17:42:07 [ArcusCache:454] - front cache hit for TEST-PRODUCT:1266 ... (2) Returns retrieved data from Front Cache
DEBUG 21-07-16 17:42:10 [ArcusCache:448] - getting value by key: TEST-PRODUCT:1266
DEBUG 21-07-16 17:42:10 [ArcusCache:454] - front cache hit for TEST-PRODUCT:1266 ... (2)
DEBUG 21-07-16 17:42:15 [ArcusCache:448] - getting value by key: TEST-PRODUCT:1266
DEBUG 21-07-16 17:42:15 [ArcusCache:454] - front cache hit for TEST-PRODUCT:1266 ... (2)
DEBUG 21-07-16 17:42:16 [ArcusCache:448] - getting value by key: TEST-PRODUCT:1266
DEBUG 21-07-16 17:42:16 [ArcusCache:470] - arcus cache hit for TEST-PRODUCT:1266 ... (3) Returns retrieved data from ARCUS Cache since TTL of Front Cache is expired
DEBUG 21-07-16 17:42:17 [ArcusCache:448] - getting value by key: TEST-PRODUCT:1266
DEBUG 21-07-16 17:42:17 [ArcusCache:454] - front cache hit for TEST-PRODUCT:1266 ... (4) Return retrieved data from Front Cache
DEBUG 21-07-16 17:42:18 [ArcusCache:448] - getting value by key: TEST-PRODUCT:1266
DEBUG 21-07-16 17:42:18 [ArcusCache:454] - front cache hit for TEST-PRODUCT:1266 ... (4)

The above example shows how a single Front Cache is created. But you can always configure N Front Caches instead of a single one. For example, if you want to use a unique front cache for each service, you can set it up as follows.

@Bean
public ArcusCache productCache() {
    ArcusCache arcusCache = new ArcusCache();
    arcusCache.setName("product");
    ... (omitted) ...
    arcusCache.setArcusFrontCache(productFrontCache());

    // Using Product Front Cache
    return arcusCache;
}

@Bean
public ArcusFrontCache productFrontCache() {

  // Create a product Front Cache
  return new DefaultArcusFrontCache(
   "productFront", 20000, false, false);
}

@Bean
public ArcusCache eventCache() {
    ArcusCache arcusCache = new ArcusCache();
    arcusCache.setName("event");
    arcusCache.setPrefix("EVENT");
    ... (omitted) ...
    arcusCache.setArcusFrontCache(eventFrontCache());

    // Using Event Front Cache
    return arcusCache;
}

@Bean
public ArcusFrontCache eventFrontCache() {

  // Create an Event Front Cache
  return new DefaultArcusFrontCache(
  "eventFront", 10000, false, false);
}

Please note that Front Cache instances created in the above example do not share data between themselves due to each instance has a different hash table. But if you want to share data of multiple ARCUS Cache instances with Front Cache, then you can create one Front Cache instance as shown below, and assign it to multiple ARCUS Cache instances.

@Bean
public ArcusCache productCache() {
    ArcusCache arcusCache = new ArcusCache();
    arcusCache.setName("product");
    ... (omitted) ...

    // Using share Front Cache    
    arcusCache.setArcusFrontCache(sharedFrontCache()); 
    return arcusCache;
}

@Bean
public ArcusCache eventCache() {
    ArcusCache arcusCache = new ArcusCache();
    arcusCache.setName("event");
    arcusCache.setPrefix("EVENT");
    ... (omitted) ...

    // Using share Front Cache     
    arcusCache.setArcusFrontCache(sharedFrontCache()); 
    return arcusCache;
}

@Bean
  // Using share Front Cache
  public ArcusFrontCache sharedFrontCache() { 
  return new DefaultArcusFrontCache("shared", 50000, false, false);
}

Warnings

The performance of Front Cache which is stored in the local memory of the application is much faster than a remote cache but it isn’t available in all types of cases. Please take a look at the following issues when using the Front Cache.

High Memory Usage

Front Cache stores the same data in the memory of all applications, therefore it has the disadvantages of requiring a lot of memory space. Depending on the number of expired and no longer referred cache data, it will cause frequent use of the Garbage Collection of JVM and result in low application performance. Therefore, considering memory usage of the application the maximum size and amount of the data that can be stored in the Front Cache must be estimated. If you are using the DefaultArcusFrontCache class of ARCUS Spring, you can set the maximum number of data that can be stored with the maxEntries property.

Data Mismatch

Generally, data mismatch occurs between applications due to the multiple applications are unable to share cache data. For example, if you perform a data change request to a particular application, other applications’ Front Cache does not reflect the changed data. Because of this, there is a problem with inconsistent data response for a request in a multi-server environment. We can resolve the data mismatch problem in the following manners:

We set short TTL for Front Cache. Even in the case of hot data that gets a lot of requests, with a short expiration time, it will still show high performance.
We set the Sticky Session on the load balancer at the front of the application and depending on the session we convey the request to the server that first processed it.
We only front cache small changes of data.

Dual Caching (in case of ARCUS Java Client Usage)

Java Client of ARCUS also has an internal Front Cache feature. However, there are some limitations such as TTL, the maximum number of data, and other properties of a Front Cache that must be shared for every cache target. If this limitation doesn’t concern you much, then using only ARCUS Java Client’s Front Cache would be enough. But if you want to use the Front Cache of ARCUS Spring with different properties per cache target then to prevent dual caching you have to disable ARCUS Java Client’s Front Cache function (it’s disabled by default). But if you are switching from ARCUS Java Client’s Front Cache to ARCUS Spring’s Front Cache then you need to directly disable the front cache function of ARCUS Java Client as shown below.

ConnectionFactoryBuilder factory = new ConnectionFactoryBuilder();
// Set to 0 to disable the Front Cache. Default value is 0.
factory.setMaxFrontCacheElements(0); 
ArcusClient client = new ArcusClient(SERVICE_CODE, factory);

Conclusion

In summary, I have introduced and explained the Front Cache’s features provided by ARCUS Spring. If you are aware of the warnings for the usage of front cache and apply safety measures, then you can experience improvement in the performance of request processing for more applications, rather than using ARCUS Cache alone. However, there are limitations for front cache to be applied. In order to be able to apply front cache to the more cache targeted areas synchronization feature must be included to eliminate inconsistencies in data between other front caches of multiple applications, as well as in ARCUS Cache.

ARCUS Spring project has been improving steadily, and we will keep you informed on further enhancements and additions to the front cache in our next articles.

Introducing Front Cache of ARCUS Spring was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

Introducing ARCUS Single Cache (Dev.) on AWS Marketplace and How to Use It

N.M.G. — Wed, 16 Jun 2021 03:41:07 GMT

ARCUS Single Cache(Dev.) is configured as an AMI(Amazon Machine Image) to provide an easy and fast experience of ARCUS on AWS Marketplace. ARCUS Single Cache(Dev.) uses AMI configuration to support one-click deployment to ease the use of ARCUS Cache. You can check the ARCUS Single Cache (Dev.) on AWS Marketplace from here (URL).

What’s Amazon Machine Image?

An Amazon Machine Image or AMI is a packaged environment containing a software(OS, Application Server) configuration and other additional applications required to set up an instance to deliver a service or a part of it.

Previously, to use the ARCUS Cache, first you needed to clone it from its Github repository (URL) and go through various build/setup processes. This method can be a little bit difficult for first-time users. ARCUS Single Cache (Dev.) solves this difficulty, and it’s much easier and faster to experience ARCUS Cache. In this article, I’ll introduce you to ARCUS Single Cache(Dev.) and how to use it.

ARCUS Single Cache (Dev.)

ARCUS Single Cache(Dev.) has been implemented as a light version of the ARCUS Cache Cluster to experience the basic features. As the name suggests, ARCUS Single Cache(Dev.) supports a single cache owing to consist of only one node, thus being limited to only some of the features of ARCUS.

Configuration Details

Besides the clustering feature, ARCUS Single Cache(Dev.) limits available memory and connection resources while providing most of the basic main features for a single cache. Configuration details of ARCUS Single Cache(Dev.) are as follows:

▪ Memory size: 250MB
▪ Connection size: 1024

Provided Features

ARCUS Single Cache(Dev.) supports simple key-value data type and a collection (List, Set, Map, B+Tree) data structure that stores and views multiple values in a structured form in a single key. It also provides a prefix feature to form a group and manage keys. The details are as follows.

Cache Item

In addition to simple key-value, ARCUS Single Cache(Dev.) provides various item types in a collection data structure.

Key-Value: a simple key-value item that stores a single value.

Collection item

List item: an item that has a double-linked list of data elements.
Set item: an item that has an unordered set of unique data elements.
Map item: an item that has an unordered set of pairs.
B+tree item: an item that has a data set sorted by a b+tree key.

B+tree supports efficiently range search in both backward and forward directions as well as exact search. Each element of b+tree has a unique key and a set of elements is sorted by these unique keys in its b+tree structure.

Cache Key

Cache key identifies the data to be stored in the ARCUS Single Cache(Dev.). It has a syntax of :.

Prefix is a name preceding the cache key. You can group keys stored in a cache server to flush or view stat info in prefix units. prefix can be omitted but it’s recommended to be used as much as possible. prefix can only consist of: Uppercase letters, numbers, (_)underbars, (-)hyphens, (+)plus, and (.)dots characters.
delimeter (:)is a character used to separate prefix and subkey.
Subkey is a key commonly used in applications to distinguish cache items. subkey cannot contain spaces, and by default, it is recommended to use only alphanumeric characters.

So up to this point, I have given you general information about ARCUS Single Cache(Dev.) and from now on I will show you how to use it on AWS.

Creating ARCUS Single Cache (Dev.)

Before creating ARCUS Single Cache(Dev.) first thing you need to do is create an AWS account. If you don’t have an account, please create an account before proceeding. Now, login into AWS Management Console(URL) and let’s create an EC2 instance to experience ARCUS.

1: Click the Services tab on the upper left corner of Console.

2: Choose EC2 under Compute Category.

3: Once you’re in the EC2 Dashboard, click the Launch Instance.

4: Now Step-1: Choose an Amazon Machine Image (AMI) page will appear. Choose the AWS Marketplace that you’ll see on the left side of the page.

5: Search for an arcus, Select the ARCUS Single Cache(Dev.) and continue.

6: Now, on Step-2: Choose an Instance Type, select the instance type of your choice, but I’ll go with the t2.micro instance, as it's the vendor’s recommendation, and also Micro instances are free for up to 750 hours a month.

Once the instance type selection is completed, you can continue with further steps or you can jump right into Step-7: Review to proceed with a default configuration.

7: Now that all settings are complete, Step-7: Review page will appear. If you scroll down the page, you can check the port settings (SSH Connect(22), ARCUS ZooKeeper (2181), and ARCUS Memcached (11211).) to access ARCUS. In addition, all IPs (0.0.0.0/0) are opened/allowed by default. This may expose you to security issues, so we recommend checking your IP address and whitelisting it before you start.

8: Finally, to launch your instance choose an existing or create a new key-pair and download your key-pair (.pem) file. Put the .pem file into the same directory where your EC2 instance will run, it is important for SSH access.

Verify ARCUS

After connecting to the ARCUS Cache instance, via telnet we will check whether arcus-memcached is running and conduct a simple test. Now please enter your EC2 instance’s public IP and port, as shown below.

$ telnet [IP address] 11211

Now that you’re connected to ARCUS, try a stats command as shown below. The result of this command will show you ARCUS cache statistics such as PID, version info, memory usage, and computation performance.

$ stats

In addition, you can try some other commands(URL) on arcus-memcached. Note: the current ARCUS Documentation is only available in Korean. But you can use papago.naver.com for general translation.

Using ARCUS Single Cache (Dev.) with ARCUS Java Client

In order to use ARCUS Single Cache(Dev.) with Java application, you need ARCUS Java Client. In the following sections, I will show you the required settings to use Java Client.

Environment Settings

Once you have established the below-listed environment settings, please create a Java project to proceed.

Apache Maven (higher than version 4)
Java (higher than version 1.6)
Eclipse / Intellij IDEA

POM.XML Configuration

Once the project has been created, please update your POM file for installing dependencies as shown below.

<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">

4.0.0 
 
  com.navercorp.arcus
  arcus-quick-start
  1.0-SNAPSHOT
  jar

  
    UTF-8

  
  
    
      com.navercorp.arcus
      arcus-java-client
      1.13.0
    
    
 
    
      org.apache.logging.log4j
      log4j-core
      2.13.3
      true
    
    
      org.apache.logging.log4j
      log4j-api
      2.13.3
      true
    
    
      org.apache.logging.log4j
      log4j-slf4j-impl
      2.13.3
       
        
          org.slf4j
          slf4j-api
        
       
       true
    
    
      org.slf4j
      slf4j-api
      1.7.24

Create HelloARCUS.java

Now let's create HelloArcus.java within the project. You can use the below code as it is, but make sure to enter your EC2 instance’s public IP address as the `ADDRESS` variable.

import net.spy.memcached.ArcusClient;
import net.spy.memcached.ConnectionFactoryBuilder;import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

//HelloArcus.java
public class HelloArcus {
  
//Enter your corresponding EC2 IP address into ADDRESS.
  private static final String ADDRESS = "YOUR INSTANCE IP:2181";  
//Default value for service code is `test`.
  private static final String SERVICE_CODE = "test";

public static void main(String[] args) throws InterruptedException 
{
  System.setProperty("net.spy.log.LoggerImpl",
  "net.spy.memcached.compat.log.SLF4JLogger");
    ArcusClient client =
      ArcusClient.createArcusClient(ADDRESS, SERVICE_CODE,
      new ConnectionFactoryBuilder());

//Enter key, expiredTime, value of your choice.
  client.set("test:hello", 30, "Hello Arcus!");

//Inquiry the saved value using key.
  Future future = client.asyncGet("test:hello");
  String hello = null;
    try {
      hello = (String) future.get(700, TimeUnit.MILLISECONDS);
    } catch (Exception e) {
      future.cancel(true);
      }

    if(hello == null) {
      hello = "not ok!";
    }
    System.out.println(hello);
 }
}

Create logging File

Next, we create a log4j2.xml file in the src/main/resources/ directory for logging.

Run

Now that all setups are completed, return back to the HelloArcus.java file and run it. If all setups have been completed normally, you should get the following message:

Hello Arcus!

In case of the following error messages, please check your settings again.

Up to now, I have shown you how to simply store and lookup data in ARCUS using Java client. There are many more useful features to discover using ARCUS Java Client, such as APIs that utilize collection data structure, asynchronous APIs, etc. that can be useful in the Application. Please refer to ARCUS Java Client documentation(URL) for more details.

Note: the current ARCUS Documentation is only available in Korean. But you can use papago.naver.com for general translation.

Conclusion

In summary, I have introduced and explained the basic usage of ARCUS Single Cache(Dev.). ARCUS Single Cache(Dev.) was developed as a light version of the ARCUS Cache Cluster to experience the basic features and to ease the use of ARCUS Cache. You can always access it for free up to 750 hours from AWS Marketplace using EC2: t2.micro instance. ARCUS Single Cache(Dev.) is good for small-scale projects to improve performance where caching is needed. In the future, we will provide new ARCUS products with enriching features as an AMI in AWS Marketplace.

ARCUS Single Cache (service purpose)
ARCUS Cache Cluster

▪☞ AWS Marketplace: ARCUS Single Cache(Dev.)

Introducing ARCUS Single Cache (Dev.) on AWS Marketplace and How to Use It was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

Performance Test of ARCUS Data Persistence

N.M.G. — Wed, 16 Jun 2021 03:40:30 GMT

In the previous blog, Overview and Usage of Persistence Feature to Preserve Data Permanently in ARCUS Cache System, I have introduced ARCUS Persistence and showed you essential ways to send requests to it with telnet and memtier_benchmark tools. As I have mentioned in the previous blog, ARCUS Persistence has been implemented to guarantee high performance by minimizing the overhead of a data persistence feature so that there would be no significant difference from the performance of ARCUS Cache. In this article, we will compare the performances of the ARCUS Persistence to the original ARCUS caching and show the results of performance differences according to the Command Logging mode.

Test Environment

Performance test of ARCUS Persistence will be measured in accordance with establishing the following environmental requirements.

Test Machine Specs

Below listed specifications of a system on which the ARCUS Cache node runs. In this performance test, we have used only a single ARCUS node.

OS : CentOS 7.3.1611
CPU : 8vCPU
MEMORY : 8GB * 2
NETWORK : 1Gbps
DISK : HDD 50GB * 2
ETC :
▪ THP(Transparent Huge Pages) = madvise
▪ vm.swapiness = 30

Below listed specifications of a system on which the client that generates the load runs.

OS : CentOS 7.3.1611
CPU : 8vCPU
MEMORY : 8GB * 2
NETWORK : 1Gbps
DISK : SSD 50GB

ARCUS Server Running Options

The ARCUS Cache node that we have use runs on arcus-memcached 1.13.1 and a start command is as follows:

memcached -d -v -r -R 100 -t 6 -p 11500 -b 8192 -c 4096 -m 12000 \
-z localhost:2150 \
-X /home/test/arcus/lib/syslog_logger.so \
-E /home/test/arcus/lib/default_engine.so \
-e config_file=/home/test/arcus/conf/default_engine.conf

Considering the client load, the below-shown running options must be set properly.

t : number of worker threads
R : maximum number of requests per event
b : TCP backlog queue size
c : maximum number of clients that can be connected
m : maximum storage capacity (MB)

ARCUS Persistence Settings

In the default_engine.conf file that has the settings for ARCUS default engine operation, Persistence related settings also have been set as shown in the below sample. You can check the details of the Persistence setting in the Overview and Usage of Persistence Feature to Preserve Data Permanently in ARCUS Cache System. Therefore, here I will only show you the values to be set according to each test case.

# Persistence configuration
#
# use persistence (true or false, default: false)
use_persistence=true
#
# The path of the snapshot file (default: ARCUS-DB)
data_path=/disk2/test/arcus/ARCUS-DB
#
# The path of the command log file (default: ARCUS-DB)
logs_path=/home/test/arcus/ARCUS-DB
#
# asynchronous logging
async_logging=false
#
# checkpoint interval (unit: percentage, default: 100)
# The ratio of the command log file size to the snapshot file size.
# 100 means checkpoint if snapshot file size is 10GB, command log 
# file size is 20GB or more
chkpt_interval_pct_snapshot=100
#
# checkpoint interval minimum file size (unit: MB, default: 256)
chkpt_interval_min_logsize=256

use_persistence
▪ Set to true if you test ARCUS for storage purposes and if you test ARCUS for cache purposes set it to false.
data_path, logs_path
▪ We recommend that you set the data file and the log file paths to separate disk partition.
▪ This is because the snapshot of the checkpoint spends all IO resources of a disk, thus delaying command logging by worker threads that process client requests.
async_logging
▪ Set true for asynchronous logging test, false for synchronous logging test.
chkpt_interval_pct_snapshot, chkpt_interval_min_logsize
▪ We used a default value.

Performance Measurement Tools

We used memtier_benchmark 1.3.0 version to generate loads and measure performance. By supporting Memcached protocols we can perform insert (set) and retrieval (get) operations of the ARCUS KV commands. You can also generate keys in a uniform or Gaussian distribution and check the average throughput per second and the average/tail response time.

ARCUS Monitoring Tools

Hubble is the ARCUS Enterprise monitoring tool. Hubble collects the statistical information of an ARCUS instance and system resources of the host machine and shows them visually in a web browser. So we can easily observe ARCUS and system status through Hubble.

Performance Test Scenario

Comparison of Test Items

Performance variables to be checked and compared in this test.

Throughput - Average throughput
Latency - Average response time, a tail response time (90%, 95%, 99%)

Generate test data

Test data has been automatically generated by the memtier_benchmark.

Total data count: 50 million (6.5GB)
▪ Key size: 9~17Bytes (“memtier-1” ~ “memtier-50000000”)
▪ Data size : 50 Bytes
▪ Item size : Average 130 Bytes (total key, data, metadata)

Test Scenario and Procedure

For the performance test below, we will be measuring the performance of a cache mode, asynchronous logging mode, and synchronous logging mode with the memtier_benchmark. In these retrieval and update performance tests, we will generate keys only in a uniform distribution. The reason is in ARCUS entire data resides in memory, hence both uniform and Gaussian distributions will show the same performance results.

Performance Test of Insert
▪ Insert 50 million data,
▪ Execution command:

memtier_benchmark --threads=8 --clients=50 --data-size=50 \
--key-pattern=P:P --key-minimum=1 --key-maximum=50000000 \
--ratio=1:0 --requests=125000 --print-percentiles=90,95,99

Performance test of Retrieval
▪ After inserting 50 million data, generate keys in a uniform distribution and retrieve.
▪ Execution command

memtier_benchmark --threads=8 --clients=50 --data-size=50 \
--key-pattern=R:R --key-minimum=1 --key-maximum=50000000 \
--distinct-client-seed --randomize --ratio=0:1 --requests=125000 \
--print-percentiles=90,95,99

Performance tests of Mixed Update and Retrieval
▪ After inserting 50 million data, generate keys in a uniform distribution. Insert and retieve at the ratio of 1:9, 3:7, and 5:5.
▪ Execution command

memtier_benchmark --threads=8 --clients=50 --data-size=50 \
--key-pattern=R:R --key-minimum=1 --key-maximum=50000000 \
--distinct-client-seed --randomize --ratio=1:9 --requests=125000 \
--print-percentiles=90,95,99

Performance Test Results

Average throughput and average/tail response time have been organized according to persistence mode.

OFF is original cache performance,
ASYNC is asynchronous command logging mode,
SYNC is synchronous command logging mode.

Insert Operation

The performance test result of 50 million data insert operation is as follows:

The ARCUS Persistence performance was distinctly shown since all client requests consist of the insert operation. The result shows that the performance degradation associated with the persistence usage is not that great even compared to the original cache performance when it’s not used. Especially, synchronous logging mode completes update operation’s execution after verifying that worker threads recorded a command log onto a disk so that data can be fully recovered even if the ARCUS instance terminated abnormally at any point in time. The reason for such a high-performance result is that the worker threads have been implemented in a way that they can handle requests from other clients during the command log is being flushed to a disk by flush daemon thread.

Retrieval Operation

The performance test result of retrieving 50 million data in uniform distribution is as follows:

In the case of retrieval operations, command logging wasn’t executed, because there wasn’t any insert request. Thus, even if persistence mode would be used it will have the same performance results as an ARCUS cache.

Mixed Operation

The performance test result of mixed Update and Retrieval at a ratio of 1:9 is as follows:

The performance test result of mixed Update and Retrieval at a ratio of 3:7 is as follows:

The performance test result of mixed of Update and Retrieval at a ratio of 5:5 is as follows:

As you can see, even in the mixed operation’s performance there are no big differences with the persistence usage. Especially, because of the nature of the ARCUS system implementation, it has shown very good results on the performance of retrieval operation. The higher the volume of retrieval requests, the higher the performance regardless of whether the persistence mode is used or not. In the case of Update and Retrieval at a ratio of 1:9, the request throughput of synchronous logging mode(SYNC) differs only about 20K from the request throughput of the original ARCUS Cache(OFF).

Checkpoint Impact

Additionally, to verify the changes in request throughput of ARCUS instance during a checkpoint, we’ll show you the User/System CPU usage of an ARCUS host machine and the request throughput of ARCUS instance observed by Hubble.

P.S. Besides the below shown information, Hubble provides many other stats (network, disk, operation hit/miss, etc.) for analysis. But for now, we leave them out.

The above image shows the result of the performance test of a mixed operation of Update and Retrieval at a ratio of 1:9, operating in asynchronous logging mode with 50 million data insertion. The average throughput is about 150K combined with a throughput of update(yellow) and a throughput of retrieval(green). The time between 16:21 and 16:23 shows the checkpoint execution period. During that time throughput is temporarily reduced, and about 5GB was recorded to the snapshot file. As a future plan, we’re planning to slow down the checkpoint process further reducing the throughput degradation.

Conclusion

In this blog article, we have measured the performance difference of ARCUS Persistence according to command logging mode and the ARCUS Cache. As mentioned earlier, ARCUS Persistence was implemented to minimize the impact on the process of client requests, thus there isn’t any significant performance difference from the ARCUS cache. Especially, most real service workload patterns are a mixture of insert and retrieval operations, that have a high volume of retrieves, and our mixed operation test results showed a notably higher performance. The disk of this test environment is a defaultHDD provided by Naver Cloud Platform’s VM. If you use the NVMe SSD that provides high IOPs you can obtain a higher ARCUS performance. ARCUS Persistence is a good choice for most applications to ensure high performance where data preservation is required and smooth serviceability. We are continuously working on ARCUS Persistence optimization, trying to improve throughput, response time, and ease of use from an operational perspective.

Lastly, as a precaution for a use of ARCUS Persistence, because a checkpoint operation of ARCUS Persistence records the entire data from a memory to a disk, many disk IO resources will be used, which may delay command logging for the update operation. Therefore, disk partitioning between data files and log files is essential, in order to obtain high performance of ARCUS Persistence.

Previous blog posts:
▪☞ Overview and Usage of Persistence Feature to Preserve Data Permanently in ARCUS Cache System

Performance Test of ARCUS Data Persistence was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

Dynamically Manage and Update Cache Target API Lists of ARCUS Application

N.M.G. — Fri, 21 May 2021 10:46:04 GMT

The ARCUS Common Module makes use of Spring AOP technology to easily apply ARCUS Cache in Java Applications. There are two ways to apply ARCUS Cache, (1) attach Annotation to the cache target API and (2) specify cache target APIs and their cache attributes in the Property file. More details on the ARCUS Common Module are described in the ARCUS Common Cache Module Use with Basic Pattern Caching in Java Environment article.

When updates of cache target API or cache attributes are required from the caching methods (Annotation method, Property file method) of ARCUS Common Module, Application redeployment was needed to modify and reflect these changes. Due to the need to make updates without Application redeployment, the dynamic Property management method was developed.

We have used ZooKeeper as ARCUS metadata storage, to keep the cache target API data(cache target API + cache attributes) and have made it possible to update them when it’s necessary. ARCUS Common Module detects and retrieves the real-time changes of cache target APIs with the ZooKeeper’s Watcher feature and dynamically applies the changes to ARCUS applications.

Management of Cache Target API Data in ZooKeeper

ZooKeeper has a directory structure of key-value items called znode (ZooKeeper Node) that stores and manages desired data in it. Below-shown ZooKeeper’s directory structure describes how to store and manage cache target API data of the ARCUS Application. At the top of the structure, there come /arcus_app and /cache_target_list directories followed by /service-code/ znodes of ARCUS Cache Cluster. The cache target API list and their attributes of each ARCUS Application are stored to sub znodes of the service code that the Application access.

ZooKeeper directory structure

The key of the znode can solely identify the cache target API by using a target name consisting of the ‘package-name.class-name.method-name’ and the value of the znode stores the JSON property as shown in the below example.

/* 
JSON Property of Cache Target
*/
{

/*
It is consisting of ‘package-name.class-name.method-name’ as a signature of Cache Target API. 
*/

  "target":"com.service.BoardService.getBoard",

/*
Assigns the ARCUS Cache Key's prefix generated from Cache Target API.
*/

  "prefix":"BOARD",

/*
Assigns the ARCUS Cache Item's TTL (Time to Live) generated from Cache Target API.
*/

  "expireTime":60,

/*
Specifies the key parameters (separated by commas) that will be used as ARCUS Cache Key generated from Cache Target API. 
*/

  "keyParams":["bno"],

/*
Automatic creation of ARCUS Cache Keys generated from Cache Target API: if true automatically create a cache key using all parameters; if false keyParams must be specified.
*/

  "keyAutoGeneration":false,

/*
Condition of whether to apply Arcus Cache on the Cache Target API.
*/

  "enable":true,

/*
Append the Arcus Cache Key creation time information after the Arcus Cache Key string. This is intended for to create different cache keys.
*/
  "keyDate":"KEY_DATE_NONE"

All arcus_apps, cache_target_list and service_code znodes that store cache target APIs are all generated in persistent type and the znodes that store the cache target APIs data are created in persistent and sequence types. The reason why we use the sequence type is, to easily determine creation, deletion, and other cache target API property updates. The cache target API property Update operation removes the existing znode and generates a new znode of the same key with updated property details. Thereby causes the corresponding znode’s sequence value to be increased.

For instance, let’s say when we want to update API’s cache property where the znode of com.service.BoardService.getBoard-0000000000 key already exists. On the update operation, an existing znode will be removed, and a new znode with com.service.BoardService.getBoard-0000000001 key will be created at the same time. Thereby, the ARCUS Application looks up only the child nodes in /arcus_apps/cache_target_list/service-code znode and allows you to check creation, deletion, and update with the sequence value in its own cache target API list.

The ARCUS’s Cache Common Module Lookups the Latest Cache Target API List from ZooKeeper (using ZooKeeper Watcher)

ARCUS Common Module uses a cache item map to store and manage the cache target API list of the corresponding service code which the Application access.

In order to receive real-time notifications, we register the ZooKeeper Watcher to the corresponding znode of service code. When the changes occur in the corresponding cache target API list, ZooKeeper Watcher notifies any occurred updates to ARCUS Common Module. Then, ARCUS Common Module reads the latest cache target API list. It looks up the updated cache target API list by comparing the sequence values of previous and latest cache target APIs. ARCUS Common Module also reads the property details of the updated cache target API as a JSON file from ZooKeeper and updates them on the cache item map.

Creation, Deletion, and Update of Cache Target API Properties

By directly using the ZooKeeper Command Line Interface tool — zkCli, you can create, delete and update cache target API property details as shown below. In the below example, the -s option is given to create a znode of sequence type. But it would be much efficient to make and use a script that executes the corresponding operations than operating directly with a zkCli tool.

// CREATION of Cache Target API
[ZK: localhost:2180(CONNECTED)]
create -s /arcus_apps/cache_target_list/service-code/com.service.BoardService.getBoard 
{
 "target":"com.service.BoardService.getBoard",
 "prefix":"BOARD",
 "expireTime":60,
 "keyParams":["bno"],
 "keyAutoGeneration":false,
 "enable":true,
 "keyDate":"KEY_DATE_NONE"
}

// DELETION of Cache Target API
[ZK: localhost:2180(CONNECTED)]
delete /arcus_apps/cache_target_list/service-code/com.service.BoardService.getBoard-0000000000

// UPDATE of Cache Target API
[ZK: localhost:2180(CONNECTED)]
create -s /arcus_apps/cache_target_list/service-code/com.service.BoardService.getBoard
{
 "target":"com.service.BoardService.getBoard",
 "prefix":"BOARD",
 "expireTime":60,
 "keyParams":["bno"],
 "keyAutoGeneration":false,
 "enable":true,
 "keyDate":"KEY_DATE_NONE"
}
[ZK: localhost:2180(CONNECTED)]
delete /arcus_apps/cache_target_list/service-code/com.service.BoardService.getBoard-0000000000

Performing creation, deletion, and update operations of cache target API directly using a zkCli tool can be uncomfortable for an operator. On the webpage, to efficiently operate and manage ARCUS Cache Cluster (including ZooKeeper Ensemble) JaM2in currently developing ARCUS Admin Tool. On the ARCUS Admin Tool, we have developed a cache target feature to manage the Application’s cache target APIs, as shown in the below example.

Below web page shows the cache targe API list of a specific service code. In this page, our service code’s name is a test, by clicking on a particularcache target API, you can check the details of JSON property.

The below image displays how to create a new cache target API. A user enters the property details that will be made as a JSON property and creates the corresponding cache target API. Relatively, delete and update operations of cache target API can be performed in the same manner on the ARCUS Admin Tool.

To recap briefly what has been discussed so far, when a user creates, deletes, or updates the cache target APIs through cache target feature on the ARCUS Admin Tool, these changes are stored and managed by ZooKeeper Ensemble, where ZooKeeper Watcher monitors these change events and passes them to the ARCUS Common Module which is attached to the Application. As explained earlier, ARCUS Common Module looks up for updates on cache target API and manages them with a cache item map. ARCUS Cache will be automatically applied only to cache target APIs that stored on the cache item map.

Application Method of the ARCUS Common Module

ARCUS Common Module itself is described in the ARCUS Common Cache Module Use with Basic Pattern Caching in Java Environment article, but here as well, I will give you a brief introduction on how to apply the ARCUS Common Module to the Java Application.

In order to apply ARCUS Common Module to the Java Application first, add the ARCUS Common Module dependency to the pom.xml file of the Java Application.


   ...
   
      com.jam2in.arcus
      arcus-app-common
      1.4.0
   
   ...

Then, create the ARCUS property file (arcus.properties) as shown below.

# ZooKeeper Ensemble Address 
# Used to Connect  ARCUS and Renew Cache Target List 
arcus.address=1.2.3.4:2181,1.2.3.4:2182,1.2.3.4:2183

# Arcus service code
arcus.serviceCode=test

# Connection Pool size of Arcus Client 
arcus.poolSize=8

# Operation timeout (milliseconds)
arcus.asyncOperationTimeout=700

# Global Prefix of Arcus Cache Key 
arcus.globalPrefix=RELEASE

Lastly, if you set up the Spring as shown below, the ARCUS Common Module application will be completed. The ARCUS Common Module makes it very easy for a user to apply ARCUS Cache and dynamically manage a cache target API list.

@Configuration
// Arcus Property Loading
@PropertySource("classpath:arcus.properties")
// @Aspect Use
@EnableAspectJAutoProxy(proxyTargetClass = true)
@Import(ArcusBeans.class)
public class ArcusConfiguration { 
 @Autowired
 private ArcusBeans arcusBeans;
 
 @Bean
 public static PropertySourcesPlaceholderConfigurer 
          propertySourcesPlaceholderConfigurer() {
  return new PropertySourcesPlaceholderConfigurer();
 }
 
 @Bean
 public ArcusStarter arcusStarter() {
  return new ArcusStarter(arcusBeans);
 } 
   
 @Bean
 public ArcusCacheAspect arcusAdvice() {
  return new ArcusCacheAspect(arcusBeans);
 }
}

Conclusion

In summary, I’ve introduced and explained how to dynamically create, delete and update cache target APIs without restart of the ARCUS Application, and how to store and manage cache target APIs using ZooKeeper. ARCUS Common Module through ZooKeeper Watcher detects real-time updates in the cache target API list and always maintains the latest updated list of cache target APIs. In the future, to improve the management feature of a cache target API, the following works will be processed:

HIT, MISS, RATIO indication of cache target API,
HISTORY feature of cache target API e.g. records of by whom and when the cache target API property details were updated,
A parser feature to view cache target API as a JSON string or JSON file.

Dynamically Manage and Update Cache Target API Lists of ARCUS Application was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.

Arcus Single Cache (Dev.)를 소개합니다

moonseop kim — Mon, 17 May 2021 02:39:13 GMT

ARCUS Single Cache (Dev.) 는 AWS Marketplace에서 ARCUS 캐시를 쉽고 빠르게 경험할 수 있도록 구성된 AMI입니다. AMI(Amazon Machine Image)는 인스턴스 시작하는 데 필요한 소프트웨어(OS, Application) 등이 포함된 템플릿으로, 사용자들이 EC2 인스턴스를 통해 실행할 수 있습니다. ARCUS Single Cache (Dev.)는 AMI 구성을 통해 개발자들이 ARCUS 캐시를 체험해 볼 수 있도록 one-click deployment 형태로 제작되었습니다. AWS ARCUS Single Cache (Dev.) 링크

기존 ARCUS 캐시를 사용하기 위해서는 Github에서 ARCUS를 clone받은 후, 빌드 및 각종 설정 과정을 거쳐야 합니다. 이 방식은 처음 사용자들에게 어려울 수 있습니다. 그래서 기존 방식보다 빠르고 간편하게 ARCUS 캐시를 체험해볼 수 있도록 ARCUS Single Cache (Dev.) AMI를 AWS Marketplace에 출시하게 되었습니다. 이 AMI 통해 ARCUS 캐시의 기본 기능을 경험해 볼 수 있습니다.

이번 포스팅에서는 ARCUS Single Cache (Dev.)를 소개하고 사용하는 방법을 간단하게 알려드리겠습니다.

ARCUS Single Cache (Dev.) 정보

ARCUS Single Cache (Dev.)는 개발자를 위해 기본 기능을 체험해 보는 목적으로 한정하여 일부 기능이 제한되어 있습니다. ARCUS는 다중 캐시 노드들로 구성되어 동작하는 클러스터 기능을 제공하지만, 이번 ARCUS Single Cache (Dev.) 제품은 이름에서도 알 수 있듯이 하나의 노드만으로 구성된 단일 캐시를 제공합니다.

구성 정보

ARCUS Single Cache (Dev.)는 ARCUS의 클러스터 기능이 제외된 단일 캐시 노드의 기능만을 제공하고, 사용할 수 있는 메모리와 커넥션 리소스에 제약이 있습니다. 구성정보는 아래와 같습니다.

Memory 크기 : 250MB
Connection 개수 : 1024

제공 기능

ARCUS의 단일 캐시에서는 Key-Value 형태의 기본 타입과 Collection(List, Set, Map, B+tree) 타입의 아이템 자료구조와 key들을 그룹화하여 관리하는 prefix 기능을 제공하며, 이를 체험해 볼 수 있습니다. 해당 상세정보는 아래와 같습니다.

Cache Item:

Key-Value : 하나의 데이터를 저장하는 구조

Collection item:

list : 데이터들의 linked list 구조
set : 유일한 데이터들의 집합 구조
map : 쌍으로 구성된 데이터들의 집합으로 field 기준의 hash 구조
b+tree : b+tree 키 기준으로 정렬된 데이터들의 집합을 가지는 구조

Cache Key:

prefix 단위로 cache server에 저장된 key들을 그룹화하여 flush 하거나 통계 정보를 볼 수 있는 유용한 기능
prefix : cache key 앞에 붙는 name space
delimeter : prefix와 subkey를 구분하는 문자로, 콜론(‘:’)을 사용합니다.
subkey : 일반적으로 cache 아이템을 구별하기 위한 Key입니다.

prefix와 subkey는 명명 규칙을 가지므로 주의해야 합니다. prefix는 영문 대소문자, 숫자, 언더바(_), 하이픈(-), 플러스(+), 점(.) 문자만으로 구성될 수 있으며, 이 중에 하이픈(-)은 prefix 명의 첫 번째 문자로 올 수 없는 제약이 있습니다. subkey는 공백을 포함할 수 없으며, 기본적으로 alphanumeric만을 사용하길 권장해 드립니다.

지금까지 ARCUS Single Cache (Dev.) 에 대해 알아보았고, 이제 ARCUS Single Cache (Dev.)를 생성 및 사용하는 방법을 소개하겠습니다.

ARCUS Single Cache (Dev.) 생성

ARCUS Single Cache (Dev.)를 생성하기에 앞서 선행되어야 할 사항이 한 가지 있습니다. 바로 계정을 생성하는 것입니다. AWS 계정이 없으신 분들은 계정을 생성하신 후 진행해주시길 바랍니다.

먼저 ARCUS 체험을 위한 EC2 인스턴스를 생성해 보도록 하겠습니다.

1 : EC2 인스턴스 생성을 위해 맨 왼쪽 위에 서비스 탭을 클릭합니다.

2 : 컴퓨팅 카테고리 밑의 EC2를 클릭합니다.

3 : 왼쪽의 대시보드를 통해 인스턴스를 클릭합니다.

4 : 맨 오른쪽 인스턴스 시작 버튼을 클릭합니다.

5 : 단계 1: Amazon Machine Image(AMI) 선택 페이지가 나옵니다. 바로 밑에 AWS Marketplace를 선택하고, 검색창에 arcus를 입력합니다.

6 : ARCUS Single Cache (Dev.) 제품이 나타난 것을 확인하고 오른쪽의 선택버튼을 눌러주세요.

7 : 단계 2: 인스턴스 유형로 넘어가서 원하시는 EC2 인스턴스를 선택합니다.

사용자가 원하는 용도에 맞는 인스턴스 타입을 선택하시면 됩니다. 간단히 체험할 목적이라면, ARCUS Single Cache (Dev.)의 인스턴스로 t2.micro를 추천합니다.

인스턴스 유형 선택이 완료했다면, 세부적인 구성(단계 3, 4, 5, 6) 또는 바로 검토 및 시작(단계 7 바로가기)을 진행할 수 있습니다. 빠른 시작을 하고 싶으시다면 검토 및 시작 버튼을, 추가하고 싶은 기능이 있다면 세부적인 구성 버튼을 눌러주세요.

8 : 모든 설정을 완료하면, 단계 7: 인스턴스 시작 검토 페이지가 나타납니다.

여기에서 필요한 설정 한 가지가 있습니다. 스크롤을 내려 보안 그룹 에 대한 설정 내용으로 이동해 주세요

그림과 같이 ARCUS 접근을 위한 port 설정을 확인할 수 있습니다. 인스턴스의 SSH 접근(22)과 ARCUS Zookeeper(2181), ARCUS Memcached(11211) 접근을 위한 port 설정이 제대로 되어있는지 확인합니다.

또한, 기본적으로 모든 IP(0.0.0.0/0) 접근이 허락되어 있습니다. 이 경우에 brute force attack에 노출될 수 있음으로 자신의 IP를 확인하여 whitelist로 등록 후 사용하시길 권고드립니다.

9 : 마지막으로 키페어(key pair) 설정을 마치면 모든 인스턴스 생성이 끝나게 됩니다.

키페어는 SSH 접속을 위해 중요하므로 안전하게 보관하시길 바랍니다.

ARCUS 동작 확인

ARCUS 캐시 인스턴스 생성 후, telnet을 이용하여 ARCUS Memcached 실행상태 확인 및 간단한 테스트를 해보도록 하겠습니다.

IP(인스턴스 public IP)와 port를 입력해주세요

$ telnet IP 11211

telnet 연결이 되었다면, 아래와 같이 stats 명령어를 입력해 보도록 합니다. 이 명령의 수행 결과로 PID, 버전, 메모리 사용량, 연산 수행 통계 등의 정보를 확인할 수 있습니다.

$ stats

이외에도 ARCUS Memcached에 직접 사용해 볼 수 있는 여러 command가 존재합니다. Command 살펴보기 를 통해 더 많은 테스트를 해 볼 수 있습니다.

ARCUS Java Client를 이용해 ARCUS Single Cache (Dev.) 체험

이번 단계는 Java Application에서 ARCUS 캐시 기능을 사용하는 단계입니다. ARCUS Java Client를 이용하여 ARCUS Single Cache (Dev.)를 체험해 보도록 하겠습니다. ARCUS Java Client 사용을 위해 아래와 같은 환경 구성이 필요합니다. 그리고 체험에 필요한 Java project를 생성해 주세요

필요 환경

Apache Maven (version 4 이상)
Java (version 1.6 이상)
Eclipse / Intellij IDE

pom.xml 설정

Project 생성이 완료되었다면, 의존성 설치를 위한 pom.xml 파일에 아래와 같이 입력해 주세요

<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
    4.0.0

    com.navercorp.arcus
    arcus-quick-start
    1.0-SNAPSHOT
    jar

    
      UTF-8

    
      
      
        com.navercorp.arcus
        arcus-java-client
        1.13.0
    
        
    
        org.apache.logging.log4j
        log4j-core
        2.13.3
        true
    
    
        org.apache.logging.log4j
        log4j-api
        2.13.3
        true
    
    
        org.apache.logging.log4j
        log4j-slf4j-impl
        2.13.3
          
            
               org.slf4j
               slf4j-api
            
            
          true
    
    
        org.slf4j
        slf4j-api
        1.7.24

HelloArcus.java 생성

다음은 Project내에 HelloArcus.java를 생성합니다.

생성된 파일에 아래와 같이 코드를 입력합니다. ADDRESS 변수에 여러분이 생성한 EC2 인스턴스의 Public IP address가 필요하니 꼭 확인하여 입력해주시기 바랍니다.

import net.spy.memcached.ArcusClient;
import net.spy.memcached.ConnectionFactoryBuilder;

import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

//helloArcus.java
public class HelloArcus {
  
//해당하는 EC2 IP address를 ADDRESS 변수에 입력해주세요.
  private static final String ADDRESS = "YOUR INSTANCE IP:2181";  
//service code의 기본값은 "test"입니다.
  private static final String SERVICE_CODE = "test";

  public static void main(String[] args) throws InterruptedException {
  System.setProperty("net.spy.log.LoggerImpl",
  "net.spy.memcached.compat.log.SLF4JLogger");
     ArcusClient client =
        ArcusClient.createArcusClient(ADDRESS, SERVICE_CODE,
        new ConnectionFactoryBuilder());

// key, expiredTime, value를 원하는 값으로 입력해 주세요.
    client.set("test:hello", 30, "Hello Arcus!");

// key를 이용해 저장했던 값을 조회합니다.
    Future future = client.asyncGet("test:hello");
    String hello = null;
    try {
      hello = (String) future.get(700, TimeUnit.MILLISECONDS);
    } catch (Exception e) {
      future.cancel(true);
    }

    if(hello == null) {
      hello = "not ok!";
    }
    System.out.println(hello);
  }
}

logging 파일 생성

다음은 logging을 위해 src/main/resources/log4j2.xml 경로에 해당 파일을 생성합니다.

실행

이제 모든 설정이 완료되었으므로 다시 HelloArcus.java 파일로 돌아가 해당 파일을 실행합니다.

정상적으로 동작이 된다면 아래와 같은 메시지가 출력됩니다.

Hello Arcus!

만약 아래와 같은 에러가 발생했을 경우, 해당하는 설정을 다시 확인해 보시기 바랍니다.

이번 체험 단계에서는 ARCUS Java Client 통해 간단히 ARCUS에 데이터를 저장하고 조회하는 기능을 살펴보았습니다. ARCUS Java Client에서는 이외에도 더 많은 기능을 사용해 볼 수 있습니다. collection 구조를 활용할 수 있는 API 및 비동기 API 등 Application에서 유용하게 활용 가능한 기능들이 있습니다. 사용하고자 하는 기능 및 API에 대해 궁금하시다면 ARCUS Java Client - Docs를 참고해 주세요.

마치며

지금까지 ARCUS Single Cache (Dev.)를 생성하고 사용법을 알아보았습니다. ARCUS Single Cache (Dev.)은 ARCUS의 기본 기능을 체험하기 위한 목적으로 개발되었습니다. 하지만, 소규모 프로젝트에 caching 기능을 적용하기에 충분하며 성능 개선 효과를 볼 수 있습니다. 이번 체험을 통해 생성한 EC2 인스턴스를 캐싱이 필요한 곳에 적용해 보시길 추천해 드립니다. 향후 ARCUS의 여러 기능을 포함한 새로운 AMI 상품을 제공할 예정입니다.

ARCUS Single Cache AMI (service purpose)
ARCUS Cache Cluster

앞으로 새롭게 출시될 여러 형태의 ARCUS AMI를 기대해주시기 바랍니다.

Arcus Single Cache (Dev.)를 소개합니다 was originally published in JaM2in on Medium, where people are continuing the conversation by highlighting and responding to this story.