简介
常见的在线应用的服务,一般都会设置资源的扩缩来应对突然增大的访问量,常见的处理方式一般分为两种:
- VPA 纵向扩容,也就是增加单个服务的资源(CPU、MEM)。
- HPA 横向扩容,增加多个服务器,通过多副本加负载均衡实现。
本篇文章主要讲述第二种 HPA 。
CRD
kind: HorizontalPodAutoscaler apiVersion: autoscaling/v2beta2 metadata: name: vmirs-ljtian-1070a-test-vmirs namespace: ljtian-test spec: scaleTargetRef: kind: VirtualMachineInstanceReplicaSet name: ljtian-1070a-test-vmirs apiVersion: kubevirt.io/v1 minReplicas: 2 maxReplicas: 5 metrics: - type: Resource resource: name: cpu target: type: Utilization averageUtilization: 50 status: lastScaleTime: '2024-07-31T08:59:15Z' currentReplicas: 2 desiredReplicas: 2 currentMetrics: - type: Resource resource: name: cpu current: averageValue: 6m averageUtilization: 5