Supported Regions:上海

Running Mode and Scaling

This document introduces the running mode and scaling mechanism of CloudRun, helping you choose the most suitable configuration plan for your business scenario.

Operation Mode Overview

CloudRun provides the following running modes to cater to different business needs:

Always Automatic Scaling: Adjusts the number of instances automatically based on the load to maximize resource savings.
Continuous Operation: Maintains a fixed number of instances, suitable for scenarios with stable traffic.
Daytime Continuous Operation with Nighttime Automatic Scaling: Maintains fixed instances during working hours and automatically scales during off-peak hours.
Customization: Allows flexible configuration of auto-scaling and scheduled scaling policies.
Manual Instance Start/Stop: Provides full manual control over the number of instances.

Operation Mode Detailed Explanation

Always Automatic Scaling

In this mode, the CloudRun service automatically adjusts the number of instances based on the actual load:

Scaling Range: The number of instances can be automatically adjusted between 0-10.
Trigger Metric: You can select CPU utilization, memory utilization, or both as scaling trigger conditions.
Advantages: Maximizes resource savings, automatically handles traffic fluctuations, and reduces operational costs.

This mode is particularly suitable for business scenarios with significant or unpredictable traffic fluctuations.

Continuous Operation

In Continuous Operation mode, the service maintains a fixed number of running instances and does not perform automatic scaling.

Applicable Scenarios:

Service traffic is relatively stable with no significant peaks or troughs.
Ensures the service remains in a warm start state at all times, thus preventing cold start latency.
Core business with extremely high requirements for service availability

In this mode, you need to estimate the required number of instances based on business peak loads to ensure stable service operation.

Daytime Continuous Operation with Nighttime Automatic Scaling

This mode combines the advantages of Continuous Operation and automatic scaling:

Daytime Hours (8:00-24:00): Maintains a specified number of instances in continuous operation.
Nighttime Hours (0:00-8:00): Enables automatic scaling, allowing instances to scale down to 0.

This approach fits business scenarios with higher traffic during work hours and significantly decreased traffic during non-work hours, such as internal enterprise systems, office applications, etc.

Custom Icons

Customization Mode provides the most flexible configuration options, including:

Automatic Scaling:
- Configure the maximum and minimum quantity of instances
- Select CPU or memory utilization as the trigger condition
- Set the trigger threshold and monitoring interval
Scheduled Scale-out:
- Maintain the minimum number of instances within specified time periods
- The number of instances for scheduled scale-out must exceed the minimum number of instances set for automatic scaling.
- During scheduled periods, automatic scale-out can still occur but will not exceed the maximum instance limit

This mode is suitable for scenarios with predictable traffic patterns that require granular resource control, such as e-commerce promotions or scheduled task processing.

Manual Instance Start/Stop

In manual start/stop mode, instances are entirely manually controlled and do not scale automatically:

Instances do not automatically scale in due to lack of requests
Instances will not automatically scale out due to increased load
All instance modifications must be performed manually via the console or API

Operation Method:

Start Instance:
- Perform the "Start Instance" operation in the console version list
- Start instance via API call
- Each console operation starts one instance; starting multiple instances requires multiple operations.
Stop Instance:
- Perform the "Stop Instance" operation in the console instance list
- Stop instance via API call
- You can select specified instances to perform the stop operation

This mode is suitable for scenarios requiring full control over the instance lifecycle, such as development and testing environments or specific business requirements.

Instance resource usage

For manual instance start/stop, if you forget to stop the instance after starting it, unnecessary resource waste may occur, resulting in higher billing costs.

Scaling Out and In in Detail

Scaling Out Instances Process

Instance scaling consists of two stages:

From 0 to 1 Scaling (Cold Start):
- Trigger condition: The service receives a new request after being fully scaled in
- Start instance via API call
- Factors affecting time consumption: platform resource status, image size, business code startup time, etc.
- User experience: The first request may experience some delay
Scaling from 1 to Multiple Instances:
- Trigger condition: The average CPU/memory utilization of existing instances reaches or exceeds the configured scaling-out threshold.
- Process: Start a new instance and add it to the load balancer once ready
- Continuous monitoring: If the threshold is still exceeded after the detection cycle, additional scaling out will be performed
- Limitation: The number of instances does not exceed the configured maximum value

Instance Scaling Down Mechanism

Trigger condition: No access requests, no traffic flow, and idle for 10 minutes
Scale-in behavior: Reclaim and destroy idle instances
Minimum instances: When the minimum instance count is set to 0, scaling down to zero instances is possible
Subsequent requests: New requests after full scale-in will trigger the cold start process

Instance Termination Instructions

Instances will be completely destroyed after scale-in
Temporary files, in-memory data, etc. within the instance will be cleared together
Important: Ensure your application is stateless, and all persistent data should be stored in external storage services

Choose the Appropriate Operation Mode

Operation Mode	Applicable Scenarios	Cost Efficiency	Response Speed
Always Automatic Scaling	Large traffic fluctuations, unpredictable	Optimal	Possible cold start delay
Continuous Operation	Stable traffic, high availability requirements	Fixed, higher	Optimal (no cold start)
Daytime Continuous Operation with Nighttime Automatic Scaling	Work-hour intensive applications	Better	Optimal during work hours, possible delay during off-peak
Custom	Clear traffic patterns, requires fine-grained control	Optimizable	Can be optimized based on configuration
Manual Instance Start/Stop	Development and testing, special business requirements	Full control	Depends on timeliness of manual operations

Running Mode and Scaling

Operation Mode Overview​

Operation Mode Detailed Explanation​

Always Automatic Scaling​

Continuous Operation​

Daytime Continuous Operation with Nighttime Automatic Scaling​

Custom Icons​

Manual Instance Start/Stop​

Scaling Out and In in Detail​

Scaling Out Instances Process​

Instance Termination Instructions​

Choose the Appropriate Operation Mode​