Capability

Hot Config Reload (Zero-Downtime)

Change config. Keep connections. Where possible, every change is applied via soft reload; where unavoidable, a controlled hard restart fallback takes over.

TR7 Hot Config Reload aims to apply configuration changes to traffic-serving services without interrupting active sessions. Rules, certificates, backend pools, health checks, WAAP profiles, rate-limit policies and cache settings are among the changes that can be activated while live traffic continues to flow. TR7 applies soft reload through an independent management channel per pool. Each change is first analyzed: fields that can be applied at runtime go through soft reload, while creationConfig fields — service type, VIP, port, CPU, memory or namespace — trigger a controlled hard restart. The reload sequence follows a soft → hard fallback model. If soft reload fails, the system recovers via hard restart. Operators can see why any given change required a reload or a restart through the restartReasons output. The result: TR7 takes configuration changes out of the "restart the service and drop users" model and turns them into a pool-scoped, reason-transparent, controlled and operations-friendly reload flow.

Sessions dropped during a soft reload

5 min

Maximum job timeout for a pool edit operation

~12

Config field categories applicable via soft reload

When a configuration change cuts live traffic, every rule update becomes a maintenance window.

Configuration changes in the load balancing and security layer are unavoidable. New backends are added, WAAP rules are updated, certificates are renewed, health checks are adjusted, rate-limit policies are tightened, or new blocking rules are written in the middle of an attack. If every change requires a service restart, even small operational tasks turn into user session interruptions.

The classic restart model is especially problematic for TCP and long-lived connections. Active user sessions can be cut, file transfers can be left incomplete, API clients can receive errors, and a firewall rule change can cause a momentary access failure. As a result, teams start deferring necessary changes, and security and operational agility suffer.

The problem is most acute in WAAP and security rule sets. Waiting for a maintenance window to apply a rule update while a new attack pattern is actively being observed is not an acceptable position. Likewise, routine operations such as certificate renewal or backend pool expansion should not unnecessarily affect live traffic.

The right model is to distinguish between change types. Changes that can be applied at runtime — rules, certificates, health checks, backend lists and security profiles — should go through soft reload. Only changes that affect service creation parameters — VIP, port, namespace, resource limits or service type — should require a controlled restart.

TR7 Hot Config Reload makes this distinction: per-pool soft reload, automatic hard fallback, restart reason visibility and parallel operation flow apply configuration changes while minimising the impact on live traffic.

Our approach

TR7 designs configuration reload not as a single uniform restart operation, but as a layered operation flow that decides based on change type.

The pool management channel applies live reload commands independently

Each pool can receive a reload through its own management channel. This model ensures that a change in one pool is applied without affecting other pools.

Soft reload is tried first; restart fallback runs if it fails

TR7 uses the soft reload path that preserves live connections by default. If soft reload fails or the change requires a hard restart, the system switches to a controlled restart path.

Restart reason parsing gives the operator visibility into why

Config diff analysis determines which fields require a reload and which require a restart. Operators can see in advance whether a change can be applied live or involves a creationConfig modification that mandates a restart.

ADC and log operations run in parallel to reduce total edit time

Load balancing and log container operations can run in parallel when they are independent. This prevents pool edit time from being stretched by unnecessary sequential waits.

Capabilities

Hot Config Reload combines per-pool reload, change-type analysis, fallback, connection drain and parallel operations in a single management flow.

Soft reload applies config changes while preserving live connections

Soft reload activates the new configuration for eligible changes while allowing existing connections to complete naturally. The old worker enters drain mode and continues serving active sessions until they close. New connections are handled by the updated config. This approach removes the need to tie rule and certificate changes to a maintenance window.

Smart reload tries the soft path first and falls back to hard restart on failure

TR7 defaults to the soft reload path. If an error occurs during soft reload, the system recovers via hard restart fallback. This model offers operators a single safe flow: reload without downtime where possible, restart in a controlled manner where necessary. Failed reload attempts do not result in an indeterminate half-applied configuration state.

The forceHard option provides explicit restart for creation config changes

Some changes cannot be applied via live reload because they modify service creation parameters. Fields such as service type, CPU limit, memory limit, VIP, port, network namespace, image or binary version require a hard restart. In these cases, forceHard behavior makes the restart explicit and intentional. Operators can plan the impact of the change in advance.

Restart reason records make the impact of config changes visible

For each config field, the reason why a change requires a reload or restart can be held in the restartReasons list. This visibility answers the question "why does this change need a restart?" for the operations team. It makes decision-making clearer, particularly during change management and maintenance approval processes. Change impact stops being a black box.

Per-pool independent reload runs without affecting other services

Each pool is handled with its own runtime structure and management channel. While a WAAP profile, certificate or backend list changes in one pool, other pools continue running without interruption. This isolation narrows the blast radius of changes on large platforms. The operations team reloads only the relevant service, not the entire appliance.

Log container operations are managed independently from the load balancing layer

Log transport or log container configurations can be handled separately from the load balancing container. Changes on the log side therefore do not unnecessarily affect the traffic routing layer. Equally, a reload on the traffic side does not force the log pipeline to be interrupted. This separation gives more controlled change management across the platform.

Parallel execution reduces total edit time

If a change affects both the load balancing and log sides, eligible operations can run in parallel. Sequential waiting is reduced and total job time is shortened. This matters especially with large configurations or updates that affect multiple sub-components. The operation window is used more efficiently.

Connection drain during soft reload reduces the risk of session drops

When a soft reload is applied, the old worker is not killed immediately — existing connections are allowed to close naturally. New traffic is directed to the updated configuration while existing traffic completes its natural shutdown. This is critical for long-lived TCP connections and active user sessions. The goal is to avoid producing TCP resets or abrupt disconnections at the moment of the change.

Error counter-based automatic reload provides self-healing

When a specific error threshold is exceeded in pool statistics, an automatic reload can be triggered. This behavior helps transient runtime issues recover without waiting for manual intervention. For operators, this means an automatic health improvement signal for the service. Recurring reload causes should still be evaluated through monitoring and root cause analysis.

WAAP and Lua changes can be included in the soft reload scope

WAAP profile, whitelist, rule set and Lua script updates can be treated as live-reloadable changes. This enables rapid security policy updates during an attack and activates new logic without interrupting application traffic. Rule set changes are not tied to a full platform restart. This approach increases the agility of security operations.

Operational depth

Hot config reload operates together with the pool management path, command behavior, timeout, retry logic and the classification of which fields count as soft or hard changes.

Pool management socket

Each pool has a dedicated management socket. The reload command is sent to the management channel of the relevant pool. This structure is the foundation of per-pool independent reload behavior.

Reload command

Soft reload is triggered by a reload command sent to the management channel. If the command succeeds, the new config is activated and existing connections are protected by drain behavior. On failure, smart reload can apply a hard restart fallback.

Container naming model

Pool container names follow a consistent pattern linked to the poolId. This structure ensures that reload, restart, log cleanup and health check operations are applied to the correct container. Operation automation benefits from this deterministic naming.

Soft retry behavior

In the default model, soft reload is attempted once. If an exception or reload failure occurs, the system switches to the hard restart path. This approach avoids stretching operation time by repeatedly retrying a failing reload.

Job timeout limit

The total timeout for a pool edit job can be held at 5 minutes. This covers the full scope of log cleanup, reload and restart if necessary. Long-running jobs do not remain open in an indeterminate state on the operations screen.

Wait durations

Default waitBefore and waitAfter values can be run optimized at 0. This removes unnecessary fixed waits. Actual wait time is determined by the state of the operation and the system response.

When to use it

Push a new WAAP rule version without cutting live traffic

A new WAAP rule set or OWASP-based update can be included in the soft reload scope. Existing user sessions are preserved while the new security logic takes effect for incoming requests. There is no need to wait for a maintenance window during an active attack.

Renew a certificate without dropping existing connections

When a certificate approaching expiry is updated, the lbCertificates change can be applied via soft reload. New TLS connections use the updated certificate. Existing connections close naturally through drain behavior.

Expand the backend pool after autoscaling

New backend nodes can be added to the lbBackends list. After soft reload, new connections benefit from the expanded pool. Capacity is increased without affecting other pools or existing connections.

Tighten rate-limit policy rapidly during an attack

A new DDoS or rate-limit profile can be applied as a live config change. The policy takes effect on incoming requests within a short time. The operations team responds to the attack without creating additional downtime from a restart.

Frequently asked questions

Which config changes go through soft reload and which require a restart?

Runtime fields — rules (lbRules), backend lists (lbBackends), health checks (lbHealthChecks), certificates (lbCertificates), rate-limit/DDoS profiles, WAAP profiles and whitelists, cache, compression, session and timeout settings — fall within the soft reload scope. Fields that affect service creation parameters — service type, CPU/memory limits, VIP/port, network namespace or binary version — classify as creationConfig changes and trigger a controlled hard restart.

What happens if soft reload fails?

The smart reload logic engages: if soft reload fails, the system automatically takes the hard restart fallback path. This model prevents a failed reload attempt from leaving the service in an indeterminate or half-applied configuration state. Operators work through a single safe flow throughout.

What happens to active sessions during a soft reload?

The old worker is not terminated immediately. Connection drain behavior allows existing connections to close naturally. New connections are directed to the updated configuration while existing connections complete their natural shutdown. This matters for long-lived TCP connections and active user sessions.

Does reloading one pool affect other pools?

No. Each pool is handled independently with its own management channel and runtime structure. A change in one pool affects only that pool. This isolation narrows the blast radius of changes on large platforms and allows the operations team to work in a targeted way.

How can an operator see why a change requires a restart?

Config diff analysis records which fields can be applied via soft reload and which require a hard restart due to a creationConfig change in the restartReasons list. Operators can review this list to understand the impact and reason for a change before applying it. Change impact stops being a black box.

Does a WAAP rule set change require a maintenance window?

No. WAAP profile, whitelist and rule set updates fall within the soft reload scope. Existing user sessions are preserved while the new security logic is applied to incoming requests. Even during an active attack, a rule update does not need to wait for a maintenance window.

Configuration changes should not require a maintenance window

Apply WAAP rules, certificates, backend pools and rate-limit policies without dropping active sessions. Let's walk through a live setup in your own environment.