Quiesce Mode
Overview
This feature adds support for "Quiesce state" (or maintenance mode) for a Processing Unit This is the unit of packaging and deployment in the GigaSpaces Data Grid, and is essentially the main GigaSpaces service. The Processing Unit (PU) itself is typically deployed onto the Service Grid. When a Processing Unit is deployed, a Processing Unit instance is the actual runtime entity.. If a space is quiesced, it rejects all data-related operations with an exception, allowing the user to perform maintenance operations.
The GigaSpaces Polling Container and Notify Container are also aware of the Space Where GigaSpaces data is stored. It is the logical cache that holds data objects in memory and might also hold them in layered in tiering. Data is hosted from multiple SoRs, consolidated as a unified data model.'s quiesce state, and handle quiesce state changed events by implementing QuiesceStateChangedListener.
In addition to the GigaSpaces event containers mentioned above, a user-defined Bean can also handle quiesce state changed events by implementing QuiesceStateChangedListener.
Quiesce is usually triggered using the ProcessingUnit API, and is propagated to all Space instances and quiesce state changed listeners in the Processing Unit.
Quiesce mode can be also triggered using a CLI (see Quiesce CLI).
Uses for Quiesce State
There are several scenarios that can benefit from quiesce state. Some of these are described below.
Safe Shutdown
Use Quiesce state to shut down an application that contains a data grid and a mirror in a safe manner. This ensures that all operations completed on the grid are flushed to the mirror before undeploy.
To perform a safe shutdown:
- Enter quiesce mode.
- Wait for replication between the space and the mirror to complete (by monitoring the redolog).
- Undeploy the space and mirror.
Rolling System Upgrade
Use Quiesce mode to perform a rolling system upgrade on a live system (typically a Processing Unit code upgrade) and avoid inconsistencies related to processing of requests during the upgrade process.
To perform a rolling system upgrade with Quiesce mode:
- Enter Quiesce mode.
- Perform the hot-upgrade pattern:
2.1. Update the Processing Unit in the GSM Grid Service Manager. This is is a service grid component that manages a set of Grid Service Containers (GSCs). A GSM has an API for deploying/undeploying Processing Units. When a GSM is instructed to deploy a Processing Unit, it finds an appropriate, available GSC and tells that GSC to run an instance of that Processing Unit. It then continuously monitors that Processing Unit instance to verify that it is alive, and that the SLA is not breached..
2.2. Restart the backup instances.
2.3. Restart the primary instances. - Exit Quiesce mode and verify that the upgraded system resumes and works as expected,
Review additional use case examples here.
Space Behavior During Quiesce Mode
The Space behaves as follows when it is in Quiesce mode:
- Rejects all user data-related new operations by throwing a QuiesceException back to the user (replication of previous ongoing operations continues).
- Commit and abort are allowed for two-phase commit transactions.
- Commit and prepare of single-phase transactions are rejected.
- The Lease manager is paused, except for extend lease operations, in order to prevent losing notify listener templates.
Security
In a secure system, Manage PU This is the unit of packaging and deployment in the GigaSpaces Data Grid, and is essentially the main GigaSpaces service. The Processing Unit (PU) itself is typically deployed onto the Service Grid. When a Processing Unit is deployed, a Processing Unit instance is the actual runtime entity.
privileges are required to quiesce/unquiesce a Processing Unit.
Configuration
The Quiesce feature can be disabled by setting com.gs.engine.disableQuiesceMode
to true.
If Quiesce Mode This feature adds support for "Quiesce state" (or maintenance mode) for a Processing Unit. If a space is quiesced, it rejects all data-related operations with an exception, allowing the user to perform maintenance operations. is disabled during Availability Biased, the resolution process may fail.
If Quiesce Mode is disabled, hot maintenance to the Space cannot be performed.
Limitations
The following limitations and open issues apply to Quiesce mode:
- The Processing Unit should be intact before triggering a quiesce request.
- Although Quiesce handles the Processing Unit restart, it is not resilient regarding sudden network disconnections (in GigaSpaces grid component machines). Therefore, under rare conditions the quiesce request must be repeated manually by the user.
- Quiesce state changed events are propagated only to the components (beans) that are located within the Processing Unit context file (pu.xml), therefore custom components are not aware of quiesce state changed events (even if the component implements QuiesceStateChangedListener).
- Replication of the Quiesce state between primary and backup GSM is not yet supported. GSM failover may result in losing the quiesce state of the system. If this happens, you must repeat the quiesce request in order to re-inform the GSM about the current quiesce state.
Processing Unit API
The following ProcessingUnit interface API is used to trigger and manage Quiesce Mode.
// Requests quiesce request from the GSM.
// All space instances and listeners will switch to quiesced mode.
// If the GSM rejects the request an exception with the rejection failure will be thrown.
QuiesceResult quiesce(QuiesceRequest request);
// Requests unquiesce request from the GSM.
// All space instances and listeners will switch to unquiesced mode.
// If the GSM rejects the request an exception with the rejection failure will be thrown.
void unquiesce(QuiesceRequest request);
// Return true if the processing unit reached to desired state as well as all instances in the requested timeout, false otherwise.
boolean waitFor(QuiesceState desiredState, long timeout, TimeUnit timeUnit);
boolean waitFor(QuiesceState desiredState);
// Return the quiesce details of the processing unit
QuiesceDetails getQuiesceDetails();
Quiesce State Changed Listener
A user-defined component (Spring Bean) in a Processing Unit can implement QuiesceStateChangedListener
and be aware of quiesce state changed events.
public class CustomComponent implements ... ,QuiesceStateChangedListener {
...
public void quiesceStateChanged(QuiesceStateChangedEvent event) {
if (event.getQuiesceState() == QuiesceState.QUIESCED)
// stop interacting with the space to prevent getting exceptions
else
// resume interacting with the space
}
}
Sample Use Cases
Safely Undeploy a Stateful Processing Unit
QuiesceRequest request = new QuiesceRequest("Jacob: performing safe shutdown in 11:33 AM");
QuiesceResult result = pu.quiesce(request);
boolean quiesced = pu.waitFor(QuiesceState.QUIESCED, OPERATION_TIMEOUT, TimeUnit.MINUTES);
if (quiesced) {
System.out.println("All instances are QUIESCED, shutting down...");
// wait for redo log to drop to zero
...
pu.undeployAndWait(OPERATION_TIMEOUT, TimeUnit.MILLISECONDS)
}
else {
System.out.println("All instances were not QUIESCED within the given timeout");
// Print QuiesceDetails to figure out which instances were not QUIESCED
System.out.println("Details: " + pu.getQuiesceDetails());
// retry or do some logic
...
}
Rolling Upgrade on a Live System
QuiesceRequest request = new QuiesceRequest("Jacob: performing hot deploy in 11:33 AM");
QuiesceResult result = pu.quiesce(request);
pu.waitFor(QuiesceState.QUIESCED, OPERATION_TIMEOUT, TimeUnit.MINUTES);
// wait for redo log to drop to zero
...
// upgrade the system, restart backups and then primaries
...
QuiesceRequest resumeRequest = new QuiesceRequest("Jacob: resumming the system in 12:14 AM");
pu.unquiesce(resumeRequest);
pu.waitFor(QuiesceState.UNQUIESCED, OPERATION_TIMEOUT, TimeUnit.MINUTES);
System.out.println("the system was successfully upgraded");
REST API
The following are examples of entering and exiting Quiesce mode using the REST API REpresentational State Transfer. Application Programming Interface An API, or application programming interface, is a set of rules that define how applications or devices can connect to and communicate with each other. A REST API is an API that conforms to the design principles of the REST, or representational state transfer architectural style..
Quiesce
curl -X POST --header 'Content-Type: application/json' --header 'Accept: text/plain' 'http://localhost:8090/v1/deployments/myApp/quiesce'
Unquiesce
curl -X POST --header 'Content-Type: application/json' --header 'Accept: text/plain' 'http://localhost:8090/v1/deployments/myApp/unquiesce'