Introduction
Apache Geode is an in-memory data grid that provides real-time, consistent access to data-intensive applications throughout widely distributed cloud architectures. Its many features include fault-tolerance, high-availability, and WAN replication.
The multi-site or WAN topology is used to connect distinct clusters, they act as one distributed system when they are coupled, and they act as independent systems when communication between sites fails.
The Command Pattern is a behavioral design pattern in which an object is used to encapsulate all information needed to perform an action or trigger an event.
Why?
Gateway Senders are responsible for transmitting events from one site to another and, by default, only the following operations are replicated across WAN sites:
- Entry put.
- Entry create.
- Entry distributed destroy (not originated through expiration).
Sometimes we want to replicate more than that, specifically (but not limited to) notifying the remote sites about Region Events (like destroy or clear) that happened on the local cluster. In order to do so, we can implement the Command Region Pattern.
I’ve seen this pattern successfully and widely applied in a lot of projects over the years but haven’t found a single point of reference about it, now is a good time to share some examples and reasoning behind the pattern itself.
How?
The idea is pretty straightforward: a custom distributedCommand region will be used for distribution purposes, which will have a DistributedCommandCacheWriter attached so we can execute the commands locally. The DistributedCommand Region itself will exist on all clusters and the commands will be sent across the wire through Geode gateway-senders so remote clusters can receive them through Geode gateway-receivers and execute the unit of work as well.
- The client application creates a DistributedCommand instance and executes a put operation through the PROXY region.
- The CacheWriter is invoked and the DistributedCommand is executed locally.
- The DistributedCommand is added to the gateway-sender queue for distribution to remote clusters.
- The DistributedCommand is distributed and received by a gateway-receiver on a remote cluster.
- The DistributedCommand reaches a remote server, where the CacheWriter is invoked and the DistributedCommand is executed locally.
Implementation
-
Create a DistributedCommand interface.
public interface DistributedCommand { void execute(); default String getName() { return this.getClass().getSimpleName(); } }
-
Create as many implementations of the DistributedCommand interface as you need. Below is a dummy example, which does nothing but print “Hello World from ${clientId}!” in the logs.
public class HelloWorldCommand implements DistributedCommand, Serializable { private final static transient Logger logger = LogService.getLogger(); private String clientId; @Override public void execute() { logger.info("Hello World from {}!.", clientId); } }
-
Create a DistributedCommandCacheWriter, its only purpose is to execute the received command.
public class DistributedCommandCacheWriter extends CacheWriterAdapter<Long, DistributedCommand> implements Declarable { private final static transient Logger logger = LogService.getLogger(); @Override public void initialize(Cache cache, Properties properties) { } @Override public void beforeCreate(EntryEvent<Long, DistributedCommand> event) throws CacheWriterException { DistributedCommand distributedCommand = event.getNewValue(); logger.info("Executing distributed command {}...", distributedCommand.getName()); distributedCommand.execute(); logger.info("Executing distributed command {}... Done!.", distributedCommand.getName()); } }
-
Start two clusters (1 locator and 1 server each) for testing purposes.
gfsh> start locator --name=cluster1-locator --port=10334 --J=-Dgemfire.remote-locators=localhost[11334] --J=-Dgemfire.distributed-system-id=1 --J=-Dgemfire.jmx-manager-start=true --J=-Dgemfire.jmx-manager-port=1080 --J=-Dgemfire.jmx-manager-http-port=0 gfsh> start server --name=cluster1-server --locators=localhost[10334] --server-port=40401 gfsh> start locator --name=cluster2-locator --port=11334 --J=-Dgemfire.remote-locators=localhost[10334] --J=-Dgemfire.distributed-system-id=2 --J=-Dgemfire.jmx-manager-start=true --J=-Dgemfire.jmx-manager-port=1090 --J=-Dgemfire.jmx-manager-http-port=0 gfsh> start server --name=cluster2-server --locators=localhost[11334] --server-port=40402
-
Deploy the Commands and CacheWriter to both clusters (hint: use the gfsh deploy command).
-
Connected to cluster1, create the region and the gateway-sender.
gfsh> connect --locator=localhost[10334] gfsh> create gateway-sender --id=sender1 --remote-distributed-system-id=2 gfsh> create region --name=distributedCommand --type=REPLICATE_PROXY --gateway-sender-id=sender1 --cache-writer=org.apache.geode.tools.command.internal.DistributedCommandCacheWriter
-
Connected to cluster2, create the region and the gateway-receiver.
gfsh> connect --locator=localhost[11334] gfsh> create gateway-receiver gfsh> create region --name=distributedCommand --type=REPLICATE_PROXY --cache-writer=org.apache.geode.tools.command.internal.DistributedCommandCacheWriter
That’s it, you’re ready to start replicating your units of work across clusters!.
Example
The following client application simply connects to cluster1 and inserts the HelloWorldCommand.
public class TestClass {
public static void main(String[] args) {
ClientCacheFactory clientCacheFactory = new ClientCacheFactory()
.addPoolLocator("localhost", 10334);
ClientCache clientCache = clientCacheFactory.create();
Region<Long, DistributedCommand> region = clientCache
.<Long, DistributedCommand>createClientRegionFactory(ClientRegionShortcut.PROXY)
.create("distributedCommand");
region.put(1L, new HelloWorldCommand("TestApplication1"));
}
}
It’s easy to see in the logs from both servers (cluster1-server and cluster2-server) that the command was received and correctly executed by both clusters.
[info 2020/04/10 13:57:24.341 IST <ServerConnection on port 40401 Thread 2> tid=0x51] Executing distributed command HelloWorldCommand...
[info 2020/04/10 13:57:24.341 IST <ServerConnection on port 40401 Thread 2> tid=0x51] Hello World from TestApplication1!.
[info 2020/04/10 13:57:24.341 IST <ServerConnection on port 40401 Thread 2> tid=0x51] Executing distributed command HelloWorldCommand... Done!.
-------------------------------------------------------------------
[info 2020/04/10 13:57:24.873 IST <ServerConnection on port 5152 Thread 1> tid=0x42] Executing distributed command HelloWorldCommand...
[info 2020/04/10 13:57:24.873 IST <ServerConnection on port 5152 Thread 1> tid=0x42] Hello World from TestApplication1!.
[info 2020/04/10 13:57:24.873 IST <ServerConnection on port 5152 Thread 1> tid=0x42] Executing distributed command HelloWorldCommand... Done!.
What’s Next?
Check out geode-command-region-pattern and play around with it, it allows you to test your commands in a distributed fashion using the geode-dunit module.
There are several other really useful things that can be done through the usage of this pattern, like distributing a Region.destroy() or Region.clear() operation, execute a transaction on the remote cluster, the possibilities are endless, give it a try!
Looking for other interesting use cases?, check the following articles:
- Geode Distributed Sequences
- Publishing Apache Geode Metrics to Wavefront
- Ingest, Store and Search JSON data with Apache Kafka and Apache Geode
Need help with a complex problem or want to validate your solution?, share some details with the users lists.