Point-to-point communication between servers usually works just fine when one or two instances are communicating.
data:image/s3,"s3://crabby-images/1a88b/1a88bfcee87c68b29002aada158d37441f2d5456" alt=""
However, if you increase the number of applications, the total number of connections increases.
data:image/s3,"s3://crabby-images/ce981/ce981347001a7270988ca475a38e5fde4c05b554" alt=""
In fact, the total number of connections increases as a square of the number of application instances. So, if you are running 100 instances, you will be maintaining O(100^2) connections. The scaling becomes quite painful.
What can you do? This depends on your application but one strategy you may want to employ is using a message queue as an intermediary between systems. In this model, an application sends a single message to the message queue, and any interested listeners can read that message as they are able.
data:image/s3,"s3://crabby-images/f61e5/f61e5d57d4bea4f0ab7822b244a0eefd170df87f" alt=""
This design reduces the total number of messages sent, and provides de-coupling of each individual application, further increasing scalability.