"there's little for diagnosing concurrency issues in production beyond analysing multiple stack traces, data from the ThreadMXBean and jstack -l for java.util.concurrent.locks"
"If you will be sending the same data component repeatedly, you can marshall that once and cache the marshalled form or a reference to it (ideally in a send buffer close to the wire) for improved scalability"