Distributed Array Optimization

I’ve got a distributed array split across a bunch of workers.

Each worker grabs portions of the distributed array (which creates subarrays)
they processes their local part while accessing those subarrays.
Two questions.

  1. will every access to the subarray create a network rpc? if so how would I avoid that? (ie: just copy the data locally)
  2. Is there a good way to profile the remote accesses in the darray?