Is this topic relevant? The model in their example is based on Chain instead of FastChaint.
Thanks for the additional information on TensorFlow. Maybe this is one of the reasons why people in PINNs use multiple neural networks for estimating each variable? Sum of Jacobian is clearly not the best approach.