About all I can offer is some suggestions. You may have already tried these.
See if you can recreate the resistance issue on a test setup (not your working equipment).
We can do a test setup on a system that has been down for years. It would work nicely for that. As for what is failing, on the night that I described, we received the report that the network was down, but everything was running when we got up there. But it's no problem shutting down a network for a look-see if we aren't running production. So we went ahead and did this and that's when we found the 141 ohms. So it was surely on its way out.
Define what is failing as precisely as you can. For example, no data received or sent? By one device or more? Does it happen on random devices or do a few devices fail repeatedly? If it's the latter, is there anything different about these devices (cable length?) Does the problem go away on its own sometimes besides the two "walk bys"?
It's largely limited to five networks, and the usual fix is to take the network down and using an ohmmeter, monitor the resistance as we open and close connectors. We had been looking at it with a scope, but it became apparent that a ohmmeter worked better. What appears to be happening is the ControlBoss connectors are coming loose and creating either a high-impedance junction, or in extreme cases, complete loss of one of the CAN signals or the Network power that goes along with it.
Sometimes the fault is announced by a "Bus Off" condition (essentially a heart attack for a CAN network), any other time it would be some or all of the devices on the network disappearing.
The cabling was done using factory made cables. And while that might sound attractive, you end up having to do something with the excess cables. In our case, they are looped back and stuffed in the cable tray. As you know, excess length in a CAN network is never a good idea. That and the minimum bending radius of the cable is ignored to get it folded back (these are 8 conductor cables).
Because of the cable size (approximately 5/8" - 3/4" in diameter), the weight and stiffness can cause connectors to come loose.
As for failures based on cable length, there seems to be no difference based on length. Any cable can fail, it really makes no difference.
And for the problem going away on its own, we never seem to have the time to wait, so I'll never know.
Can you monitor the signal levels and noise?
We can monitor it through a few different methods. We can either use a DeviceNet monitor that does a good job of tracking numerous different types of faults (rise time, noise, voltages in regard to ground and in regard to the other CAN signal, along with about twenty other different things). But it falls short on an accurate representation of the waveform. For that, I've got a Rigol DS1102E that I can connect to my laptop to collect data in real time.
I'm familiar with our normal noisy signals. It would be interesting to see what's different when this failure occurs.
The bad thing is that we never know when or where this is going to happen. And where...
Thanks!
John