I looked into this (it was really bugging me) and may have found an explanation - take this with a grain of salt because I'm not 100% sure (and I'd like to check this on my own HD rig). It seems to make sense though.
The TDM version of C1 is most likely incurring three samples of latency because of the inherent latency of the TDM bus (which, according to McDSP, is 3 samples). The plug-in itself most likely has zero samples (or minimal) internal latency.
The RTAS version, since it's first on the track, is not reflecting any latency from the TDM bus, as its processing requirements are being handled by the CPU. However, any native plug-in still has to deal with latency created by whatever playback buffer is selected, which would give the RTAS version of C1 a maximum latency of whatever the playback buffer is set to (which is likely much more than 3 samples).
Make sense? I think so but I'd like to look into it more. If anyone else has any insight it'd be more than welcome.