The Key in my opinion is in this sentence:
The register data in the response message are packed as two bytes per register (word), with the binary contents right justified within each byte. For each register, the first byte contains the high order bits (MSB) and the second contains the low order bits (LSB).
Try to read a simple register like 771 e772 are one word length