Likely GMP bug

  10% speedup.

A great speedup for an important function.

(It will be a slowdown for one obsolete platform, 64-bit Pentium 4.
There, a 64-bit right shift has a latency of 7 or 8 cycles depending on
where the count is located.  I will not lose sleep over this, in
particular as it will typically use asm.)

