> This particular macro is not currently problem free.  Let's compare
> lshift+sub_n and submul_1 on some machines:
>                  lshift+sub_n    submul_1
> athlon64           3.87            2.5
> core2              3.3             4.5
> pentium4/64        7.3            14.9
> ultrasparc 3       7.75           23
> power4/ppc970      5.0            10

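For anyone following along, here is a rough sketch of the two variants
being compared, assuming the macro in question subtracts a left-shifted
operand from an accumulator in place. This is not GMP code: the 32-bit
limb_t, the my_* helpers, and the sublsh_via_* names are all made up for
illustration, and the real mpn routines are assembly on most of the
machines listed. The point is only that a shift into scratch followed by
a subtract gives the same limbs and the same borrow-out as a single
submul_1-style pass with multiplier 2^cnt.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

typedef uint32_t limb_t;            /* hypothetical limb type, not mp_limb_t */
#define LIMB_BITS 32

/* rp[] = sp[] << cnt for 0 < cnt < LIMB_BITS; returns the bits shifted out. */
static limb_t my_lshift(limb_t *rp, const limb_t *sp, size_t n, unsigned cnt)
{
    limb_t carry = 0;
    for (size_t i = 0; i < n; i++) {
        limb_t s = sp[i];
        rp[i] = (s << cnt) | carry;
        carry = s >> (LIMB_BITS - cnt);
    }
    return carry;
}

/* rp[] = ap[] - bp[]; returns the borrow out (0 or 1). rp may alias ap. */
static limb_t my_sub_n(limb_t *rp, const limb_t *ap, const limb_t *bp, size_t n)
{
    limb_t borrow = 0;
    for (size_t i = 0; i < n; i++) {
        uint64_t d = (uint64_t)ap[i] - bp[i] - borrow;
        rp[i] = (limb_t)d;
        borrow = (limb_t)(d >> 63);  /* top bit is set iff the subtraction wrapped */
    }
    return borrow;
}

/* rp[] -= sp[] * v; returns the high limb that did not fit (borrow out). */
static limb_t my_submul_1(limb_t *rp, const limb_t *sp, size_t n, limb_t v)
{
    limb_t carry = 0;
    for (size_t i = 0; i < n; i++) {
        uint64_t p = (uint64_t)sp[i] * v + carry;
        limb_t lo = (limb_t)p;
        limb_t r = rp[i];
        carry = (limb_t)(p >> LIMB_BITS);
        rp[i] = r - lo;
        carry += (r < lo);           /* propagate the borrow from this limb */
    }
    return carry;
}

/* Variant 1: shift into scratch, then subtract (two passes over the data). */
static limb_t sublsh_via_shift(limb_t *rp, const limb_t *sp, size_t n,
                               unsigned cnt, limb_t *scratch)
{
    limb_t out = my_lshift(scratch, sp, n, cnt);
    return out + my_sub_n(rp, rp, scratch, n);
}

/* Variant 2: one multiply-and-subtract pass with multiplier 2^cnt. */
static limb_t sublsh_via_submul(limb_t *rp, const limb_t *sp, size_t n,
                                unsigned cnt)
{
    return my_submul_1(rp, sp, n, (limb_t)1 << cnt);
}
```

Which variant wins then comes down to whether two cheap passes beat one
multiply-based pass on a given CPU, which is what the table is measuring.
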
Have you incorporated my mpn_rshift improvements for Core2 (and sped
up lshift the same way)?  I haven't really been paying attention, since
I've been busy with other projects.  I was going to put some finishing
touches on the setup code, but I never got back to it.

