From: David Miller <davem at davemloft.net> Date: Thu, 07 Mar 2013 01:06:55 -0500 (EST) > I'll test your routines with the obvious fix in a moment. With the one-liner fix both of your new implementations work. submul_1 is now much better, about 5.8 cycles per limb on T4.