Awesome improvements! With the current code, bqsrtinv is measurably faster than sqrt for small sizes, but it eventually became slower. Is it evident why it becomes slower for large operands? -- Torbjörn Please encrypt, key id 0xC8601622