GMP «Arithmetic without limitations» GMP mpn anomalies
Last modified:2021-04-15 03:07


The purpose of these measurements is to highlight possible improvements of GMP's low-level functions. The comparisons might include C fallbacks; in such cases the result indicates the need for asm implementation.

This is a new page, and it will therefore have glitches:

  1. Results for notoriously inaccurate machines should either be suppressed or else be given greater tolerance (e.g. pentium4, bulldozers).
  2. The tolerance ranges might be too narrow in some cases.

system/abi under-performing
function
factor
(expected lower)
comparison
function
odn2/64: addlsh2_n 1.03 sublsh2_n
verm/64: addlsh2_n 1.15 addlsh_n
piri/64: addlsh2_n 1.27 addlsh_n
suri/64: addlsh2_n 1.27 addlsh_n
mati/64: addlsh2_n 1.34 addlsh_n
bwl/64: addlsh_n 1.07 addmul_1.3
sky/64: addlsh_n 1.08 addmul_1.3
bt1/64: addlsh_n 1.11 rsblsh_n
bt1/64: addlsh_n 1.12 addmul_1.3
bt2/64: addlsh_n 1.14 addmul_1.3
k10/64: addlsh_n 1.14 addmul_1.3
k8/64: addlsh_n 1.16 addmul_1.3
bd1/64: addlsh_n 1.50 addlsh2_n
gege/64: addlsh_n 1.53 addlsh1_n
bd2/64: addlsh_n 1.60 addlsh1_n
bt1/64: addlsh_n 1.70 addlsh2_n
bt2/64: addlsh_n 1.72 addlsh2_n
bd1/64: addlsh_n 1.73 addlsh1_n
bt1/64: addlsh_n 1.74 addlsh1_n
slm/64: addlsh_n 1.75 addlsh1_n
slm/64: addlsh_n 1.75 addlsh2_n
bt2/64: addlsh_n 1.84 addlsh1_n
element/64: addlsh_n 2.35 addlsh1_n
element/64: addlsh_n 2.35 addlsh2_n
beagle/32: addmul_1.3 0.84 addmul_2
bt1/64: addmul_1.3 1.11 submul_1.3
hwl/64: addmul_1.3 1.37 mul_1.3
parks/32: addmul_1.3 1.4 mul_1.3
bt2/64: addmul_2 2.16 addmul_1.3
sky/64: addmul_2 2.45 addmul_1.3
bwl/64: addmul_2 2.46 addmul_1.3
suri/64: copyd 1.15 copyi
piri/64: copyd 1.16 copyi
bt2/32: copyi 1.07 copyd
sky/64: copyi 1.08 copyd
bt1/64: mul_1.3 1.04 addmul_1.3
slm/32: mul_1.3 1.07 addmul_1.3
nanom4/64: mul_basecase 2.01 mullo_basecase
element/64: mul_basecase 2.21 mullo_basecase
k8/64: mullo_basecase 0.64 mul_basecase
bt1/64: mullo_basecase 0.73 mul_basecase
verm/32: mullo_basecase 0.73 mul_basecase
g5/mode64: mullo_basecase 0.74 mul_basecase
gege/32: mullo_basecase 0.77 mul_basecase
slm/32: mullo_basecase 0.82 mul_basecase
odxu4/32: mullo_basecase 0.83 mul_basecase
piri/32: mullo_basecase 0.83 mul_basecase
suri/32: mullo_basecase 0.83 mul_basecase
mati/32: mullo_basecase 0.84 mul_basecase
bt2/32: mullo_basecase 0.85 mul_basecase
bd1/32: mullo_basecase 0.86 mul_basecase
bd2/32: mullo_basecase 0.86 mul_basecase
cnr/32: mullo_basecase 0.87 mul_basecase
k10/32: mullo_basecase 0.87 mul_basecase
pnr/32: mullo_basecase 0.87 mul_basecase
nhm/32: mullo_basecase 0.88 mul_basecase
bd4/32: mullo_basecase 0.90 mul_basecase
sky/32: mullo_basecase 0.92 mul_basecase
k8/32: mullo_basecase 0.93 mul_basecase
odc1/32: mullo_basecase 0.93 mul_basecase
tinker/32: mullo_basecase 0.93 mul_basecase
wsm/32: mullo_basecase 0.93 mul_basecase
bt1/32: mullo_basecase 0.95 mul_basecase
element/32: mullo_basecase 0.96 mul_basecase
beagle/32: mullo_basecase 1.02 mul_basecase
ivygentoo32/32: mullo_basecase 1.07 mul_basecase
bwl/32: mullo_basecase 1.09 mul_basecase
parks/32: mullo_basecase 1.1 mul_basecase
sbr/32: mullo_basecase 1.12 mul_basecase
hwl/32: mullo_basecase 1.13 mul_basecase
nhm/64: redc_1 1.27 mul_basecase
wsm/64: redc_1 1.27 mul_basecase
g5/mode64: redc_1 1.29 mul_basecase
verm/64: redc_1 1.33 mul_basecase
slm/32: redc_1 1.38 mul_basecase
cnr/64: redc_1 1.39 mul_basecase
pnr/64: redc_1 1.40 mul_basecase
gege/32: redc_1 1.45 mul_basecase
k10/32: redc_1 1.45 mul_basecase
verm/32: redc_1 1.45 mul_basecase
piri/64: redc_1 1.48 mul_basecase
suri/64: redc_1 1.49 mul_basecase
mati/64: redc_1 1.53 mul_basecase
k8/32: redc_1 1.57 mul_basecase
mati/32: redc_1 1.60 mul_basecase
bt2/32: redc_1 1.61 mul_basecase
nhm/32: redc_1 1.64 mul_basecase
piri/32: redc_1 1.64 mul_basecase
bd2/32: redc_1 1.65 mul_basecase
pnr/32: redc_1 1.65 mul_basecase
bd1/32: redc_1 1.66 mul_basecase
bd4/32: redc_1 1.66 mul_basecase
suri/32: redc_1 1.66 mul_basecase
cnr/32: redc_1 1.68 mul_basecase
bt1/32: redc_1 1.71 mul_basecase
odc1/32: redc_1 1.76 mul_basecase
tinker/32: redc_1 1.77 mul_basecase
wsm/32: redc_1 1.81 mul_basecase
element/32: redc_1 1.82 mul_basecase
beagle/32: redc_1 1.92 mul_basecase
sky/32: redc_1 1.95 mul_basecase
ivygentoo32/32: redc_1 2.00 mul_basecase
bwl/32: redc_1 2.05 mul_basecase
sbr/32: redc_1 2.07 mul_basecase
parks/32: redc_1 2.1 mul_basecase
hwl/32: redc_1 2.10 mul_basecase
element/64: rsblsh1_n 2.11 addlsh1_n
verm/64: rsblsh2_n 1.12 rsblsh_n
piri/64: rsblsh2_n 1.27 rsblsh_n
suri/64: rsblsh2_n 1.27 rsblsh_n
mati/64: rsblsh2_n 1.35 rsblsh_n
g5/mode64: rsblsh2_n 1.44 addlsh2_n
element/64: rsblsh2_n 2.01 addlsh2_n
gege/64: rsblsh_n 1.53 rsblsh1_n
bt1/64: rsblsh_n 1.55 rsblsh2_n
bt1/64: rsblsh_n 1.58 rsblsh1_n
bd2/64: rsblsh_n 1.63 rsblsh1_n
bt2/64: rsblsh_n 1.72 rsblsh2_n
bd1/64: rsblsh_n 1.74 rsblsh1_n
slm/64: rsblsh_n 1.75 rsblsh1_n
slm/64: rsblsh_n 1.75 rsblsh2_n
bt2/64: rsblsh_n 1.83 rsblsh1_n
bt1/64: rsh1add_n 1.16 rsh1sub_n
verm/64: rsh1add_n 1.20 addlsh1_n
slm/64: rsh1add_n 1.33 addlsh1_n
tinker/32: rsh1add_n 1.34 addlsh1_n
beagle/32: rsh1add_n 1.36 addlsh1_n
verm/64: rsh1sub_n 1.23 addlsh1_n
slm/64: rsh1sub_n 1.33 addlsh1_n
tinker/32: rsh1sub_n 1.34 addlsh1_n
beagle/32: rsh1sub_n 1.36 addlsh1_n
g5/32: sbpi1_bdiv_r 1.25 mul_basecase
slm/32: sbpi1_bdiv_r 1.34 mul_basecase
g5/mode64: sbpi1_bdiv_r 1.35 mul_basecase
verm/32: sbpi1_bdiv_r 1.38 mul_basecase
gege/32: sbpi1_bdiv_r 1.40 mul_basecase
k10/32: sbpi1_bdiv_r 1.40 mul_basecase
nhm/64: sbpi1_bdiv_r 1.41 mul_basecase
wsm/64: sbpi1_bdiv_r 1.41 mul_basecase
k8/32: sbpi1_bdiv_r 1.52 mul_basecase
mati/32: sbpi1_bdiv_r 1.53 mul_basecase
cnr/64: sbpi1_bdiv_r 1.56 mul_basecase
pnr/64: sbpi1_bdiv_r 1.56 mul_basecase
cnr/32: sbpi1_bdiv_r 1.57 mul_basecase
nhm/32: sbpi1_bdiv_r 1.57 mul_basecase
pnr/32: sbpi1_bdiv_r 1.58 mul_basecase
piri/32: sbpi1_bdiv_r 1.59 mul_basecase
bd2/32: sbpi1_bdiv_r 1.60 mul_basecase
suri/32: sbpi1_bdiv_r 1.60 mul_basecase
bd1/32: sbpi1_bdiv_r 1.62 mul_basecase
bd4/32: sbpi1_bdiv_r 1.63 mul_basecase
bt2/32: sbpi1_bdiv_r 1.69 mul_basecase
element/32: sbpi1_bdiv_r 1.70 mul_basecase
bt1/32: sbpi1_bdiv_r 1.71 mul_basecase
wsm/32: sbpi1_bdiv_r 1.74 mul_basecase
tinker/32: sbpi1_bdiv_r 1.84 mul_basecase
beagle/32: sbpi1_bdiv_r 1.92 mul_basecase
sky/32: sbpi1_bdiv_r 1.92 mul_basecase
ivygentoo32/32: sbpi1_bdiv_r 1.98 mul_basecase
parks/32: sbpi1_bdiv_r 2 mul_basecase
sbr/32: sbpi1_bdiv_r 2.02 mul_basecase
bwl/32: sbpi1_bdiv_r 2.08 mul_basecase
hwl/32: sbpi1_bdiv_r 2.11 mul_basecase
bd1/32: sqr_basecase 0.65 mul_basecase
bd2/32: sqr_basecase 0.65 mul_basecase
k10/32: sqr_basecase 0.65 mul_basecase
k10/64: sqr_basecase 0.65 mul_basecase
odc1/32: sqr_basecase 0.65 mul_basecase
bd4/32: sqr_basecase 0.66 mul_basecase
gege/32: sqr_basecase 0.66 mul_basecase
k8/64: sqr_basecase 0.66 mul_basecase
nhm/32: sqr_basecase 0.66 mul_basecase
slm/32: sqr_basecase 0.66 mul_basecase
beagle/32: sqr_basecase 0.68 mul_basecase
bwl/32: sqr_basecase 0.68 mul_basecase
ivygentoo32/32: sqr_basecase 0.68 mul_basecase
g5/32: sqr_basecase 0.69 mul_basecase
g5/mode32: sqr_basecase 0.69 mul_basecase
hwl/32: sqr_basecase 0.69 mul_basecase
sky/32: sqr_basecase 0.69 mul_basecase
cnr/32: sqr_basecase 0.70 mul_basecase
pnr/32: sqr_basecase 0.70 mul_basecase
sbr/32: sqr_basecase 0.70 mul_basecase
wsm/32: sqr_basecase 0.70 mul_basecase
element/32: sqr_basecase 0.72 mul_basecase
piri/32: sqr_basecase 0.72 mul_basecase
suri/32: sqr_basecase 0.73 mul_basecase
parks/32: sqr_basecase 0.75 mul_basecase
k8/32: sqr_basecase 0.76 mul_basecase
gege/64: sqr_basecase 0.81 mul_basecase
bt1/32: sqr_basecase 0.82 mul_basecase
bt1/64: sub_n 1.25 add_n
element/32: submul_1.3 1.26 addmul_1.3
ivygentoo32/32: submul_1.3 1.27 addmul_1.3
cnr/32: submul_1.3 1.31 addmul_1.3
pnr/32: submul_1.3 1.31 addmul_1.3
verm/64: submul_1.3 1.40 addmul_1.3