GMP «Arithmetic without limitations» GMP mpn anomalies
Last modified:2018-08-20 22:33


The purpose of these measurements is to highlight possible improvements of GMP's low-level functions. The comparisons might include C fallbacks; in such cases the result indicates the need for asm implementation.

This is a new page, and it will therefore have glitches:

  1. Results for notoriously inaccurate machines should either be suppressed or else be given greater tolerance (e.g. pentium4, bulldozers).
  2. The tolerance ranges might be too narrow in some cases.

system/abi under-performing
function
factor
(expected lower)
comparison
function
piri/64: addlsh2_n 1.26 addlsh_n
zen/64: addlsh2_n 1.29 addlsh_n
bwl/64: addlsh_n 1.06 addmul_1.3
paddy/64: addlsh_n 1.06 addmul_1.3
slm/64: addlsh_n 1.06 addmul_1.3
sky/64: addlsh_n 1.07 addmul_1.3
bobcat/64: addlsh_n 1.10 addmul_1.3
jag/64: addlsh_n 1.10 addmul_1.3
king/64: addlsh_n 1.14 addmul_1.3
k8/64: addlsh_n 1.16 addmul_1.3
gege/64: addlsh_n 1.54 addlsh1_n
pile/64: addlsh_n 1.61 addlsh1_n
jag/64: addlsh_n 1.69 addlsh2_n
bobcat/64: addlsh_n 1.74 addlsh1_n
tutu/64: addlsh_n 1.74 addlsh1_n
jag/64: addlsh_n 1.83 addlsh1_n
slm/64: addlsh_n 2.03 addlsh1_n
slm/64: addlsh_n 2.03 addlsh2_n
element/64: addlsh_n 2.35 addlsh1_n
element/64: addlsh_n 2.35 addlsh2_n
beagle/32: addmul_1.3 0.84 addmul_2
hwl/64: addmul_1.3 1.38 mul_1.3
glm/32: addmul_1.3 1.39 mul_1.3
jag/64: addmul_2 2.07 addmul_1.3
bobcat/64: addmul_2 2.10 addmul_1.3
paddy/64: addmul_2 2.44 addmul_1.3
bwl/64: addmul_2 2.45 addmul_1.3
sky/64: addmul_2 2.45 addmul_1.3
piri/64: copyd 1.17 copyi
g5/mode32: copyi 1.05 copyd
exca/32: copyi 1.06 copyd
bobcat/32: copyi 1.11 copyd
slm/32: mul_1.3 1.07 addmul_1.3
slm/64: mul_1.3 1.10 addmul_1.3
element/64: mul_basecase 2.21 mullo_basecase
k8/64: mullo_basecase 0.64 mul_basecase
gcc2-power8/mode64: mullo_basecase 0.71 mul_basecase
gcc1-power7/mode64: mullo_basecase 0.72 mul_basecase
g5/mode64: mullo_basecase 0.74 mul_basecase
gege/32: mullo_basecase 0.79 mul_basecase
slm/32: mullo_basecase 0.81 mul_basecase
glm/32: mullo_basecase 0.82 mul_basecase
king/32: mullo_basecase 0.82 mul_basecase
odxu4/32: mullo_basecase 0.83 mul_basecase
zen/32: mullo_basecase 0.85 mul_basecase
jag/32: mullo_basecase 0.86 mul_basecase
pile/32: mullo_basecase 0.86 mul_basecase
piri/32: mullo_basecase 0.86 mul_basecase
tutu/32: mullo_basecase 0.87 mul_basecase
cnr/32: mullo_basecase 0.88 mul_basecase
pnr/32: mullo_basecase 0.88 mul_basecase
nhm/32: mullo_basecase 0.90 mul_basecase
exca/32: mullo_basecase 0.91 mul_basecase
paddy/32: mullo_basecase 0.91 mul_basecase
element/32: mullo_basecase 0.92 mul_basecase
odc1/32: mullo_basecase 0.93 mul_basecase
sky/32: mullo_basecase 0.94 mul_basecase
k8/32: mullo_basecase 0.95 mul_basecase
parks/32: mullo_basecase 0.97 mul_basecase
bobcat/32: mullo_basecase 0.98 mul_basecase
tambo/32: mullo_basecase 1 mul_basecase
beagle/32: mullo_basecase 1.02 mul_basecase
ivygentoo32/32: mullo_basecase 1.08 mul_basecase
sbr/32: mullo_basecase 1.11 mul_basecase
bwl/32: mullo_basecase 1.13 mul_basecase
hwl/32: mullo_basecase 1.15 mul_basecase
nhm/64: redc_1 1.26 mul_basecase
g5/mode64: redc_1 1.30 mul_basecase
slm/32: redc_1 1.37 mul_basecase
pnr/64: redc_1 1.39 mul_basecase
cnr/64: redc_1 1.40 mul_basecase
gege/32: redc_1 1.43 mul_basecase
king/32: redc_1 1.46 mul_basecase
piri/64: redc_1 1.50 mul_basecase
zen/64: redc_1 1.50 mul_basecase
glm/32: redc_1 1.54 mul_basecase
k8/32: redc_1 1.58 mul_basecase
tambo/32: redc_1 1.6 mul_basecase
jag/32: redc_1 1.62 mul_basecase
piri/32: redc_1 1.65 mul_basecase
nhm/32: redc_1 1.66 mul_basecase
tutu/32: redc_1 1.66 mul_basecase
zen/32: redc_1 1.66 mul_basecase
pile/32: redc_1 1.67 mul_basecase
bobcat/32: redc_1 1.69 mul_basecase
pnr/32: redc_1 1.69 mul_basecase
cnr/32: redc_1 1.70 mul_basecase
exca/32: redc_1 1.71 mul_basecase
element/32: redc_1 1.76 mul_basecase
odc1/32: redc_1 1.76 mul_basecase
sky/32: redc_1 1.91 mul_basecase
beagle/32: redc_1 1.93 mul_basecase
paddy/32: redc_1 1.94 mul_basecase
parks/32: redc_1 2 mul_basecase
ivygentoo32/32: redc_1 2.01 mul_basecase
bwl/32: redc_1 2.05 mul_basecase
sbr/32: redc_1 2.07 mul_basecase
hwl/32: redc_1 2.11 mul_basecase
k8/64: rsblsh1_n 1.25 addlsh1_n
element/64: rsblsh1_n 2.12 addlsh1_n
piri/64: rsblsh2_n 1.27 rsblsh_n
zen/64: rsblsh2_n 1.28 rsblsh_n
element/64: rsblsh2_n 2.00 addlsh2_n
gege/64: rsblsh_n 1.54 rsblsh1_n
pile/64: rsblsh_n 1.62 rsblsh1_n
jag/64: rsblsh_n 1.71 rsblsh2_n
bobcat/64: rsblsh_n 1.75 rsblsh1_n
tutu/64: rsblsh_n 1.75 rsblsh1_n
jag/64: rsblsh_n 1.84 rsblsh1_n
slm/64: rsblsh_n 2.03 rsblsh1_n
slm/64: rsblsh_n 2.03 rsblsh2_n
slm/64: rsh1add_n 1.32 addlsh1_n
glm/64: rsh1add_n 1.33 addlsh1_n
beagle/32: rsh1add_n 1.36 addlsh1_n
glm/64: rsh1sub_n 1.33 addlsh1_n
slm/64: rsh1sub_n 1.33 addlsh1_n
beagle/32: rsh1sub_n 1.36 addlsh1_n
zen/64: sbpi1_bdiv_r 1.26 mul_basecase
g5/32: sbpi1_bdiv_r 1.28 mul_basecase
g5/mode64: sbpi1_bdiv_r 1.34 mul_basecase
slm/32: sbpi1_bdiv_r 1.35 mul_basecase
king/32: sbpi1_bdiv_r 1.40 mul_basecase
gege/32: sbpi1_bdiv_r 1.42 mul_basecase
nhm/64: sbpi1_bdiv_r 1.43 mul_basecase
k8/32: sbpi1_bdiv_r 1.51 mul_basecase
glm/32: sbpi1_bdiv_r 1.52 mul_basecase
nhm/32: sbpi1_bdiv_r 1.57 mul_basecase
pnr/64: sbpi1_bdiv_r 1.57 mul_basecase
cnr/32: sbpi1_bdiv_r 1.59 mul_basecase
cnr/64: sbpi1_bdiv_r 1.59 mul_basecase
pile/32: sbpi1_bdiv_r 1.59 mul_basecase
pnr/32: sbpi1_bdiv_r 1.59 mul_basecase
tutu/32: sbpi1_bdiv_r 1.59 mul_basecase
tambo/32: sbpi1_bdiv_r 1.6 mul_basecase
piri/32: sbpi1_bdiv_r 1.60 mul_basecase
zen/32: sbpi1_bdiv_r 1.60 mul_basecase
jag/32: sbpi1_bdiv_r 1.61 mul_basecase
exca/32: sbpi1_bdiv_r 1.62 mul_basecase
bobcat/32: sbpi1_bdiv_r 1.67 mul_basecase
element/32: sbpi1_bdiv_r 1.68 mul_basecase
paddy/32: sbpi1_bdiv_r 1.89 mul_basecase
parks/32: sbpi1_bdiv_r 1.9 mul_basecase
sky/32: sbpi1_bdiv_r 1.90 mul_basecase
beagle/32: sbpi1_bdiv_r 1.92 mul_basecase
ivygentoo32/32: sbpi1_bdiv_r 1.97 mul_basecase
sbr/32: sbpi1_bdiv_r 2.02 mul_basecase
bwl/32: sbpi1_bdiv_r 2.06 mul_basecase
hwl/32: sbpi1_bdiv_r 2.10 mul_basecase
odc1/32: sqr_basecase 0.65 mul_basecase
slm/32: sqr_basecase 0.65 mul_basecase
g5/mode32: sqr_basecase 0.66 mul_basecase
gege/32: sqr_basecase 0.66 mul_basecase
glm/64: sqr_basecase 0.66 mul_basecase
k8/64: sqr_basecase 0.66 mul_basecase
king/64: sqr_basecase 0.66 mul_basecase
nhm/32: sqr_basecase 0.66 mul_basecase
tutu/32: sqr_basecase 0.66 mul_basecase
bwl/32: sqr_basecase 0.67 mul_basecase
exca/32: sqr_basecase 0.67 mul_basecase
gcc115/64: sqr_basecase 0.67 mul_basecase
gcc115/64: sqr_basecase 0.67 mul_basecase
beagle/32: sqr_basecase 0.68 mul_basecase
glm/32: sqr_basecase 0.68 mul_basecase
ivygentoo32/32: sqr_basecase 0.68 mul_basecase
paddy/32: sqr_basecase 0.68 mul_basecase
sky/32: sqr_basecase 0.68 mul_basecase
hwl/32: sqr_basecase 0.69 mul_basecase
sbr/32: sqr_basecase 0.69 mul_basecase
gcc1-power7/mode64: sqr_basecase 0.70 mul_basecase
pnr/32: sqr_basecase 0.70 mul_basecase
cnr/32: sqr_basecase 0.71 mul_basecase
element/32: sqr_basecase 0.72 mul_basecase
zen/32: sqr_basecase 0.72 mul_basecase
g5/32: sqr_basecase 0.73 mul_basecase
parks/32: sqr_basecase 0.73 mul_basecase
piri/32: sqr_basecase 0.73 mul_basecase
tambo/32: sqr_basecase 0.73 mul_basecase
gcc2-power8/mode64: sqr_basecase 0.75 mul_basecase
k8/32: sqr_basecase 0.76 mul_basecase
gege/64: sqr_basecase 0.82 mul_basecase
element/32: submul_1.3 1.26 addmul_1.3
hwl/32: submul_1.3 1.27 addmul_1.3
cnr/32: submul_1.3 1.32 addmul_1.3
pnr/32: submul_1.3 1.32 addmul_1.3