GMP 5.0 is upwardly source and binary compatible with 4.x, and 3.x versions,
except for applications that use the semi-documented mpn_bdivmod
function.
Changes in GMP 5.0.1
BUGS FIXED
- Fat builds fixed.
- Fixed crash for huge multiplies when old FFT_TABLE2 type of parameter
selection tables' sentinel was smaller than multiplied operands.
- The solib numbers now reflect the removal of the documented but preliminary
mpn_bdivmod function; we correctly flag incompatibility with GMP 4.3.
GMP 5.0.0 has this wrong, and should perhaps be uninstalled to avoid
confusion.
SPEEDUPS
- Multiplication of large numbers has indirectly been sped up through
better FFT tuning and processor recognition. Since many operations
depend on multiplication, there will be a general speedup.
FEATURES
- More Core i3, i5 an Core i7 processor models are recognised.
- Fixes and workarounds for Mac OS quirks should make this GMP version
build using many of the different versions of "Xcode".
MISC
- The amount of scratch memory needed for multiplication of huge numbers
have been reduced substantially (but is still larger than in GMP 4.3.)
- Likewise, the amount of scratch memory needed for division of large
numbers have been reduced substantially.
- The FFT tuning code of tune/tuneup.c has been completely rewritten,
and new, large FFT parameter selection tables are provided for many
machines.
- Upgraded to the latest autoconf, automake, libtool.
Changes in GMP 5.0.0
BUGS FIXED
- None (contains the same fixes as release 4.3.2).
SPEEDUPS
- Multiplication has been overhauled:
- Multiplication of larger same size operands has been improved with
the addition of two new Toom functions and a new internal function
mpn_mulmod_bnm1 (computing U * V mod (B^n-1), B being the word base.
This latter function is used for the largest products, waiting for a
better Schoenhage-Strassen U * V mod (B^n+1) implementation.
- Likewise for squaring.
- Multiplication of different size operands has been improved with the
addition of many new Toom function, and by selecting underlying
functions better from the main multiply functions.
- Division and mod have been overhauled:
- Plain "schoolbook" division is reimplemented using faster quotient
approximation.
- Division Q = N/D, R = N mod D where both the quotient and remainder
are needed now runs in time O(M(log(N))). This is an improvement of
a factor log(log(N))
- Division where just the quotient is needed is now O(M(log(Q))) on
average.
- Modulo operations using Montgomery REDC form now take time O(M(n)).
- Exact division Q = N/D by means of mpz_divexact has been improved
for all sizes, and now runs in time O(M(log(N))).
- The function mpz_powm is now faster for all sizes. Its complexity has
gone from O(M(n)log(n)m) to O(M(n)m) where n is the size of the modulo
argument and m is the size of the exponent. It is also radically
faster for even modulus, since it now partially factors such modulus
and performs two smaller modexp operations, then uses CRT.
- The internal support for multiplication yielding just the lower n limbs
has been improved by using Mulders' algorithm.
- Computation of inverses, both plain 1/N and 1/N mod B^n have been
improved by using well-tuned Newton iterations, and wrap-around
multiplication using mpn_mulmod_bnm1.
- A new algorithm makes mpz_perfect_power_p asymptotically faster.
- The function mpz_remove uses a much faster algorithm, is better tuned,
and also benefits from the division improvements.
- Intel Atom and VIA Nano specific optimisations.
- Plus hundreds of smaller improvements and tweaks!
FEATURES
- New mpz function: mpz_powm_sec for side-channel quiet modexp
computations.
- New mpn functions: mpn_sqr, mpn_and_n, mpn_ior_n, mpn_xor_n, mpn_nand_n,
mpn_nior_n, mpn_xnor_n, mpn_andn_n, mpn_iorn_n, mpn_com, mpn_neg,
mpn_copyi, mpn_copyd, mpn_zero.
- The function mpn_tdiv_qr now allows certain argument overlap.
- Support for fat binaries for 64-bit x86 processors has been added.
- A new type, mp_bitcnt_t for bignum bit counts, has been introduced.
- Support for Windows64 through mingw64 has been added.
- The cofactors of mpz_gcdext and mpn_gcdext are now more strictly
normalised, returning to how GMP 4.2 worked. (Note that also release
4.3.2 has this change.)
MISC
- The mpn_mul function should no longer be used for squaring,
instead use the new mpn_sqr.
- The algorithm selection has been improved, the number of thresholds have
more than doubled, and the tuning and use of existing thresholds have
been improved.
- The tune/speed program can measure many of new functions.
- The mpn_bdivmod function has been removed. We do not consider this an
incompatible change, since the function was marked as preliminary.
- The testsuite has been enhanced in various ways.
The GMP 5 release would not have been possible without the very devoted work
of Niels Möller and Marco Bodrato. As usual, Torbjörn Granlund coordinated the
development and release, and did a fair amount of development work himself.
Please see the GMP manual
for a complete list of GMP contributors.
There is a public repository for GMP, please see
the GMP repository usage instructions for more
information.
Torbjörn's work on GMP is sponsored
by Stiftelsen för Strategisk
Forskning, through CIAM.
Please send comments about this page to gmp-discuss@gmplib.org
Copyright 2009, 2010 Free Software Foundation
Verbatim copying and distribution of this entire article is permitted in any medium, provided this notice is preserved.