Add int.__length__ method

storchaka · January 13, 2024, 7:34am

There are algorithms that depend on the number of bits per digit being divisible by 5.

Also, the marshal format uses 15-bit digits. It is easy and efficient to split a 30-bit digit into two 15-bit digits.
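As a rough illustration of why that split is cheap, here is a minimal sketch: one mask and one shift per digit. The names `split_digit30` and `join_digits15` are hypothetical, and this is not CPython's actual marshal code.

```python
MASK15 = (1 << 15) - 1  # low 15 bits set


def split_digit30(digit):
    """Split one 30-bit digit into (low, high) 15-bit digits."""
    if not 0 <= digit < (1 << 30):
        raise ValueError("expected a value that fits in 30 bits")
    return digit & MASK15, digit >> 15


def join_digits15(low, high):
    """Inverse of split_digit30: rebuild the 30-bit digit."""
    return (high << 15) | low


low, high = split_digit30(0x3FFFFFFF)  # largest 30-bit digit
print(low, high)                       # 32767 32767
```

Because 30 is an exact multiple of 15, no bits straddle a digit boundary, so each 30-bit digit can be handled independently.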

Stefan2 · January 15, 2024, 7:14pm

Which algorithms? And should that maybe be mentioned in longintrepr.h (which mentions marshal and a few other things)?

tim.one · January 15, 2024, 7:37pm

Just noting that gmpy2 (the popular Python wrapper for the GMP library) already supports len() for its integer type:

>>> import gmpy2
>>> for i in range(17):
...     print(i, len(gmpy2.mpz(i)))
0 1
1 1
2 2
3 2
4 3
5 3
6 3
7 3
8 4
9 4
10 4
11 4
12 4
13 4
14 4
15 4
16 5

So it’s the bit length, except that, unlike the .bit_length() method, len() returns 1 for 0 instead of .bit_length()'s 0:

>>> z = gmpy2.mpz(0)
>>> len(z), z.bit_length()
(1, 0)
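For readers without gmpy2 installed, the outputs shown above for 0 through 16 can be reproduced with plain ints. This is only a sketch that matches those values; `mpz_style_len` is a hypothetical name, and taking the absolute value for negative inputs is an assumption, not something shown in this thread.

```python
def mpz_style_len(n):
    """Bit length of n, but at least 1 (so 0 maps to 1, not 0)."""
    # abs() for negatives is an assumption; the thread only shows 0..16.
    return max(1, abs(n).bit_length())


for i in (0, 1, 2, 4, 8, 16):
    print(i, mpz_style_len(i), i.bit_length())
```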

hansgeunsmeyer · January 16, 2024, 12:52am

gmpy2 also happens to have an “mpz.num_digits” method (base 10 by default):

>>> import gmpy2
>>> gmpy2.mpz(0xFFFF).num_digits()
5
>>> gmpy2.mpz(0xFFFF).num_digits(16)
4
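A pure-Python sketch of the same idea, counting digits in an arbitrary base by repeated integer division. `count_digits` is a hypothetical helper, not gmpy2's implementation, and it is quadratic for very large values.

```python
def count_digits(n, base=10):
    """Number of digits of abs(n) in the given base (0 has one digit)."""
    if base < 2:
        raise ValueError("base must be at least 2")
    n = abs(n)
    count = 1
    while n >= base:
        n //= base  # drop the lowest digit
        count += 1
    return count


print(count_digits(0xFFFF))      # 5
print(count_digits(0xFFFF, 16))  # 4
```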

vovavili · January 16, 2024, 1:01am

I would imagine this to be faster, though less readable:

import math

n = 12345  # example value; must be nonzero, since log10(0) raises ValueError
print(int(math.log10(abs(int(n)))) + 1)

MegaIng · January 16, 2024, 1:06am

If you read a few more messages, you'll see that this can give wrong, platform-dependent results: Add int.__length__ method - #11 by mdickinson

Stefan2 · January 16, 2024, 1:39pm

Mark Dickinson:

>>> import math
>>> digit_length1 = lambda n: len(str(n))
>>> digit_length2 = lambda n: 1 + math.floor(math.log10(n))
>>> n = 10**15 - 1
>>> digit_length1(n) == digit_length2(n)
False

That example could be read as showing only that the larger formula is wrong, and one would have to think that through. So I prefer to point out that math.log10(10**15) == math.log10(10**15 - 1), which shows that log10 itself already can’t distinguish these two numbers, even though they have different lengths.
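One way to sidestep the rounding problem is to use the float only as a rough starting guess and then correct it with exact integer comparisons. The following is a minimal sketch under that assumption; `digit_length` is a hypothetical name, not a proposed API.

```python
import math


def digit_length(n):
    """Exact count of decimal digits in abs(n); 0 counts as one digit."""
    n = abs(n)
    if n == 0:
        return 1
    # Rough estimate from the bit length; may be off by a little.
    k = int(n.bit_length() * math.log10(2))
    # Correct the estimate exactly: we want 10**(k-1) <= n < 10**k.
    while 10**k <= n:
        k += 1
    while k > 1 and 10**(k - 1) > n:
        k -= 1
    return k


print(digit_length(10**15 - 1))  # 15
print(digit_length(10**15))      # 16
```

The final answer depends only on the integer comparisons, so it does not rely on how the platform rounds log10.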

blhsing · February 21, 2024, 8:28am

Mark Dickinson:

I can fix that! Ignoring the behaviour of zeros and negative values, on my macOS 13.6 / Intel machine I see this:

>>> import math
>>> digit_length1 = lambda n: len(str(n))
>>> digit_length2 = lambda n: 1 + math.floor(math.log10(n))
>>> n = 10**15 - 1
>>> digit_length1(n) == digit_length2(n)
False

Your results may vary: this is using the platform libm at some level, so is system-dependent.

To avoid the floating-point imprecision of math.log10 and the sign inconsistency of str (which counts the minus sign of a negative number), one can use decimal.Decimal instead:

import math
from decimal import Decimal

n = 10 ** 15 - 1
print(1 + math.floor(math.log10(n))) # outputs 16
print(len(str(n))) # outputs 15
print(len(Decimal(n).as_tuple().digits)) # outputs 15
