XOR operand between bytes

Deejuha · August 2, 2022, 9:15pm

Hello,

How do you think about supporting XOR ^ operand between bytes objects?

Equivalent python code adding to the inherited class

    def __xor__(self, other):
        """
        Overrides xor operator in order to xor bytes.

        """
        return bytes(x ^ y for x, y in zip(self, other))

Melendowski · August 3, 2022, 2:03am

Why not support all bit wise operations then?

I’m not intimately familiar with an application for this in python. My experience with bit masking is operating on integers for setting individual bits which are turning on/off register values for setting up peripherals on microcontrollers, but that’s in C.

Deejuha · August 3, 2022, 5:55am

To be honest I’ve proposed XOR because of that this is the only one operand which I’ve realized that I need it

But hey, your idea is cool, I’m even not aware about possibilities of bytes objects.

XOR is required in some security related calculations like Miyaguchi-Preneel compression where you’re xoring some messages all together, I find it really useful

Dutcho · August 3, 2022, 4:10pm

How would this (any of the bitwise operators) work with bytes of unequal lengths, i.e. len(self) != len(other)?
As given, zip ignores the excess bytes of the longer of the two, but doubt that is by intent

Zeturic · August 3, 2022, 4:15pm

I see three basic options for unequal lengths.

Raise an exception if the two aren’t the same length.
Behave like zip and truncate to the shorter length.
Once the shorter one runs out, just take the other’s byte values verbatim. This would be analogous to or-ing with 0, and-ing with 255, and xor-ing with 0 for those bytes, all of which are no-ops.

It’s hard to really say without some concrete use cases, but I suspect the typical way this would be used is between two bytes of equal lengths, in which case (2) and (3) would hide what is probably an error.

Dutcho · August 3, 2022, 4:25pm

Indeed those possibilities. Or a fourth:

extend the shorter with zero bytes

This is what int.from_bytes(…, byteorder='little') does, so consistent between bytes and int.

My question was intended to clarify what’s exactly proposed

Deejuha · August 3, 2022, 4:29pm

Let me introduce first use case:

As a user I would like to use those operands in security operations where bytes has equal range (plain or cipher blocks).

Dutcho · August 3, 2022, 4:32pm

And for plain text not being a multiple of the cipher length, you’d extend the plain text, right?
So that’s zip(…, …, strict=True).

Deejuha · August 3, 2022, 4:40pm

Apologize, could you rephrase it?
Basically most of crypto stuff bases on fixed size blocks, so I meant that if they were not - I would like to get informed about that (Exception)

So that’s for this first use case.
I would try to get a new soon

Edit:
Rewrite → rephrase

Edit2:
By meaning of “Fixed” I meant that in this use case both sides of operands would be equal, because both has its own, same fixed size

Dutcho · August 3, 2022, 5:15pm

About extend: I assume if the plain text length is not a multiple of the cipher text, the code will extend the last block of plain text with additional bytes to match the cipher length before xor’ing them

About strict zip: it’ll fail on unequal lengths, see PEP 618 – Add Optional Length-Checking To zip | peps.python.org

Does that clarify my remark?

Deejuha · August 4, 2022, 5:04am

Hello - yes, I think I’ve got the point now, thanks!

Regarding extend - no, this use case will not extend plain text to fit somewhere, in this use case cipher and plaintext are the same length in those particular functions messages (plain) and cipher are blocks with the same length.

Regarding strict zip - yea, I get it ^^ but as I’ve said - same length in this use case so strict would be used in equivalent python code.

I would propose find the new use cases before any decision.

storchaka · August 4, 2022, 11:23am

It was already discussed earlier.

It would be strange to only implement ^, but not |, & and ~.

But bytes objects are collections, and operators | and & already defined for sets which also collections, and some set methods accept arbitrary iterables, including bytes objects. I afraid that it could lead to some errors.

I think that it would be better to use functions for bitwise operations on bytes. It is less ambiguous. And you can add other useful functions: set or clear bits in the specified range, test whether all bits in the specified range are set or clear, shift bits, etc, etc. You can also look at existing implementations for bitarrays or bitsets.

Topic		Replies	Views
Bitwise Operators Python Help	4	624	September 23, 2022
Bitwise Operator Negation Python Help	21	1491	October 9, 2023
Binary Logic and Bitwise Operators in Python Python Help	6	577	November 2, 2023
Int to bytes conversion confusion Python Help	6	5398	February 9, 2021
Get single-byte bytes objects from a bytes object Python Help	44	2264	January 2, 2024

XOR operand between bytes

Related Topics