Replacements for PyMapping_HasKey(), PyMapping_HasKeyString(), PyObject_HasAttr(), and PyObject_HasAttrString()

storchaka · August 30, 2023, 7:28am

Functions PyMapping_HasKey(), PyMapping_HasKeyString(), PyObject_HasAttr(), and PyObject_HasAttrString() have a flaw – they clear all errors raised inside (in custom __getitem__, __eq__, __hash__ or __getattr__, unhashable key or just a memory error). The similar flaw in hasattr() was fixed in Python 3, but these C API functions have no way to report an error.

So we need new functions in replacement. What should be their names? PyDict_GetItemWithError() replaced PyDict_GetItem() which has the same flaw. Should we add the WithError suffix in new names? Or use other suffix, Ex or 2 as in some other C API?

github.com/python/cpython

C API: Add replacements for PyObject_HasAttr() etc

opened 05:55AM - 26 Aug 23 UTC

serhiy-storchaka

type-feature topic-C-API

# Feature or enhancement ### Has this already been discussed elsewhere? No res…ponse given ### Links to previous discussion of this feature: https://github.com/python/cpython/issues/75753 https://github.com/python/cpython/issues/106672 ### Proposal: Functions `PyDict_GetItem()`, `PyDict_GetItemString()`, `PyMapping_HasKey()`, `PyMapping_HasKeyString()`, `PyObject_HasAttr()`, `PyObject_HasAttrString()` and `PySys_GetObject()` have a flaw -- they clear any error raised inside the function, including important and critical errors. They cannot be fixed, because the user code which use them do not handle errors. There are replacements free from this flaw for `PyDict_GetItem()` (`PyDict_GetItemWithError()` and `PyDict_GetItemRef()`) and, in some applications, to `PyDict_GetItemString()` (`PyDict_GetItemRefString()`). We need new functions similar to `PyMapping_HasKey()`, `PyMapping_HasKeyString()`, `PyObject_HasAttr()`, `PyObject_HasAttrString()` which return three-state value (`1` - yes, `0` -- no, and `-1` --error). What should be their names? Add the `WithError` suffix? Add the `Ex` sufix? Add the `2` suffix?

github.com/python/cpython

Avoid suppressing all exceptions in PyObject_HasAttr()

opened 05:46PM - 24 Sep 17 UTC

serhiy-storchaka

type-bug interpreter-core extension-modules

BPO | [31572](https://bugs.python.org/issue31572) --- | :--- Nosy | @pitrou, @vs…tinner, @serhiy-storchaka PRs | <li>python/cpython#3723</li><li>python/cpython#3724</li><li>python/cpython#3725</li><li>python/cpython#3726</li><li>python/cpython#3727</li><li>python/cpython#3728</li><li>python/cpython#3729</li><li>python/cpython#3731</li><li>python/cpython#4081</li> Dependencies | <li>bpo-32787: Better error handling in ctypes</li><li>bpo-32788: Better error handling in sqlite3</li> <sup>*Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.*</sup> <details><summary>Show more details</summary><p> GitHub fields: ```python assignee = None closed_at = None created_at = <Date 2017-09-24.17:46:45.898> labels = ['extension-modules', 'interpreter-core', 'type-bug'] title = 'Avoid suppressing all exceptions in PyObject_HasAttr()' updated_at = <Date 2018-12-05.14:44:23.566> user = 'https://github.com/serhiy-storchaka' ``` bugs.python.org fields: ```python activity = <Date 2018-12-05.14:44:23.566> actor = 'serhiy.storchaka' assignee = 'none' closed = False closed_date = None closer = None components = ['Extension Modules', 'Interpreter Core'] creation = <Date 2017-09-24.17:46:45.898> creator = 'serhiy.storchaka' dependencies = ['32787', '32788'] files = [] hgrepos = [] issue_num = 31572 keywords = ['patch'] message_count = 12.0 messages = ['302873', '304759', '304764', '304787', '304795', '306084', '306085', '306086', '306087', '310091', '316580', '331121'] nosy_count = 3.0 nosy_names = ['pitrou', 'vstinner', 'serhiy.storchaka'] pr_nums = ['3723', '3724', '3725', '3726', '3727', '3728', '3729', '3731', '4081'] priority = 'normal' resolution = None stage = 'patch review' status = 'open' superseder = None type = 'behavior' url = 'https://bugs.python.org/issue31572' versions = [] ``` </p></details>

erlendaasland · August 30, 2023, 8:05am

I’d prefer the explicit WithError suffix to the more cryptic Ex and 2 suffixes.

vstinner · August 30, 2023, 2:19pm

Would it be possible to change the function to start reporting errors? Is it going to break C extensions?

If it’s not possible, WithError is sadly the least bad suffix IMO. I concur with Erlend here.

storchaka · August 30, 2023, 4:16pm

I think it is impossible. The code which uses PyMapping_HasKey() most likely will interpret the returned -1 as a true value. Currently it gets a false value, so this is a surprising change in behavior. It can follow different branch of code after this. It also leaves a raised exception, so the following successful return from the function can cause a crash or SystemError.

I get rid of most of uses of these functions in CPython by replacing them with private functions which finally found way in the limited C API under names PyObject_GetOptionalAttr() and PyMapping_GetOptionalItem(). The reason why I have not added the replacements earlier is that I was not sure that we need special functions for testing the existence of the attribute or the key, if the above functions can be used for this. But they need a variable to store the retrieved value and the following Py_DECREF(), so special *Has* functions are more convenient in some cases. It is also easier to port code from using PyObject_HasAttr() to use PyObject_HasAttrWithError() than to use PyObject_GetOptionalAttr().

if (PyObject_HasAttr(obj, attrname)) {
    // found
}
else {
    // not found
}

to

int rc = PyObject_HasAttrWithError(obj, attrname);
if (rc < 0) {
    // error
}
else if (rc) {
    // found
}
else {
    // not found
}

or

if (PyObject_HasAttrWithError(obj, attrname) > 0) {
    // found
}
else if (PyErrOccurred()) {
    // error
}
else {
    // not found
}

instead of

PyObject *tmp;
if (PyObject_GetOptionalAttr(obj, attrname, &tmp) < 0) {
    // error
}
else if (tmp) {
    Py_DECREF(tmp);
    // found
}
else {
    // not found
}

or

PyObject *tmp;
int rc = PyObject_GetOptionalAttr(obj, attrname, &tmp);
if (rc < 0) {
    // error
}
else if (rc) {
    Py_DECREF(tmp);
    // found
}
else {
    // not found
}

We can of course make PyObject_GetOptionalAttr() accepting NULL, so that PyObject_GetOptionalAttr(obj, attrname, NULL) is the same as PyObject_HasAttrWithError(obj, attrname), but it adds an overhead to every call or PyObject_GetOptionalAttr() and just look more ugly. We have PyDict_Contains() despite the existence of a number of PyDict_Get*() functions.

storchaka · September 6, 2023, 8:18pm

Oh, there is the “C API” sub-category. Moved this topic there.

Here is an implementation.

github.com/python/cpython

gh-108511: Add C API functions which do not silently ignore errors

python:main ← serhiy-storchaka:PyObject_HasAttrWithError-PyMapping_HasKeyWithError

opened 08:13PM - 06 Sep 23 UTC

serhiy-storchaka

+310 -111

Add the following functions: * PyObject_HasAttrWithError() * PyObject_HasAtt…rStringWithError() * PyMapping_HasKeyWithError() * PyMapping_HasKeyStringWithError() * Issue: gh-108511 ---- :books: Documentation preview :books:: https://cpython-previews--109025.org.readthedocs.build/

It includes changes in the code to use PyObject_HasAttrWithError() where it is appropriate. There are no use cases for PyMapping_HasKeyWithError() in the CPython code.