Leaky abstractions and CPython performance

kumaraditya303 · August 22, 2022, 10:55am

Background

Historically, CPython has exposed most if not all of the structure definitions used by the interpreter/runtime as public structures, rather than as opaque structures whose layout is an implementation detail. Only with recent efforts have the header files been organized into three parts:

Limited C API
Public C API
Internal C API

This post is about the Public C API only, not the Limited C API or the stable ABI.

Motivation

This post is mostly inspired by “lazily compute line numbers” and “efficient implementation of integers”. Specifically, it concerns modifying the structures of builtin objects for better performance or reduced memory usage.

Taking lazily computed line numbers as an example, I proposed changing the implementation of exception handling to defer the creation of traceback objects for most exceptions and make it lazy (more details). The PR defers computing the line numbers of traceback objects from frame objects. This change alone provides a 25% speedup when handling exceptions. There will be more speedup once we implement the whole idea of lazy tracebacks.

There were concerns about backwards compatibility, since with this change the tb_lineno of traceback objects will be computed lazily. The issue is that the PyTracebackObject structure is exposed in a public header file, so it can be regarded as a public structure.

However, I have some objections to this:

The structure is not documented at all; the official documentation does not mention it.
The structure is not used as an argument to any of the C API functions.
There are no comments in the header file about what the C fields mean or represent.

@markshannon proposed changing the name of the C-level field, providing a C API function, and documenting this in “Porting to 3.12”. I agree, and this seems the best way forward; however, I created this post to get agreement on what the rules are for modifying undocumented structures. There are discussions about changing the long object implementation to reduce memory usage (using a tagged pointer, two structures, etc.), and when that happens there will be the same concerns about backwards compatibility, so it seemed worth discussing and standardizing.

So here is my proposal for modifying undocumented structures, inspired by @markshannon’s proposal.

First, document the change in the What’s New entry and provide code snippets for the change.
If the field is widely used, consider adding a C API function for it; otherwise provide equivalent Python-level code in the What’s New entry.
If we do end up providing a C API function, provide a compatibility shim for it in pythoncapi-compat where possible, to ease the transition for code bases supporting old Python versions.
Change the C-level field name.
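As an illustration of the "Python-level equivalent" mentioned in the second step, the sketch below walks a traceback chain using only the documented attributes `tb_next`, `tb_frame` and `tb_lineno`. The helper name `traceback_entries` is made up for this example; an actual What’s New entry might word its snippet differently.

```python
import sys


def traceback_entries(tb):
    """Return (filename, function name, line) per entry, outermost first."""
    entries = []
    while tb is not None:
        code = tb.tb_frame.f_code
        entries.append((code.co_filename, code.co_name, tb.tb_lineno))
        tb = tb.tb_next
    return entries


def inner():
    raise KeyError("missing")


def outer():
    inner()


try:
    outer()
except KeyError:
    entries = traceback_entries(sys.exc_info()[2])
```

Because this goes through attribute access rather than struct members, it keeps working regardless of how the C struct stores or computes the values.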

In this particular case, a 25% or greater performance improvement in exception handling, which is a core feature of the language, feels significant to me and seems worth it.

Consider sharing your thoughts on this. Thanks!

encukou · August 22, 2022, 1:22pm

Please also request a PEP 387 (Backwards Compatibility Policy) exception from the Steering Council. It might be a straightforward one to grant, but please don’t set a bad precedent by skipping that step. Consistency on determining things like what’s “widely used” is a big reason we have a SC.

For the proposal: Another option is adding API for all fields (and constructor, perhaps?), and making the struct fully internal, so users can switch at once.
Adding API for all fields and then only changing+renaming one would limit breakage now and make future breaking changes easier.

kumaraditya303 · August 22, 2022, 2:01pm

Okay, since we are requesting an exception from the SC anyway, moving traceback objects to the internal API and providing API functions sounds better. I’ll create a GitHub issue and check with Cython and others.

njs · August 22, 2022, 2:18pm

IMO, it shouldn’t matter whether a structure is documented or not. What matters is how much real-world stuff breaks when you change something, and what kind of transition plan you can come up with.

In the case of PyTracebackObject, I know jinja2 used to mess with the internal fields directly – though I think they stopped recently, so it’s probably fine here? But if we’re talking about our general policy for making changes, I think “does this break jinja2?” is a much more important question than “is this structure layout documented?”.

davidism · August 22, 2022, 7:54pm

Jinja2 sets tb.tb_next instead of messing with ctypes, and uses CodeType.replace instead of building new code objects. So we’re only using documented Python interfaces now.
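For readers unfamiliar with these two interfaces, here is a small sketch of what they allow. It is not Jinja2’s actual code: the function names and the `page.html` file name are invented. It relies on `tb_next` being writable (Python 3.7+) and on `CodeType.replace` (Python 3.8+).

```python
import sys


def render():
    raise RuntimeError("template error")


def call_render():
    render()


# CodeType.replace returns a modified copy; the original code object is unchanged.
renamed = render.__code__.replace(co_name="template", co_filename="page.html")

try:
    call_render()
except RuntimeError:
    head = sys.exc_info()[2]
    # Chain is: catching frame -> call_render -> render.
    # Relinking tb_next drops the call_render entry from the chain.
    head.tb_next = head.tb_next.tb_next

    remaining = []
    tb = head
    while tb is not None:
        remaining.append(tb.tb_frame.f_code.co_name)
        tb = tb.tb_next
```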

Jinja2 and Werkzeug both have code that inspects and changes tracebacks for debugging purposes. It’s easy to run our test suite if you want to test a change that might affect those interfaces.

I don’t want to hold up any improvements on our account. As long as there’s a deprecation warning and I can figure out code to continue to support both versions of something until we drop the old one, I’m fine with Python removing things.

guido · August 22, 2022, 11:58pm

I don’t think we’re ready to define a standard policy for this kind of change yet. I do agree with Nathaniel that “what breaks in practice” is more important than “was it documented”, though I assume jinja2 is just a random example that Nathaniel happened to think of, not literally the thing to check for. A slightly less random example might be Cython, which does a lot of C-level hackery for performance, and is a fundamental dependency of most of the Scientific Python ecosystem.

I agree that PEP 387 is important, as is the ABI (if anything is in the ABI it must not change). I’m not sure whether you are really planning to request an exemption from the SC about the full C-level definition of traceback objects – if you do, what kind of documentation do you plan to provide to back up your request?

Lastly – can you edit your messages to make the hyperlinks to GH issues more readable? The default (?) text you pasted from GitHub has a lot of boilerplate besides the issue number and title, and that makes the messages harder to read.

merwok · August 23, 2022, 12:40am

Sadly that’s Discourse trying to improve what people do and automatically replacing link text with the page title fetched from the resource. I don’t know if that can be overridden, or if we must avoid inline links in this forum.

guido · August 23, 2022, 4:48am

This is getting off-topic, but let me try this:

Here is a link to GH-95238

It seems to work with explicit markup (i.e., [text](link)).

kumaraditya303 · August 23, 2022, 11:01am

I already said that before creating an issue for the Steering Council I will contact Cython, which may be affected, on the GitHub issue to measure the impact. Also, if the proposal is accepted and Cython is found to be affected, I will fix it ASAP; I take responsibility for fixing it.

Regarding the standard policy, IIRC there was a proposal about semi-stable APIs, but there doesn’t seem to be much progress on it; otherwise I would have moved it to the “semi-stable” API. As of today, no such thing exists, so moving to the internal API and providing C API functions seems the best way forward.

To avoid confusion, I already stated in the background that this does not affect the stable ABI, so it would be better to leave it out of this discussion as it is irrelevant.

Do you see any issues with that? My next plan is to avoid materializing frame objects for tracebacks, and that would require making the tb_frame field lazy, so getting an exemption for the whole structure seems better. Regarding documentation, the struct would be documented as opaque and we will provide getter functions for the fields. I will properly document this in “Porting to 3.12”. Feel free to share any other ideas you have.

Done.

tiran · August 23, 2022, 12:08pm

I still recommend that we try hard to keep our code backwards compatible with Cython whenever possible or feasible. Projects often ship generated C files in their source distribution and do not auto-generate new C files by default. This behavior is recommended by Cython:

It is strongly recommended that you distribute the generated .c files as well as your Cython sources, so that users can install your module without needing to have Cython available.

It is also recommended that Cython compilation not be enabled by default in the version you distribute. Even if the user has Cython installed, he/she probably doesn’t want to use it just to install your module. Also, the installed version may not be the same one you used, and may not compile your sources correctly.

That means most users won’t get the fix unless a project releases a new version with updated C files. Or users jump through additional hoops to install Cython and force each project to re-generate the files. The Cython docs do not recommend a single option or env var to force re-generation. Projects typically have different approaches… (yeah, it’s annoying).

da-woods · August 23, 2022, 4:54pm

Just to repeat what I said on the GitHub issue: I don’t think the tb_lineno change should affect Cython, so please don’t spend too long considering the implications of breaking Cython (specifically) with it.

Making tb_frame lazy would require an update to Cython (although it looks like a fairly small one - 2 lines). I’d be very surprised if you got to 3.12 without needing an update to Cython somewhere so it probably isn’t worth worrying too much about easily fixed stuff.

I recall Victor Stinner had a tool for scanning top PyPI projects (or something similar?) to try to find out who uses which APIs, so that might be worth a look? (Obviously there’s a whole world of closed-source stuff that it doesn’t cover, but it’s a useful first pass.)
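A rough first pass of that kind can be sketched in a few lines of Python. This is not the tool mentioned above, only a hypothetical regex-based scan over in-memory source text; `FIELD_ACCESS`, `count_field_accesses` and the sample sources are made up for illustration. It only catches direct `->` member access and will miss macros, aliases and other indirect use.

```python
import re
from collections import Counter

# Direct member access to the traceback struct fields, e.g. `tb->tb_lineno`.
FIELD_ACCESS = re.compile(r"->\s*(tb_next|tb_frame|tb_lasti|tb_lineno)\b")


def count_field_accesses(sources):
    """Count direct field accesses; `sources` maps file name to C/C++ text."""
    counts = Counter()
    for text in sources.values():
        counts.update(FIELD_ACCESS.findall(text))
    return counts


# Hypothetical sample inputs: one direct user, one attribute-based user.
sample = {
    "error.cc": "while (tb) { line = tb->tb_lineno; tb = tb->tb_next; }",
    "ok.c": 'PyObject *n = PyObject_GetAttrString(tb, "tb_lineno");',
}
usage = count_field_accesses(sample)
```

The second sample file reads the attribute through `PyObject_GetAttrString`, so it is not counted: that style of access does not depend on the struct layout.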

guido · August 23, 2022, 5:40pm

I think the next step is for Kumar to draft something about the C-level traceback structure to the SC. @kumaraditya303 I’d be happy to help.

From the SC notes that were just posted in draft (PR) form on the SC repo it seems they approved PEP 689 (unstable API) back in May but are waiting for some edits before it is labeled as accepted. So we could propose to move the traceback struct into the unstable API.

brettcannon · August 23, 2022, 7:17pm

There’s been a bunch of discussion about it on the SC. @encukou was/did bring it back to python-dev to discuss naming.

encukou · August 24, 2022, 8:35am

I thought I’d deferred PEP 689 for now, but I see it’s part of a bigger PR that’s stuck. I opened #2769 with a bit of explanation.
Basically, the edits the SC requested sound trivial, but point to an issue I think should be solved first. Opening that conversation is on my TODO list… for a while now.

kumaraditya303 · August 24, 2022, 10:14am

Okay, I assume that means “unstable APIs” are out of the question now, as the PEP is being deferred (for now).

kumaraditya303 · August 24, 2022, 10:18am

Thanks for your help!

The PEP is being deferred for now, so moving the structure to the internal API seems to be the only possible way.

guido · August 24, 2022, 6:40pm

Okay, well making the details of the traceback at the C level internal doesn’t sound too bad.

encukou · August 25, 2022, 8:25am

To me, adding accessors & hiding the struct itself sounds like a good thing to do, even if/when we have the unstable API tier. Opting in to unstable API would require extensions to change their code anyway; switching from member access to functions isn’t much worse.

guido · August 25, 2022, 4:23pm

Well, adding getter (and setter) functions doesn’t automatically make an API stable (consider the API for creating new code objects). And it adds complexity to the implementation (sometimes also to the caller) so it isn’t automatically what I would try first.

However, getters are essential when something must be computed lazily, as is the case here.

In the end, API evolution will always be a tricky thing in C.

pablogsal · August 25, 2022, 11:04pm

Disclaimer: this comment doesn’t reflect my opinion or advice on how we should proceed, either as a core dev or as a steering council member.

As a data point: it seems that tensorflow, blender and many other projects use the field directly, although I assume porting will not be too hard in many of these cases. In any case, it seems this is going to break a lot of projects, even if the fix is easy. A 5-minute search on GitHub shows a bunch of other projects that seem to use it as well and will be affected (including the ones I mentioned):

github.com

krawiah/tensorflow/blob/32ad61688a1c92bb42c1217c17ad212ffafda3c0/tensorflow/compiler/xla/python/traceback.cc#L104


      
    throw std::runtime_error("tb_frame argument must be a frame");
  }
  tb = PyObject_GC_New(PyTracebackObject, &PyTraceBack_Type);
  if (tb) {
    tb->tb_next =
        tb_next == Py_None
            ? nullptr
            : reinterpret_cast<PyTracebackObject*>(tb_next.release().ptr());
    tb->tb_frame = reinterpret_cast<PyFrameObject*>(tb_frame.release().ptr());
    tb->tb_lasti = tb_lasti;
    tb->tb_lineno = tb_lineno;
    PyObject_GC_Track(tb);
  }
  return py::reinterpret_steal<py::object>(reinterpret_cast<PyObject*>(tb));
}
#else

static py::object MakePythonTraceback(py::object tb_next, py::object tb_frame,
                                      int tb_lasti, int tb_lineno) {
  py::handle traceback_type(reinterpret_cast<PyObject*>(&PyTraceBack_Type));
  return traceback_type(tb_next, tb_frame, tb_lasti, tb_lineno);

github.com

blender/blender/blob/594f47ecd2d5367ca936cf6fc6ec8168c2b360d0/source/blender/python/intern/bpy_traceback.c#L184


      
         tb && (PyObject *)tb != Py_None;
         tb = tb->tb_next) {
      PyObject *coerce;
      const char *tb_filepath = traceback_filepath(tb, &coerce);
      const int match = ((BLI_path_cmp(tb_filepath, filepath) == 0) ||
                         (ELEM(tb_filepath[0], '\\', '/') &&
                          BLI_path_cmp(tb_filepath + 1, filepath) == 0));
      Py_DECREF(coerce);

      if (match) {
        *lineno = tb->tb_lineno;
        /* used to break here, but better find the inner most line */
      }
    }
  }
}

github.com

ZeroCool940711/Sandbox-Game-Engine/blob/5c3884aa82da70b7e8d44f11666e432c9de9e2a7/src/lib/pyscript/py_traceback.cpp#L78


      
    {

        MF_ASSERT( fd == fd_ );
        bool isDone = false;

        // Read as much as we can, note: all context is saved from this loop
        while (!isDone)
        {
            // Only append to the current buffer if this is the final line
            // or if we risk an overflow if we don't
            if (lineUpTo_ < tb_->tb_lineno - 1 || lineBufUpTo_ > BUFFLEN / 2)
                lineBufUpTo_ = 0;

            // Half fill the buffer
            int output = read(fd_, lineBuf_ + lineBufUpTo_, BUFFLEN / 2);

            // IO isn't ready right now, call back later
            if (output == EAGAIN)
            {
                // Make sure is registered with the nub
                s_pNub->registerFileDescriptor( fd_, this );

github.com

gf712/python-cpp/blob/69355b1f2c1e9d41c3d3a0df28c16b45cbc6aa1c/src/executable/bytecode/instructions/ForIter.cpp#L23


      
        const auto &next_value = (*iterable_object)->next();
        if (next_value.is_err()) {
            auto *last_exception = next_value.unwrap_err();

            // FIXME: this shold be done somewhere more centralized and where we can easily get the
            // instruction index and line number
            size_t tb_lineno = 0;
            size_t tb_lasti = 0;
            PyTraceback *tb_next = last_exception->traceback();
            auto traceback =
                PyTraceback::create(interpreter.execution_frame(), tb_lasti, tb_lineno, tb_next);
            ASSERT(traceback.is_ok())
            last_exception->set_traceback(traceback.unwrap());

            interpreter.raise_exception(last_exception);

            if (!interpreter.execution_frame()->catch_exception(last_exception)) {
                // exit loop in error state and handle unwinding to interpreter
                return Err(static_cast<BaseException *>(last_exception));
            } else {
                interpreter.execution_frame()->pop_exception();

github.com

xfwduke/antlr4_parser/blob/6f84c934d1530d3fb79d547aea0380425504b67d/base/python_utils.cpp#L42


      
#include "base/string_utilities.h"

std::string format_python_traceback(PyObject *tb) {
  PyTracebackObject *trace = (PyTracebackObject *)tb;
  std::string stack;

  stack = "Traceback:\n";
  while (trace && trace->tb_frame) {
    PyFrameObject *frame = (PyFrameObject *)trace->tb_frame;
    stack += base::strfmt("  File \"%s\", line %i, in %s\n", PyString_AsString(frame->f_code->co_filename),
                          trace->tb_lineno, PyString_AsString(frame->f_code->co_name));
    PyObject *code = PyErr_ProgramText(PyString_AsString(frame->f_code->co_filename), trace->tb_lineno);
    if (code) {
      stack += base::strfmt("    %s", PyString_AsString(code));
      Py_DECREF(code);
    }
    trace = trace->tb_next;
  }
  return stack;
}

github.com

swordfeng/pyjs/blob/ea14764110f565d7f09e4c8c6c9bdfd64aafe932/src/error.cc#L31


      
stackStream << PyUnicode_AsUTF8(errName) << ": " << PyUnicode_AsUTF8(errMessage) << std::endl;

// python stack
std::vector<std::string> frames;
PyTracebackObject *last = nullptr, *tb = reinterpret_cast<PyTracebackObject *>(traceback.borrow());
while (tb != nullptr) {
    last = tb;
    std::ostringstream frameStringStream;
    frameStringStream << PyUnicode_AsUTF8(tb->tb_frame->f_code->co_name)
            << " (" << PyUnicode_AsUTF8(tb->tb_frame->f_code->co_filename)
            << ":" << tb->tb_lineno << ")";
    frames.push_back(frameStringStream.str());
    tb = tb->tb_next;
}

if (last) {
    // print line
    PyObjectWithRef linecache(PyImport_ImportModule("linecache"));
    PyObjectWithRef getline(PyObject_GetAttrString(linecache, "getline"));
    PyObjectWithRef args(PyTuple_New(2));
    PyTuple_SetItem(args, 0, PyObjectMakeRef(last->tb_frame->f_code->co_filename).escape());

github.com

dstone64/Orbital/blob/9a0985ac444c0f2cf0bd85f42019a393545c4ccd/Orbital/QPyEngine/QPyEngineUtils.cpp#L98


      
ErrorReport(QString& qErrStr)
{
    if (PyErr_Occurred()) {
        std::string errStr;
        PyObject * pType, *pVal, *pTraceback;
        PyTracebackObject * ptb;

        PyErr_Fetch(&pType, &pVal, &pTraceback);
        PyString_ToStdString(pVal, errStr);
        if ((ptb = (PyTracebackObject *)pTraceback) != NULL) {
            int errLine = ptb->tb_lineno;
            unsigned int tb_no = 1;
            std::string errFile;

            PyString_ToStdString(ptb->tb_frame->f_code->co_filename, errFile);
            errStr.append("\nFile " + errFile + ", line " + std::to_string(errLine) + "\n");

            while (ptb->tb_next && tb_no < 10) {
                ptb = ptb->tb_next;
                errLine = ptb->tb_lineno;
                PyString_ToStdString(ptb->tb_frame->f_code->co_filename, errFile);

github.com

JarrettWendt/FIEAEngine/blob/0bf7e89cd66fec29550f7d7a1a11f5cf398c27e5/source/Library/python/Exception.cpp#L17


      
{
    void Exception::HandleErrors()
    {
        PyObject* type, * value;
        PyTracebackObject* traceback;
        PyErr_Fetch(&type, &value, reinterpret_cast<PyObject**>(&traceback));
        if (type)
        {
            std::stringstream stream;
            stream << "file: " << Util::ToString(traceback->tb_frame->f_code->co_filename) << std::endl;
            stream << "line: " << traceback->tb_lineno << std::endl;
            stream << "error: " << Util::ToString(value);
            PyErr_Clear();
            const std::string str = stream.str();
            throw py::Exception(str);
        }
    }
}

github.com

h4ck3rm1k3/django-admin-nuitka/blob/4435a581d680e6d57390f4631a4816884cd7a382/internal.cpp#L93


      
            PyObjectTempKeeper1 make_tuple1;
            {
                PyObjectTemporary tmp_exception_type( CALL_FUNCTION_WITH_POSARGS( PyExc_TypeError, PyObjectTemporary( MAKE_TUPLE1( PyObjectTemporary( BINARY_OPERATION_REMAINDER( _python_str_digest_6f69449e3cbe19d8aaa066664eccb812, PyObjectTemporary( ( make_tuple1.assign( impl_function_2_get_callable_name_desc_of_module___internal__( _python_var_called.asObject1() ) ), MAKE_TUPLE2( make_tuple1.asObject0(), PyObjectTemporary( LOOKUP_ATTRIBUTE( PyObjectTemporary( BUILTIN_TYPE1( _python_var_star_arg_dict.asObject() ) ).asObject(), _python_str_plain___name__ ) ).asObject() ) ) ).asObject() ) ).asObject() ) ).asObject() ) );
                RAISE_EXCEPTION_WITH_TYPE( tmp_exception_type.asObject(), PyObjectTemporary( MAKE_TRACEBACK( frame_guard.getFrame() ) ).asObject() );
        }
        }
    }
    else
    {
        PyTracebackObject *tb = _exception.getTraceback();
        frame_guard.setLineNumber( tb->tb_lineno );
        _exception.setTraceback( tb->tb_next );
        tb->tb_next = NULL;

        throw;
    }
}
PyObjectTemporary _python_tmp_iter( MAKE_ITERATOR( _python_tmp_keys.asObject() ) );
PyObjectTemporary _python_tmp_dict( PyDict_New() );
while( true )
{

github.com

KLayout/klayout/blob/8705de49f7a4ac082aab036251206b984d2c664d/src/pya/pya/pyaUtils.cc#L66


      
          
          
  if (exc_type) {

    //  fetch traceback
    //  TODO: really decref the stack trace? how about the other objects in the stack trace?
    std::vector <tl::BacktraceElement> backtrace;
    if (exc_traceback) {
      PyTracebackObject *traceback = (PyTracebackObject*) exc_traceback.get ();
      for (PyTracebackObject *t = traceback; t; t = t->tb_next) {
#if PY_VERSION_HEX >= 0x030B0000
        backtrace.push_back (tl::BacktraceElement (python2c<std::string> (PyFrame_GetCode(t->tb_frame)->co_filename), t->tb_lineno));
#else
        backtrace.push_back (tl::BacktraceElement (python2c<std::string> (t->tb_frame->f_code->co_filename), t->tb_lineno));
#endif
      }
      std::reverse (backtrace.begin (), backtrace.end ());
    }

    if (PyErr_GivenExceptionMatches (exc_type.get (), PyExc_SyntaxError) && PyTuple_Check (exc_value.get ()) && PyTuple_Size (exc_value.get ()) >= 2) {

      const char *sourcefile = 0;

github.com

mmmarvin/PixPaint/blob/f61faedb02d4e6c711b7d5ae841dac39e2d38ddb/src/embed/script_errors.cpp#L45


      
          
          
try {
  PyObject* type, * value, * traceback;
  PyErr_Fetch(&type, &value, &traceback);

  bp::handle<> htraceback(traceback);
  bp::object otraceback(htraceback);
  bp::handle<> htype(type);
  bp::object otype(htype);

  auto err_line_number = bp::extract<long>(otraceback.attr("tb_lineno"))();
  auto err_filename = bp::extract<std::string>(otraceback.attr("tb_frame").attr("f_code").attr("co_filename"))();
  auto err_funcname = bp::extract<std::string>(otraceback.attr("tb_frame").attr("f_code").attr("co_name"))();
  auto err_type = bp::extract<std::string>(otype.attr("__name__"))();
  auto err_msg = bp::extract<std::string>(value)();

  // emulate pytohn error
  error_msg = std::string("Traceback (most recent call last):\n") +
              std::string(" File \"") + err_filename + std::string("\", line ") +
              std::to_string(err_line_number) + std::string(", in ") + err_funcname + std::string("\n") +
              err_type + std::string(": ") + err_msg;