PEP 734: Multiple Interpreters in the Stdlib

thomas · March 25, 2024, 3:26pm

The SC has been evaluating and discussing PEP 734 since it was sent our way last month (PEP 734 -- Multiple Interpreters in the Stdlib · Issue #234 · python/steering-council · GitHub), and we have some concerns. This is partly with my RM hat on, having seen the kinds of oversights we’ve had to fix in 3.12 late in the release cycle of 3.12.0, and as backported fixes. It’s understandable, given the kind of changes that are required, their scope and the limited experience (and limited number of eyeballs) involved, but it does worry me. On top of that we have a completely new API that we have basically no experience with, even though it’s somewhat similar to other solutions we’ve seen (like concurrent.futures).

How invasive is the PEP 734 implementation in the interpreter? To what extent could it be released as a PyPI package first, before we consider it for inclusion in the standard library? If it can’t entirely be a separate package, could we make a private module with the minimum number of hooks for such a PyPI package to work? Having it be a separate package would make it much easier to evolve the API and, for example, fix problematic semantics, since you can release at will and users can always pin to an older version of the package while they make their code work with the newer version.

Also, the SC would like to invite you to come talk to us about the PEP, either at the regularly scheduled office hour, or if that doesn’t work for you, we can schedule something separately.

eric.snow · March 25, 2024, 6:38pm

Thanks for the update. I appreciate your thoughtfulness.

Most of those fixes related to subinterpreters generally, especially isolated interpreters. The effect of PEP 734 is primarily in how it makes the existing feature more available and thus exposes existing flaws. What is your concern relative to this PEP?

At its core, PEP 734 has only a few parts:

expose the existing C-API (which has been around a long time) via a lightly-wrapping abstraction
provide a simple, safe way for one interpreter to run code in another (similarly a light implementation on top of existing C-API)
provide a basic mechanism for safely sending data from one interpreter to another

(There’s also a small amount of “sugar” on top of that.)

I’d consider the first part to be uncontroversial. The second part is focused and simple.

The third part is fairly important because subinterpreters aren’t nearly as useful without a way to communicate between them. That became clear almost immediately in the early PEP 554 discussions. PEP 734 presents an API that is almost identical to the existing queue.Queue, so there shouldn’t be any real surprises for users.

From my perspective, the PEP 734 API presents very little risk. My guiding principle from the beginning has been to present a minimal API on which we could build. Based on the many discussions this PEP has had, and on the practical experience with it that I and others have had with the implemented API, I’m confident it is a good starting point.

To me, that’s the key thing here: we want a solid foundation in the stdlib on which people may start using multiple interpreters in their programs. We can build from there as appropriate.

There are many things I’ve implemented that were mostly inspired by the needs of PEP 734 or because of my experiences with it, but most (or nearly all?) of them are valuable on their own. They’ve certainly helped us identify flaws in CPython over the last five years.

When it comes to invasiveness, I’ve been careful to keep a lot of the relevant code isolated to certain files. Nearly all the C-API I’ve added is strictly limited to the internal API (Include/internal/pycore_*.h).

If you were to ask what we would rip out if PEP 734 were rejected, I’m not sure that I’d put anything on that list. Basically it is valuable in testing subinterpreters, at the least.

To put it a different way, there are only a handful of things in the repo that are currently only used by the PEP 734 implementation (and they are almost all consolidated in specific files). That includes only one piece of runtime state: the internal “cross-interpreter data registry”, which I’d already like to replace with a new type slot (via a separate PEP) rather than adding a public API.

Early on I worked hard to implement PEP 734 in a way that I could publish it on PyPI. In fact, my plan has been to publish such a package for use with 3.12. The same could be done for 3.13, though I think it’s better suited for the stdlib.

My main concern with publishing a PyPI module is that there will be less exposure to the feature and less accessibility. I’m certainly biased here, but I’m convinced the multiple interpreters feature is a great benefit to users and want it in their hands with the least friction possible. Again, I’m confident that the minimal API provided by PEP 734 is the right place to start.

That minimal module is basically what PEP 734 specifies. We could certainly move the PEP 734 implementation to a PyPI package, either my building on the necessary existing internal C-API or via what is currently the _xxsubinterpreters module. (That said, I don’t see the value in doing so over adding the new stdlib module.)

I’ll meet at the office hours this week.

ncoghlan · March 28, 2024, 2:38am

I think this is true for the proposed semantics, but I’m not 100% convinced it is true for the exact naming choices.

Specifically, given the PEP (and presumably documentation) terminology focuses on “shareable objects” (vs arbitrary objects), the name of the syncobj flag on interpreters.Queue objects seems odd. It feels like share_data would better describe the behaviour being requested.

For the notion of operating on PyPI for a release cycle, the natural API split point would presumably be to have the _interpreters module in the standard library, and put interpreters on PyPI (including InterpreterPoolExecutor).

The main advantage I see to this more conservative approach is that it would allow some of the open questions in the PEP to be deferred until the module’s promotion to the standard library (whether that comes next release or later), specifically:

default behaviour of Queue objects when the interpreter adding the object to the queue goes away before the object is retrieved. The current default feels prone to “errors passing silently without being explicitly silenced” to me, akin to the original handling of async tasks that never get scheduled.
exact API design for InterpreterPoolExecutor (in particular, how the new interpreters are configured, and how the pools support execution of configuration code in a way that ensures every interpreter in each pool is configured exactly once. This will presumably be similar to ThreadPoolExecutor and ProcessPoolExecutor, but I’m not sure the existing initializer API will quite be sufficient for InterpreterPoolExecutor)
whether some improved ergonomics are feasible for cross-interpreter exception handling (e.g naming a parameterless context manager as a dotted-string when calling Interpreter.exec, so the actual execution in the other interpreter runs inside a with statement using that context manager)
building out a list of other not-yet-shareable object types where it would be genuinely helpful to be able to share them
ensuring the data buffer sharing works as expected with other data buffer exporters (NumPy, etc)

While the initial level of adoption will definitely be lower than for a generally available standard library module, I think the adoption you’ll see will be from exactly those you most want at this stage of feature publication: folks that aren’t happy with the trade-offs between threads and processes, and are actively looking for something that strikes the middle ground of combining low-overhead data sharing with strong default data separation.

barry · April 10, 2024, 9:36pm

Hi Eric,

Thank you again for coming to the Steering Council’s office hours to discuss PEP 734. We all found it very helpful to talk about the PEP with you in real time.

After much discussion, the Steering Council thinks that the best way forward with this module is to maintain it separately for now, release it on PyPI, and let it mature there for a while before including it in the stdlib. There are several reasons leading us to this decision.

We think the API needs more real-world usage before it can be deemed stable. It may indeed be the best API available, but without some maturation on PyPI, we can’t really know for sure. With the module being independently developed and released, its API can evolve much more quickly than it can once the module is in the stdlib. It will also not have to adhere to the strict backward compatibility and deprecation rules of the stdlib. The Steering Council thinks this is a good policy in general for new stdlib packages, and plans to propose this as “standard operation procedure” for most new packages.

You expressed a concern about the maintenance burden of a PyPI release, but we think that’s solvable. You should be able to recruit co-maintainers, either from the current cohort of core developers, or from users of the subinterpreters package. We’re also confident that we can get help with setting up any automation needed for testing and releases. It shouldn’t be much more of a burden developing and releasing it independently for now than it would be in the stdlib.

We are going to mark the PEP as Deferred, and we can always reevaluate stdlib inclusion for a future version of Python.

Cheers,
-Barry (on behalf of the Steering Council)

steve.dower · April 11, 2024, 12:24pm

My understanding of this PEP is that would mean Eric can add the API as private but exposed (e.g. a _subinterpreters module) and then use a PyPI package to expose it? That allows some freedom to change it in a subsequent release, but the PyPI package probably wouldn’t amount to much more than from _subinterpreters import * (and InterpreterPoolExecutor, which I agree could live outside of the stdlib).

In other words, it’s “provisional” except we don’t mark stuff as provisional anymore so it’s just “private”.

The whole concept is based around exporting internal functionality, so it’s not like the API can be separated from the interpreter. Behaviour changes have to occur within the runtime, not within the module, and the interesting development is all at a level higher than proposed here (again, except InterpreterPoolExecutor). Right now, ctypes would be needed to access our internal C APIs to get the same behaviour, and I don’t think there’s really any way to do the synchronisation needed to actually make that work.

I’m sure Eric presented this analogy, but this module here is essentially exposing os.fork so that 3rd parties can develop multiprocessing outside of the stdlib. If the SC is okay with exposing os._fork for now, then I’m sure that’ll be workable, but if the SC is going to be surprised at the new internal APIs showing up with no stdlib users, it’d be good to clear that up sooner rather than later.

eric.snow · April 11, 2024, 5:56pm

Thanks for the taking the time to consider the PEP and for being clear about the position of the Steering Council. While I don’t agree with the decision and would have liked more direct discussion with the Steering Council, I’m on board and ready to move forward.

FWIW, I do see the point you (and Alyssa) have made about uncertainty with the design choices (e.g. names), regardless of the scale of the proposal. I also agree that community feedback from an implementation of the PEP on PyPI has a good chance of identifying potential improvements.

(Personally, I consider the advantage of exposure in the stdlib to outweigh the risk of having to tweak the API in later versions, given the small proposed surface area. That said, I’ll readily admit that my perspective is skewed toward the value I anticipate the new module to add for Python users, making it hard for me to tell if I’m weighing things fairly.)

Here’s my plan for 3.13:

wrap up the various various 3.13 fixes I have in flight (or planned)
rename the _xxsubinterpreters module to _interpreters
work on a PyPI package (3.12/3.13) that uses _interpreters (still keeping it minimal)
look for collaborators

There are alternatives for the second point: expose a bunch of necessary internal C-API in the public API, or use the internal C-API directly (i.e. with Py_BUILD_CORE). However, I’d much prefer using the low-level _interpreters module as it make a lot of things simpler.

eric.snow · April 11, 2024, 6:38pm

Also, I have some feedback for the Steering Council on how things have played out with this PEP, involving level-of-interaction and clarity on the decision. I’m not interested in complaining/venting, but rather want to identify some key good/bad parts of my experience here, in the hope that it helps the Steering Council and the community. (I’d also like to see what we can do to better support the Steering Council, who are volunteers like the rest of us, but have a distinct (and challenging) role as gatekeepers.)

Where would be the best place to start such a discussion?

barry · April 11, 2024, 11:38pm

That seems like a good plan to me ^[1]. Keep in mind too that the PEP is deferred so you can definitely come back and ask for a re-evaluation for a future Python release.

wearing my core dev hat, not necessarily SC member hat ↩︎

barry · April 11, 2024, 11:41pm

Speaking with my SC member hat on, I certainly would welcome feedback. Maybe start with another SC office hours session? If you wanted to write down your thoughts first, an email to the steering council?

rtobar · September 26, 2024, 4:28am

Hi all,

Yesterday I opened this discussion about the state of PEP 734 in the “Python Help” category in this forum. It’s been a day getting a few views, but no replies, and I fear I might have posted the message in the wrong category – maybe I should have written it here directly instead. This is my first time opening a discussion in this forum, so I wasn’t sure what the rules were.

In any case, I thought mentioning it here might attract the correct audience. Maybe this cross-posting is frowned-upon though, so please excuse me in advance if that’s the case. Thanks!

eric.snow · September 26, 2024, 10:10pm

No, you did the right thing. I just haven’t gotten to your post yet. It will probably be today. Sorry for the wait.

TkTech · November 11, 2024, 5:57am

When using InterpreterPoolExecutor, the started sub-processors do not seem to fully inherit the sys.path of the parent, which does work for all other types of executors. I ran into this when using sub-interpreters inside of a pytest-based tests, as the started jobs did not get the test directory in their sys.path. This is under 3.13 using the latest backport.

I’m not entirely convinced this is a bug as it’s a logical outcome, but it’s certainly unexpected when compared to the other executors.

eric.snow · November 11, 2024, 10:05pm

I’d say this is worth a bug report. Please open an issue and feel free to CC me (@ericsnowcurrently) on it.

ZeroIntensity · November 13, 2024, 10:12pm

For reference: InterpreterPoolExecutor workers do not inherit modifications made to sys.path before starting. · Issue #126714 · python/cpython · GitHub

a-reich · April 28, 2025, 5:02pm

I saw the PEP was resubmitted to the SC - any expectation on whether they will make a decision in time for the 3.14 release?

barry · June 5, 2025, 7:35pm

After discussing PEP 734 before and during PyCon, with you and many others, and confirming with @hugovk that he will issue a feature freeze exception for Python 3.14, the Steering Council approves PEP 734, with one suggested change (see below).

The Python community is incredibly grateful for your diligence, tenacity, and expertise in bringing the subinterpreters feature to fruition. Despite the Zen of Python’s “There’s Only One Way To Do It”, we believe that giving Python developers options for parallelism is ultimately in their best interest. Choosing which mechanism to use for concurrent programming in Python is a complex decision, and subinterpreters brings an important and useful option to the table. Besides that, all the work you’ve done “under the hood” has helped improve Python in general, and that is also greatly appreciated.

We have one request. We think rather than adding a new top-level subinterpreters module in the stdlib, a better place for it would be concurrent.interpreters. This should make the module easily discoverable, and it gives more validity to the concurrent package. We imagine that if we add additional APIs for free threading in the future, they might also go under the concurrent package.

If you’re agreeable, then congratulations! Please update the PEP accordingly and coordinate with Hugo on any PRs requiring an exception.

Barry, on behalf of the Steering Council

eric.snow · June 5, 2025, 8:35pm

This is great news!!! Thanks for taking the time and making this possible for 3.14. I definitely look forward to all the things the community will be able to invent with multiple interpreters once we have this stdlib module.

As to concurrent.interpreters, I’m not sure it fits. The module is meant to minimally expose the existing runtime feature, rather than being specifically designed around supporting concurrency. Using interpreters to enable parallelism and human-friendly concurrency were certainly my motivations. However, interpreters don’t actually provide any concurrency themselves. Other features, like threads, provide the actual concurrency. Thus, I wouldn’t expect to find the module as concurrent.interpreters.

FWIW, I totally support making better use of concurrent. That’s a package where I would expect to find general, high-level, concurrency-related utilities. For example, if we were to add a CSP module some day, I’d expect to find it at concurrent.csp. Honestly, multiprocessing would probably be more appropriate as concurrent.multiprocessing (or similar), given that it is exactly a high-level abstraction of concurrency. I can totally see us adding different data structures and helpers related to concurrency, like thread-safe containers.

Interpreters aren’t that sort of thing though. Rather than a high-level abstraction, they are a fundamental, low-level runtime feature. It would be more appropriate to group the module with other modules that expose the runtime, like sys, builtins, atexit, types, inspect, dis, etc. I suppose it could be sys.interpreters. However, I’m not sure about setting that precedent (of a sys submodule) without more thought. Historically such modules have all been top-level.

That said, the queues that are a part of PEP 734 do specifically support concurrency. They are important because using multiple interpreters is substantially less meaningful without a way to communicate between them. Exposing cross-interpreter queues in a submodule of the concurrent package could definitely make sense, though I’m not sure what we’d call it. Probably not concurrent.queue, but maybe concurrent.isolated.queues? Or perhaps it would work under concurrent.queue if the type were named something like IsolatedQueue or CrossInterpreterQueue. Anyway, we could sort that out for 3.15 and just expose interpreters.create_queue() for 3.14 (which is what is already implemented).

So…here are the options that make sense to me:

keep it as interpreters, keep interpreters.create_queue(), change the name to CrossInterpreterQueue, and sort out a dedicated home for the queues in 3.15 (with a separate PEP or something)
likewise, but sys.interpreters
like (1) and (2), but sort out the home for queues in 3.14

Realistically, it’s too late to sort out queues for 3.14 other than to keep interpreters.create_queue(). It might be a bit late to be deciding about adding a sys submodule, but, honestly, the more I think about it the more I’m okay with it. (For 3.15, we could even consider moving more stuff to sys submodules.)

Ultimately, if this boils down to a decision between concurrent.interpreters or wait until 3.15 then I’ll go along with the former to get the module out sooner. I think it’s that important to get it into the hands of users.

-eric

encukou · June 5, 2025, 9:23pm

AFAIK, (sub-)interpreters always were about isolation. Isolated GILs are a great application of course, but so is, for example, isolating sys.path to serve two web apps with conflicting versions of a common dependency.

ncoghlan · June 6, 2025, 2:36am

@ericsnowcurrently While interpreters aren’t necessarily about concurrent execution, they are about maintaining concurrent state (as opposed to repeatedly initialising and finalising the main interpreter, or mutating the main interpreter’s global state).

We certainly have more concurrency related things outside the concurrent namespace than inside, but putting interpreters there seems reasonable enough to me to not be worth worrying further about (resolving arguable naming decisions is one of the reasons we have the SC, after all).

encukou · June 6, 2025, 6:01am

Oh, of course, put it in under the name the SC chose! There’ll always be opposition to any idea, don’t feel blocked by that.

Yeah, but usually there’s some prior public discussion.
The decision is arguable, but I have not seen the arguments. More importantly, I’m not sure the SC has seen the arguments :‍)

That’s certainly a meaning of the word concurrent. What’s less clear to me is whether Python should adopt this meaning.
Note that the proposed module provides a synchronous, blocking API (plus one 4-line helper that wraps call in a new thread).

Compare: In turtle, you usually call functions like forward to move the turtle around. But you can also create concurrent turtles, and control each one individually. It’s even possible to have concurrent turtle windows (TurtleScreens), though there isn’t a convenient Python API for that use case.

Or: list is concurrent—you can have several lists at once, and appending to one doesn’t affect the others.

I think that multiple is a better term, and it’s better to only use concurrent as an antonym to synchronous/blocking.
Of course, it’s an arguable opinion, and you don’t need to convince me that I’m wrong. All I ask is that a SC member is aware of my dissent.