TensorRT packaging relies on nested pip execution

I’ll admit I felt a little sick to my stomach when I stumbled upon this gem:


import subprocess

from setuptools.command.install import install

# run_pip_command, nvidia_pip_index_url and tensorrt_submodules are
# helpers defined elsewhere in the same setup.py
class InstallCommand(install):
    def run(self):
        # pip-inside-pip hack ref #3080
        run_pip_command(
            [
                "install",
                "--extra-index-url",
                nvidia_pip_index_url,
                *tensorrt_submodules,
            ],
            subprocess.check_call,
        )

        super().run()
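For context on how a command class like this gets invoked: a setup.py registers it through setuptools’ cmdclass argument. Here is a minimal self-contained sketch, with a no-op command standing in for the pip-calling one quoted above:

```python
# Minimal sketch of how a custom command class is wired into setup.py.
# The no-op command below stands in for the pip-calling one quoted above.
from setuptools.command.install import install


class InstallCommand(install):
    def run(self):
        # custom pre-install work would go here,
        # then the normal install proceeds
        super().run()


# Passing this mapping as setup(..., cmdclass=CMDCLASS) makes
# `pip install .` route the install step through the custom command.
CMDCLASS = {"install": InstallCommand}
```

This is exactly why the nested pip call runs at install time for every user of the sdist: pip builds/installs the sdist by running the registered install command.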

You can see some of the underlying motivation for the pip-ception here: it seems the goal is to dynamically extend pip’s index URLs to include the NVIDIA indexes.

This also feels like it could encourage bad practices around dependency confusion and --extra-index-url.

It also won’t work for other package managers, I presume.

If there truly isn’t a better way than a nested pip install, it would almost be better for installation of the sdist to simply fail and print out instructions for how to ensure the NVIDIA index URL is set.

It seems like they’re using setup.py to configure the user’s system, so that future invocations of Pip will know about their own package indexes.

There’s what I interpret as basically a different version of this, which appears to work by locating Pip’s config file and rewriting it. (And in order to search for the config file, it’s trying to use functionality from pip._internal.configuration…).
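For comparison, the config file itself can be edited with only the standard library, without reaching into pip._internal.configuration. A rough sketch (the path handling is simplified; pip actually checks several platform-specific locations, so the path here is the caller’s responsibility):

```python
# Sketch: append an extra index URL to a pip config file using only the
# standard library. Simplified; pip checks multiple platform-specific
# config locations, which this deliberately does not reimplement.
import configparser
from pathlib import Path


def add_extra_index_url(config_path: Path, url: str) -> None:
    """Append `url` to the [global] extra-index-url setting of a pip.conf."""
    config = configparser.ConfigParser()
    config.read(config_path)  # silently tolerates a missing file
    if not config.has_section("global"):
        config.add_section("global")
    existing = config.get("global", "extra-index-url", fallback="")
    urls = [u for u in existing.split() if u]
    if url not in urls:  # keep the operation idempotent
        urls.append(url)
    config.set("global", "extra-index-url", "\n".join(urls))
    config_path.parent.mkdir(parents=True, exist_ok=True)
    with config_path.open("w") as f:
        config.write(f)
```

Whether a package *should* do this to a user’s config is of course the whole objection; the point is only that no pip internals are needed.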

IMO this is clearly not how the system is supposed to work. Regardless of whether Pip intends to offer an API for other programs to locate or edit Pip config files, none of this makes sense to wrap into a setup.py or fit into the Setuptools framework. It isn’t actually installing usable Python code (yet). It should be provided as a separate Python script that can just be downloaded and run. (It’s not as if going through setup.py is any more secure than that!)

In this context, it seems like they’re doing it as a step in a larger sdist-building process, which would on its face seem more justifiable. But I agree that this feels quite wrong. There’s only so much that can be done automatically to try to make an sdist work on a remote machine, when you don’t control the initial state of the environment. It also doesn’t seem very polite, to me, that they use this to make Pip grab packages from separate indices that the user didn’t specify on the command line. As a user I should be at least somewhat in control of what domains my machine is connecting to for downloads; I’d expect to follow instructions in a README and explicitly add the package index URLs myself first, instead.

(This previous thread seems related?)

Unfortunately, it is used as the primary documented approach to installing “tensorrt”, not as some advanced installation mode.

“tensorrt” is an sdist-only metapackage.

Yeah that seems pretty horrible. I’m not sure what the point is of raising it here though? Thanks for the heads up, I guess, at least I’ll now know to view any issues raised by users of TensorRT with suspicion…

To get TensorRT to change, someone would have to raise this with the project.

I’m not expecting anything from this forum; I raised it for awareness, so that standards-aware people have context on real-world usage.

Perhaps also on the off chance there’s a better recommendation for this package.

I personally think the cleanest approach would be:

  • a tensorrt sdist meta-package that fails and prints out instructions to install “tensorrt-real” wheels from nvidia indexes
  • a tensorrt-real package and wheels on nvidia indexes so that people can install directly from nvidia without the meta-package (a dummy package with the same name that fails on install would also need to be published on PyPI, presumably to guard against dependency confusion)

But I can imagine the pushback will be that this would break existing users of “pip install tensorrt”.
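The “fail and print instructions” half of this could be as small as an sdist whose setup.py aborts before doing anything. A hypothetical sketch (the package name and index URL below are illustrative placeholders, not NVIDIA’s actual ones):

```python
# Hypothetical "fail loudly" metapackage: instead of nesting a pip call,
# the sdist aborts with copy-pasteable instructions. The package name and
# index URL are illustrative placeholders.
import sys

INSTRUCTIONS = """\
tensorrt wheels are hosted on a separate index and are not installed
automatically. Install them explicitly with:

    pip install tensorrt-real --extra-index-url https://pypi.nvidia.com
"""


def fail_with_instructions() -> None:
    """What a metapackage setup.py could do instead of calling pip itself."""
    sys.stderr.write(INSTRUCTIONS)
    raise SystemExit(1)
```

The user stays in control of which indexes their machine talks to, at the cost of one extra manual step.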

It’s worth noting that this usage is (basically) unsupported. Python core doesn’t guarantee that you can safely mess with the site-packages of a running interpreter (at the very least you need to clear the import system’s caches), and pip doesn’t guarantee that it can be run concurrently on the same environment.
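The cache-clearing step mentioned above is importlib.invalidate_caches(). A small sketch of importing something added to the environment out-of-band after calling it (stdlib json is used here purely for demonstration):

```python
# After a package is added to site-packages while the interpreter is
# running (e.g. by a nested pip call), the import system's finder caches
# must be invalidated before the new package can be found reliably.
import importlib
import importlib.util


def import_after_install(module_name: str):
    importlib.invalidate_caches()  # make finders re-scan path entries
    if importlib.util.find_spec(module_name) is None:
        raise ImportError(f"{module_name} is still not importable")
    return importlib.import_module(module_name)
```

Even with this, modules that were already imported (or partially imported) are not refreshed, which is part of why pip-inside-pip is unsupported.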

So IMO it’s fine to ignore this usage as far as standards are concerned. If the TensorRT project would like standard-backed support for their use case, they can propose something, of course.

And I’m also certain that we’d engage with them to figure this out, but I expect that, as with other instances where we have gaps, the maintainers will need to talk to the underlying tooling maintainers about those gaps and describe what they want to do.

If you look at the packages on the nvidia index, their first problem seems pretty clear: they want different wheels for different CUDA versions. That is something that the selector package idea could help with.
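As a sketch of what such a selector might do, here is a hypothetical mapping from a detected CUDA version to a concrete distribution name. The tensorrt-cu* names are made up for illustration, and real detection would query the driver (e.g. via nvidia-smi) rather than take a version string as a parameter:

```python
# Hypothetical selector logic: pick a CUDA-specific distribution based on
# the detected CUDA version. Distribution names are illustrative only.
def select_tensorrt_dist(cuda_version: str) -> str:
    major = int(cuda_version.split(".")[0])
    if major >= 12:
        return "tensorrt-cu12"
    if major == 11:
        return "tensorrt-cu11"
    raise RuntimeError(f"no wheel variant for CUDA {cuda_version}")
```

The hard part, of course, is that this decision has to happen at install time on the target machine, which is exactly the gap the selector package idea is meant to fill.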

Also related: https://discuss.python.org/t/what-to-do-about-gpus-and-the-built-distributions-that-support-them/

Hi from NVIDIA. We’ve been working on a better way that respects existing standards. We aren’t quite ready to share it yet. Watch for a post from Ethan Smith around PyCon time in about two weeks. He’s on vacation right now, and because he developed it, we’d like to give him the chance to share it. We’ll link to it here.
