Apologies if there is somewhere obvious to find this info, or if the question has been asked before. I tried and couldn’t find anything that answered my question.
Is there somewhere to find the percentage of active pip versions hitting pypi? I recognize it will hide a lot of corporate usage, but should still be a reasonable proxy of versions out there.
This information can be derived by following the “Analyzing PyPI package downloads” guide in the Python Packaging User Guide. I don’t think anyone has publicly recorded this information outside of PyPI’s own BigQuery datasets.
You want details.installer.name to be pip, and group by details.installer.version.
Out of curiosity, why do you want this information?
I’m curious how quickly versions upgrade to latest.
I completely recognize how flawed the numbers can be, but I’m curious to know whether new versions roll out quickly because people run “pip install -U pip”.
Cool. That would be interesting to know - although given that we (the pip maintainers) don’t support older versions of pip, it’s mostly of academic interest to me.
As a counterpoint, you may be interested to know that if we roll out a version of pip with a bug then we usually get bug reports coming in within a matter of hours. There’s a lot of people who download and run the latest version of pip in their CI. I don’t know how much that will skew the figures.
Awesome! Thanks! That’s a very useful site and was precisely what I was looking for.
It seems that updates roll out very quickly for pip: the latest version reaches over 50% in a matter of days.
Hang on - is that downloads of pip, or downloads with pip? I thought @groodt wanted the latter.
Assuming it is downloads with pip, and therefore is based on pip’s user-agent string, it would be interesting to exclude runs in CI (there’s a ci=True/None value in there).
That’s downloads of pip, which is not really a relevant metric here—unless people have it pinned, or it is constrained by something else (neither of which is likely to be terribly common with pip itself), any download of pip will necessarily be the latest version, which indeed is reflected in the numbers. What is much more useful is the number of downloads of PyPI packages with a particular version of pip—which is tracked in the BigQuery DB, but I’m not aware of a site that exposes it easily aside from a few static analyses.
There’s a ci=True/None value in the Warehouse BigQuery data? It would indeed be very interesting to filter on that.
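For anyone who wants to try the CI filter, here is a minimal sketch of how the query text could be assembled. The table name `bigquery-public-data.pypi.file_downloads` and the `details.ci` column are assumptions based on the public PyPI BigQuery dataset, not something confirmed in this thread, and `build_pip_version_query` is a hypothetical helper name. It only builds the SQL string; running it still costs BigQuery quota.

```python
# Sketch only: the table name and the details.ci column are assumptions
# about the public PyPI BigQuery dataset, not taken from this thread.
TABLE = "bigquery-public-data.pypi.file_downloads"


def build_pip_version_query(days: int = 30, exclude_ci: bool = False) -> str:
    """Return SQL counting downloads made *with* pip, grouped by pip version."""
    if days < 1:
        raise ValueError("days must be at least 1")
    conditions = [
        'details.installer.name = "pip"',
        f"DATE(timestamp) BETWEEN DATE_SUB(CURRENT_DATE(), INTERVAL {int(days)} DAY)"
        " AND CURRENT_DATE()",
    ]
    if exclude_ci:
        # The flag is described as True/None, so "IS NOT TRUE" keeps NULL rows.
        conditions.append("details.ci IS NOT TRUE")
    where = "\n  AND ".join(conditions)
    return (
        "SELECT details.installer.version, COUNT(*) AS num_downloads\n"
        f"FROM `{TABLE}`\n"
        f"WHERE {where}\n"
        "GROUP BY details.installer.version\n"
        "ORDER BY num_downloads DESC"
    )


if __name__ == "__main__":
    print(build_pip_version_query(days=7, exclude_ci=True))
```

Comparing the output with and without `exclude_ci=True` would show how much CI traffic skews the distribution. A shorter window also scans less data.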
I ran out of my free-tier BigQuery quota for this.
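SELECT details.installer.version, COUNT(*) AS num_downloads
FROM `bigquery-public-data.pypi.file_downloads`
WHERE details.installer.name = "pip"
  AND DATE(timestamp) BETWEEN DATE_SUB(CURRENT_DATE(), INTERVAL 30 DAY) AND CURRENT_DATE()
GROUP BY details.installer.version
ORDER BY num_downloads DESC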
(taking the top 10)
(436.43 GB query size)
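Since the original question was about percentages, the raw counts from a query like this can be turned into shares. A minimal sketch follows; `version_shares` is a hypothetical helper, and the labels and counts in the example are placeholders, not real PyPI figures.

```python
from typing import Dict


def version_shares(counts: Dict[str, int]) -> Dict[str, float]:
    """Convert per-version download counts into percentage shares.

    The result is ordered from the most to the least downloaded version.
    """
    total = sum(counts.values())
    if total == 0:
        return {}
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    return {version: 100.0 * n / total for version, n in ranked}


if __name__ == "__main__":
    # Placeholder labels and counts for illustration only.
    example = {"older": 300, "newest": 600, "legacy": 100}
    for version, share in version_shares(example).items():
        print(f"{version}: {share:.1f}%")
```

Feed in every row of the result rather than only the top 10, otherwise the long tail is dropped from the total and each share is overstated.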
Yes, Paul was right, it needs to be the logs of downloads of other packages using pip.
The head of the distribution is still modern pip. 23.2.1 is rising quickly, and I expect it will take the number 1 spot within 4 weeks. Apart from 9.0.3 in the top 10, it does seem that the ecosystem benefits from new versions quite quickly.
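To put a number on "how quickly", one option is to group the same query by day as well and find the first day a version crosses a chosen share. This is only a sketch under that assumption: `first_day_above` is a hypothetical helper, it expects the days in chronological order, and the data in the example is made up.

```python
from typing import Dict, List, Optional, Tuple

DailyCounts = List[Tuple[str, Dict[str, int]]]


def first_day_above(
    daily: DailyCounts, version: str, threshold: float = 50.0
) -> Optional[str]:
    """Return the first day on which `version` exceeds `threshold` percent.

    `daily` is a chronologically ordered list of (day, {version: downloads}).
    Returns None if the threshold is never exceeded.
    """
    for day, counts in daily:
        total = sum(counts.values())
        if total and 100.0 * counts.get(version, 0) / total > threshold:
            return day
    return None


if __name__ == "__main__":
    # Made-up counts for illustration only.
    sample = [
        ("day 1", {"new": 10, "old": 90}),
        ("day 2", {"new": 40, "old": 60}),
        ("day 3", {"new": 70, "old": 30}),
    ]
    print(first_day_above(sample, "new"))
```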