Clarification on cProfile Timing Resolution and the ".001 seconds" Clock Tick Statement

Hello!
While reading the cProfile documentation's Limitations section, I came across this statement:

The most obvious restriction is that the underlying “clock” is only ticking at a rate (typically) of about .001 seconds.

I’m trying to understand what this actually means.

From what I can see in the CPython source code, cProfile appears to use clock_gettime(CLOCK_MONOTONIC_RAW, ...) on Linux for time measurement. When I check the clock resolution with clock_getres on my system, it returns 1 ns, so I'm confused about why the documentation suggests a 1 ms resolution ("about .001 seconds"); that seems like a mismatch of several orders of magnitude.
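For reference, a minimal sketch of that same resolution check done from Python rather than C (assuming a Linux system where time.CLOCK_MONOTONIC_RAW is available):

import time

# Reported resolution of the raw monotonic clock, i.e. what clock_getres(2) returns.
# On many Linux systems this prints 1e-09 (1 ns).
print(time.clock_getres(time.CLOCK_MONOTONIC_RAW))

# What Python reports for the monotonic clock it exposes.
info = time.get_clock_info("monotonic")
print(info.implementation, info.resolution)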

Could anybody explain this mismatch?

Those are two separate properties of a clock:

  1. The rate at which it ticks.
  2. The precision of the reported time.

The clock is likely ticking every 1 ms, and the time is reported with 1 ns resolution.


Check timer source:
cat /sys/devices/system/clocksource/clocksource0/current_clocksource
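If you prefer to read that from Python, a small sketch (assuming the usual sysfs layout on Linux):

from pathlib import Path

# Standard sysfs location on Linux; the path may differ on unusual setups.
base = Path("/sys/devices/system/clocksource/clocksource0")
print("current:  ", (base / "current_clocksource").read_text().strip())
print("available:", (base / "available_clocksource").read_text().strip())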

Tick Rate
#include <stdio.h>
#include <time.h>

int main() {
    struct timespec t1, t2;

    /* Spin until the reported time changes, then print the difference. */
    clock_gettime(CLOCK_MONOTONIC, &t1);
    do {
        clock_gettime(CLOCK_MONOTONIC, &t2);
    } while (t1.tv_nsec == t2.tv_nsec && t1.tv_sec == t2.tv_sec);

    long delta_ns = (t2.tv_sec - t1.tv_sec) * 1000000000L + (t2.tv_nsec - t1.tv_nsec);
    printf("Observed tick: %ld ns\n", delta_ns);
    return 0;
}
Python equivalent:

import time

clock = time.CLOCK_MONOTONIC

t1 = time.clock_gettime_ns(clock)
while True:
    t2 = time.clock_gettime_ns(clock)
    if t2 != t1:
        break

delta_ns = t2 - t1
print(f"Observed tick: {delta_ns} ns")

@barry-scott @elis.byberi

Thank you very much! I now understand the difference between the clock tick rate and the reported resolution. In my environment, the observed tick is about 100 ns, which is much coarser than the 1 ns reported resolution. Still, there is an orders-of-magnitude (roughly 10,000x) gap between this measurement and the 1 ms figure in the docs, which leaves me confused.

h-ishida@umejuice:~/tmp/tmp$ cat tmp.py 
import time

for _ in range(10):
    clock = time.CLOCK_MONOTONIC
    t1 = time.clock_gettime_ns(clock)
    while True:
        t2 = time.clock_gettime_ns(clock)
        if t2 != t1:
            break
    delta_ns = t2 - t1
    print(f"Observed tick: {delta_ns} ns")
h-ishida@umejuice:~/tmp/tmp$ python3 tmp.py 
Observed tick: 390 ns
Observed tick: 151 ns
Observed tick: 150 ns
Observed tick: 90 ns
Observed tick: 111 ns
Observed tick: 110 ns
Observed tick: 90 ns
Observed tick: 80 ns
Observed tick: 100 ns
Observed tick: 100 ns

It seems you are on a Linux system. The default clock tick there is 1000 Hz.

You need to use a blocking sleep call to see the effect of the clock tick rate. Try asking for the time after a sleep of 0.00001. I expect you to see reported times about 1 ms apart, not 0.1 ms apart.

If you query the clock in a tight loop, you only get to see the sampling accuracy.
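Something along these lines (a rough sketch; the sleep durations are arbitrary) shows where sleep-based timing bottoms out:

import time

# Measure how long short sleeps actually take, for a range of requested durations.
for requested in (0.01, 0.001, 0.0001, 0.00001, 0.000001):
    t1 = time.clock_gettime_ns(time.CLOCK_MONOTONIC)
    time.sleep(requested)
    t2 = time.clock_gettime_ns(time.CLOCK_MONOTONIC)
    print(f"requested {requested:.6f} s, observed {(t2 - t1) / 1e9:.6f} s")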

I believe it should be “scheduler clock,” not just “clock.”

Scheduling latency
import time
import multiprocessing


def busy():
    # CPU-bound worker that never yields voluntarily.
    while True:
        pass


if __name__ == "__main__":
    clock = time.CLOCK_MONOTONIC
    t1 = time.clock_gettime_ns(clock)

    # Start a competing CPU-bound process, then spin until the reported
    # time differs from the timestamp taken above.
    p = multiprocessing.Process(target=busy)
    p.start()

    while True:
        t2 = time.clock_gettime_ns(clock)
        if t2 != t1:
            break

    delta_ns = t2 - t1
    print(f"Scheduling latency: {delta_ns} ns")

    p.terminate()

Result:
Scheduling latency: 5264307 ns

The current wording in the documentation is confusing. It should specify “scheduling clock” or “kernel timer,” rather than just “clock.”

I used this script to test the scheduler granularity. I named it tmpdir/t.py

import time
import sys

t1 = time.clock_gettime_ns(time.CLOCK_MONOTONIC)
time.sleep(float(sys.argv[1]))
t2 = time.clock_gettime_ns(time.CLOCK_MONOTONIC)
print('%f' % ((t2 - t1) / 1_000_000_000,))

Then I ran it on my Fedora 42 system, which has kernel 6.15.5-200.fc42.x86_64

$ python3 tmpdir/t.py 0.1
0.100258
$ python3 tmpdir/t.py 0.01
0.010096
$ python3 tmpdir/t.py 0.001
0.001079
$ python3 tmpdir/t.py 0.0001
0.000159
$ python3 tmpdir/t.py 0.00001
0.000079
$ python3 tmpdir/t.py 0.000001
0.000059
$ python3 tmpdir/t.py 0.0000001
0.000059

I thought the 0.001 s figure was true for Windows systems, but I see the same behavior with your script on my Windows 10 system: it doesn't get below 0.00006 s.
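For a quick cross-platform comparison, Python can also report what it believes about its own clocks (a sketch; the values vary by OS and Python build):

import time

# Implementation and resolution Python reports for its timing-related clocks.
for name in ("time", "monotonic", "perf_counter", "process_time"):
    info = time.get_clock_info(name)
    print(f"{name:>12}: {info.implementation}, resolution {info.resolution} s")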

Actually, I asked in a CPython issue, and according to a contributor, it seems the documentation is simply outdated.


It’s not exactly outdated; it’s still relevant, but it’s unclear what the section is trying to explain. It may require in-depth knowledge of implementation details, especially since it discusses limitations.

Scheduler kernel timers are still observable, as I demonstrated in my previous post using multiprocessing (or threading).

The purpose of the Limitations section is to convey that certain constraints exist due to software or hardware limitations, without going too deep into technical detail.

It seems more like a note to one’s future self, though. :slightly_smiling_face:

The Linux scheduler has had a lot of work done on it; that seems to be why the 1 ms tick is no longer a limit.

I would not be surprised to find similar improvements in scheduling on macOS and Windows 11.