pytorch / kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

Using microseconds for duration/timestamp will result in loss of precision

fwenguang opened this issue · comments

Related issue in pytorch: pytorch/pytorch#116830

That's a fair point that data for events does come in with nanosecond granularity for start and end timestamps.

AFAIK the main constraint is the Chrome Trace format. We use these 'X' Complete events to represent the kernels and operators. Unfortunately, they specify timing at microsecond resolution only:

There is an extra parameter dur to specify the tracing clock duration of complete events in microseconds. All other parameters are the same as in duration events.

And all timestamps are in microseconds too:

ts: The tracing clock timestamp of the event. The timestamps are provided at microsecond granularity.

https://docs.google.com/document/d/1CvAClvFfyA5R-PhYUmn5OOQtYMH4h6I0nSsKchNAySU/preview

Yes, ts and dur are at microsecond granularity. But they can be floating-point values as far as I know.

So if there are no other constraints, I think we would only need to perform the conversion from nanoseconds (int64) to microseconds (double) when saving to a Chrome trace file. Using floating-point values for 'ts' and 'dur' would prevent losing time on the order of nanoseconds, or even hundreds of nanoseconds, per event.

Additionally, 'ts' needs to use relative timestamps: timestamps in nanoseconds require 19 digits, but a double can typically represent only around 15-16 significant decimal digits (see the sketch below).
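
To make the precision argument concrete, here is a minimal standalone sketch; the base timestamp is a hypothetical value chosen for illustration, not anything kineto actually computes:

#include <cstdint>
#include <cstdio>

int main() {
  // A 19-digit epoch timestamp in nanoseconds, taken from the example
  // later in this thread.
  int64_t ts_ns = 1706495062881409718LL;

  // A double has a 53-bit mantissa (~15-16 significant decimal digits),
  // so converting the absolute value rounds away the low-order digits.
  double abs_us = static_cast<double>(ts_ns) / 1000.0;
  printf("absolute us: %.3f\n", abs_us);  // last digits are not exact

  // Subtracting a trace-start base first (hypothetical value) keeps the
  // magnitude small enough for a double to be exact to the nanosecond.
  int64_t base_ns = 1706495062000000000LL;
  double rel_us = static_cast<double>(ts_ns - base_ns) / 1000.0;
  printf("relative us: %.3f\n", rel_us);  // prints 881409.718
  return 0;
}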

But they can be floating-point values as far as I know.

Oh I didn't know that. We can try that actually.

Additionally, 'ts' needs to use relative timestamps: timestamps in nanoseconds require 19 digits, but a double can typically represent only around 15-16 significant decimal digits.

How about just using a fixed-point representation with 3 digits after the decimal point?
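
For concreteness, "fixed point with 3 digits" presumably means printing the nanosecond count as microseconds with exactly three fractional digits using integer arithmetic only, so nothing is rounded at any magnitude. A minimal sketch (hypothetical helper, not kineto's actual serializer):

#include <cstdint>
#include <cstdio>

// Render a non-negative nanosecond count as microseconds with exactly
// three fractional digits, using integer arithmetic only.
void printNsAsFixedUs(int64_t ns) {
  printf("%lld.%03lld", static_cast<long long>(ns / 1000),
         static_cast<long long>(ns % 1000));
}

int main() {
  printNsAsFixedUs(1706495062881409718LL);  // prints 1706495062881409.718
  printf("\n");
  return 0;
}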

How about just using a fixed-point representation with 3 digits after the decimal point?

Sorry, I don't fully understand that. Which field would be used to save these 3 digits?

If the 'ts' in nanoseconds is 1706495062881409718, it was like this before the modification:

{
    "ph": "X", "cat": "kernel", "name": "kernel_name", "pid": 0, "tid": 1,
    "ts": 1706495062881409, "dur": 36,
    "args": {
      ...
    }
  }

What should it be like after the modification?
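
Under the fixed-point suggestion above, one plausible shape would be the following (an illustration only, not necessarily what the linked PR emits):

{
    "ph": "X", "cat": "kernel", "name": "kernel_name", "pid": 0, "tid": 1,
    "ts": 1706495062881409.718, "dur": 36.000,
    "args": {
      ...
    }
  }

Note that the digits are preserved in the file text, but a consumer that parses 'ts' into a 64-bit double will still round the absolute value, which is why the relative-timestamp idea above also matters.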

Resolved in this PR: pytorch/pytorch#123650