graph, store: Make sure vid batching works with large vids #5970

lutter · 2025-04-23T21:32:38Z

Changing to the new vid scheme of `block_num << 32 + sequence_num` revealed some numerical problems in the batching code.
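To make the numerical problem concrete, here is a minimal sketch of how such a vid might be built and why a round trip through `f64` stops being exact for large block numbers. The helper names `make_vid` and `survives_f64_round_trip` are hypothetical and are not taken from the graph-node code base; the sketch also assumes the scheme means `(block_num << 32) + sequence_num`.

```rust
/// Hypothetical helper (not from graph-node): build a vid from a block
/// number and a per-block sequence number, assuming the scheme is
/// `(block_num << 32) + sequence_num`.
fn make_vid(block_num: i64, sequence_num: i64) -> i64 {
    (block_num << 32) + sequence_num
}

/// Returns true if converting `v` to f64 and back yields the same integer.
/// An f64 has a 53-bit significand, so integers above 2^53 are generally
/// not representable exactly.
fn survives_f64_round_trip(v: i64) -> bool {
    (v as f64) as i64 == v
}
```

With these definitions, `survives_f64_round_trip(make_vid(1_000_000, 1))` holds because the vid stays below 2^53, while `survives_f64_round_trip(make_vid(22_000_000, 1))` does not: that vid lies above 2^56, where neighbouring `f64` values are 16 apart.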

zorancv

This is a clear improvement over the current situation and is equivalent to it from an algebraic point of view, so I approved it with one remark for consideration in the future. Nice that you added a regression test case!

zorancv · 2025-04-25T16:48:10Z

graph/src/util/ogive.rs

+    // as f64) as i64 < points[j]`
+    //
+    // We therefore try to only convert differences between points to f64
+    // which are much smaller.


Here the assumption is that the difference between subsequent points is never more than about 2'000'000 blocks, as we have only 21 bits of the mantissa available (53 of f64 - 32 of the shift)? If that's not absolutely guaranteed, I would suggest converting all the calculations to work with i128. The bins_size could be represented as a ratio of two integers. The only drawback is the somewhat slow divisions...

lutter

That's a really good point. I'll leave this as-is for now, but if I ever need to touch it, I'll adopt your suggestion.
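The two approaches discussed in this thread can be sketched side by side. This is only an illustration under stated assumptions, not the code in `graph/src/util/ogive.rs`: the function names, the `k`/`n` parameters, and the idea of computing the `k`-th of `n` equally spaced boundaries between two points are all hypothetical. The first function converts only the difference between points to `f64`, as the code comment describes; the second keeps everything in `i128` and treats the bin size as the ratio `(hi - lo) / n`.

```rust
/// Hypothetical sketch: k-th of n equally spaced boundaries between `lo`
/// and `hi`, converting only the difference `hi - lo` to f64.
/// Exact only while `hi - lo` fits in the 53-bit significand.
/// Assumes 0 <= lo <= hi, so `hi - lo` cannot overflow.
fn boundary_f64_delta(lo: i64, hi: i64, k: u32, n: u32) -> i64 {
    assert!(n > 0 && k <= n && 0 <= lo && lo <= hi);
    let span = (hi - lo) as f64;
    lo + (span * k as f64 / n as f64) as i64
}

/// Hypothetical sketch of the all-integer alternative: the bin size is
/// kept as the ratio `span / n` and never turned into a float.
fn boundary_i128(lo: i64, hi: i64, k: u32, n: u32) -> i64 {
    assert!(n > 0 && k <= n && lo <= hi);
    let span = hi as i128 - lo as i128;
    // Multiply before dividing; i128 has ample headroom for an i64 span
    // times a u32 factor.
    let offset = span * k as i128 / n as i128;
    // The result lies in [lo, hi], so it always fits in i64.
    (lo as i128 + offset) as i64
}
```

For two points that are close together, such as `lo = 22_000_000 << 32` and `hi = lo + 1000`, both functions return `lo + 500` for `k = 1, n = 2`. The integer version additionally stays exact when the span itself is large, at the cost of 128-bit divisions.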

lutter requested a review from zorancv April 23, 2025 21:33

zorancv approved these changes Apr 25, 2025

graph, store: Make sure vid batching works with large vids

72834fd

lutter force-pushed the lutter/ogive-roundoff branch from 1a634c4 to 72834fd Compare April 25, 2025 22:15

lutter merged commit 72834fd into master Apr 25, 2025
6 checks passed