feat(encodings + SubIntSplit): Extension to SubIntSplit Cost Model Selection#880

Open
David-C-L wants to merge 22 commits into
facebookincubator:mainfrom
David-C-L:subint_split_selection_extension

Conversation

@David-C-L

Contributor

PR for extending the SubIntSplit Cost Models to support:

  • BlockBitpacking
  • FOR
  • PFOR
  • FrequencyPartitioning
  • Delta
  • SIMDBitpacking

This PR adds methods to some of these encodings for estimating encoded size from Statistics, and adds each of the encodings above as a candidate in the SubIntSplit dynamic programming algorithm that selects bit-splits and encodings.
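
For readers unfamiliar with the approach, here is a minimal sketch of the general idea: estimate each candidate encoding's size from summary statistics, then run a dynamic program over bit-split points. It is not the code in this PR. `StreamStats`, `estimateBitpackedBytes`, `estimateForBytes`, `fieldStats`, and `bestSplitBytes` are hypothetical names, the 8-byte FOR header is an assumed cost, and only two of the listed encodings are modelled.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <limits>
#include <vector>

// Hypothetical summary statistics for one stream (not Nimble's Statistics class).
struct StreamStats {
  uint64_t count;
  uint64_t min;
  uint64_t max;
};

// Number of bits needed to represent `value`; zero needs no bits.
inline uint32_t bitsRequired(uint64_t value) {
  uint32_t bits = 0;
  while (value != 0) {
    ++bits;
    value >>= 1;
  }
  return bits;
}

// Plain bit packing: every value is stored with enough bits to hold the max.
inline uint64_t estimateBitpackedBytes(const StreamStats& s) {
  return (s.count * bitsRequired(s.max) + 7) / 8;
}

// Frame of reference: store min once (assumed 8-byte header), pack value - min.
inline uint64_t estimateForBytes(const StreamStats& s) {
  return sizeof(uint64_t) + (s.count * bitsRequired(s.max - s.min) + 7) / 8;
}

inline uint64_t cheapestEstimate(const StreamStats& s) {
  return std::min(estimateBitpackedBytes(s), estimateForBytes(s));
}

// Statistics of the bit field [lo, hi) extracted from every value.
inline StreamStats fieldStats(
    const std::vector<uint64_t>& values, uint32_t lo, uint32_t hi) {
  if (values.empty()) {
    return StreamStats{0, 0, 0};
  }
  const uint32_t width = hi - lo;
  const uint64_t mask = width >= 64 ? ~0ULL : ((1ULL << width) - 1);
  StreamStats s{values.size(), std::numeric_limits<uint64_t>::max(), 0};
  for (uint64_t v : values) {
    const uint64_t field = (v >> lo) & mask;
    s.min = std::min(s.min, field);
    s.max = std::max(s.max, field);
  }
  return s;
}

// best[b] holds the cheapest estimated size for encoding bits [0, b).
// Split points are restricted to multiples of `step`.
inline uint64_t bestSplitBytes(
    const std::vector<uint64_t>& values, uint32_t totalBits, uint32_t step) {
  assert(step > 0 && totalBits <= 64 && totalBits % step == 0);
  std::vector<uint64_t> best(
      totalBits + 1, std::numeric_limits<uint64_t>::max());
  best[0] = 0;
  for (uint32_t hi = step; hi <= totalBits; hi += step) {
    for (uint32_t lo = 0; lo < hi; lo += step) {
      const uint64_t cost = cheapestEstimate(fieldStats(values, lo, hi));
      best[hi] = std::min(best[hi], best[lo] + cost);
    }
  }
  return best[totalBits];
}
```

In this sketch, adding another encoding to the selection means adding one more estimator to `cheapestEstimate`. A real selector would also need to record which split and encoding produced each minimum, which is omitted here.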

David-C-L added 22 commits June 16, 2026 14:49
…erload with statistics for use in SubIntSplit
…s for BlockBitPacking, PFOR, and SIMDBitpacking
…lection to consider PFOR, SimdForBitpack and BlockBitPacking
…d existing SubIntSplit tests to test PFOR, SimdBitpack, BlockBitPack integration
…rt freq part tiers (deduplicating DominantValue logic where possible) and implement freq part cost model
@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jun 16, 2026
@David-C-L David-C-L changed the title Extension to SubIntSplit Cost Model Selection Jun 16, 2026
@David-C-L David-C-L changed the title feat[encodings + SubIntSplit]: Extension to SubIntSplit Cost Model Selection Jun 16, 2026
@meta-codesync

meta-codesync Bot commented Jun 16, 2026

@srsuryadev has imported this pull request. If you are a Meta employee, you can view this in D108818670.

@srsuryadev srsuryadev left a comment

Contributor

@David-C-L Let's also update the test plan for the cost model selection changes to include microbenchmark results, so that we avoid spurious selections: https://github.com/facebookincubator/nimble/tree/main/dwio/nimble/encodings/benchmarks

Labels

CLA Signed This label is managed by the Meta Open Source bot.

3 participants