Created 1 year, 5 months ago

Active 1 year, 1 month ago

Viewed 446 times

2 replies

Recently I did many Profile-Guided Optimization (PGO) benchmarks on multiple projects (including interpreters and compilers like CPython, Rustc, Clang, Clangd, Clang Tidy, and many others) - the results are available here. I guess enabling the R interpreter would be a good option for the users since it can bring additional performance.

I can suggest the following things to do:

Evaluate PGO's applicability to the R interpreter with some benchmarks.
If PGO helps to achieve better performance - add a note to R's documentation about that. In this case, users and maintainers will be aware of another optimization opportunity for the R interpreter.
Provide PGO integration into the build scripts. It can help users and maintainers easily apply PGO on the R interpreter for their workloads.
Optimize pre-built binaries with PGO.

Here are some examples of how PGO is already integrated into other projects' build scripts:

Rustc: a CI script for the multi-stage build
GCC:
- Official docs, section "Building with profile feedback" (even AutoFDO build is supported)
- A part in a "wonderful" configure script
Clang: Docs
Python:
- CPython: README
- Pyston: README
Go: Bash script
V8: Bazel flag
ChakraCore: Scripts
Chromium: Script
Firefox: Docs
- Thunderbird has PGO support too
PHP - Makefile command and old Centminmod scripts
MySQL: CMake script
YugabyteDB: GitHub commit
FoundationDB: Script
Zstd: Makefile
Foot: Scripts
Windows Terminal: GitHub PR
Pydantic-core: GitHub PR

Some PGO documentation examples in various projects:

ClickHouse: https://clickhouse.com/docs/en/operations/optimizing-performance/profile-guided-optimization
Databend: https://databend.rs/doc/contributing/pgo
Vector: https://vector.dev/docs/administration/tuning/pgo/
Nebula: https://docs.nebula-graph.io/3.5.0/8.service-tuning/enable_autofdo_for_nebulagraph/
GCC: Official docs, section "Building with profile feedback" (even AutoFDO build is supported)
Clang:
- https://llvm.org/docs/HowToBuildWithPGO.html
- https://llvm.org/docs/AdvancedBuilds.html
tsv-utils: https://github.com/eBay/tsv-utils/blob/master/docs/BuildingWithLTO.md

After PGO, I can suggest evaluating LLVM BOLT as an additional optimization step after PGO - Post Link Optimization (PLO). This optimization technique is already integrated into the Clang, CPython, and Rustc build scripts. But I suggest starting with PGO - it's a more stable optimization than PLO in the general case.

P.S. Previously this was created as a https://stackoverflow.com/questions/77512986/adding-profile-guided-optimization-pgo-to-the-r-interpreter, but as people stated, the Stack Overflow's Discussions space is more suitable for this topic.

pgo r

edited Feb 28, 2024 at 4:40

M--

29.7k
10
70
106

created Nov 21, 2023 at 22:02

zamazan4ik

2 replies

Sorted by:

Dirk is no longer here

368.9k
59
666
741

Both locations are wrong / ineffective. The only place that matters for these types of proposals is the r-devel mailing list. Simply enumerating other projects where this helped may not buy you much but may you could try to apply it to (maybe just a part of) the R sources and see what happens?

Nov 21, 2023 at 22:24

zamazan4ik

Author

Okay, I just created the discussion on SO because it was proposed by a person in the original issue. If you think that the mailing list is a more efficient way to discuss such questions - good, I'm fine with moving to the mailing list.

I list similar projects just to encourage people to give a try to the PGO since it works in many other projects and it's much easier for me to show them before investing more time into the actual optimization steps for the R interpreter.

Before applying PGO to the R interpreter. Is there an R interpreter bench suite or something like that? Using ready-to-use and verified benchmarks is much easier to do instead of crafting a proper benchmark and only then trying to optimize the R interpreter. I am talking about something like https://github.com/python/pyperformance but for R.

Nov 23, 2023 at 2:02

Share perspectives, advice, and insights

Use Discussions to engage in deeper dialogue, have opinion-based conversations, and exchange perspectives about a technical concept. See full Discussions guidelines.

Discussions is different than Q&A

Discussions exists separately from the traditional question-and-answer space. If you have a specific programming question, go to Stack Overflow Q&A to post your question.

Be welcoming and patient

All users are expected to treat one another with kindness and respect. Remember, everyone is here to learn, and sometimes while learning, people make mistakes. See code of conduct.

No resume or job listings

Discussions are not for sharing your resume or job listing.

Avoid self-promotion

If your post happens to be about your product or website, you must disclose your affiliation. See spam guidelines and best practices.

Collectives™ on Stack Overflow

Adding Profile-Guided Optimization (PGO) to the R interpreter

2 replies