Emit config diagnostics by sgavriil01 · Pull Request #2289 · facebook/pyrefly

sgavriil01 · 2026-02-02T14:27:54Z

Summary

This PR adds LSP diagnostics for config file (pyrefly.toml/pyproject.toml) parsing errors. When a config file has invalid syntax, diagnostics now appear in VS Code's Problems panel and clear automatically when the file is fixed.

Changes:

Modified ConfigError to include file path information
Fixed bug in workspace.rs where config errors were being discarded
Added publish_config_diagnostics() to emit LSP diagnostics for config files
Added caching to ensure diagnostics persist across publish cycles

Fixes #2078

Test Plan

Added integration test test_config_file_diagnostics() that creates invalid TOML and verifies proper handling
Manually tested by creating syntax errors in pyrefly.toml - diagnostics appear in Problems panel with correct line/column positions
Verified diagnostics clear when config is fixed
Ran ./test.py --no-test --no-conformance - formatting and linting pass

Emit diagnostics for config file parse errors (facebook#2078) Config errors (pyrefly.toml/pyproject.toml) now appear as LSP diagnostics in the Problems panel and clear when fixed.

connernilsen · 2026-02-06T22:39:32Z

Hey @sgavriil01, thanks for making this! It seems like something we definitely need. I'll review early next week

connernilsen

Hey, thanks for doing this! It's looking great so far and I love the motivation. I left a few comments, mostly small things, but I do think there needs to be a different approach for how we figure out which files to update for, which I left in the else branch when we're getting the diagnostics to publish.

Let me know if you have any questions or disagree, I'm totally down to discuss things or give more thoughts if you want :)

connernilsen · 2026-02-09T17:58:02Z

+            });
+            *self.cached_config_errors.lock() = config_errors.clone();
+        } else {
+            // If get_config_errors() returned empty (consumed by mem::take),


Those errors might be empty because all errors have been fixed. If that's the case, will we not end up clearing diagnostics here?

What we might need to do instead to make sure this is working is to keep a map in the config loader of all config paths to errors that have been loaded since the last time they were taken. Then we can use that to determine if a config has been changed since the last time we published errors. If it's not in the map, then we don't update diagnostics from the previous result. If it is, then completely overwrite with new errors.

…efactor ConfigError API - Replace std::collections::HashSet with SmallSet for deduplication - Add source: DiagnosticSource parameter to publish_config_diagnostics - Consolidate ConfigError::error/warn methods to accept Option<PathBuf> - Update all callsites to use the new unified API

- Add span field to ConfigError for line/column information - Extract span from toml::de::Error in from_file() - Update config_error_to_diagnostic to use ConfigError.span() - Remove extract_line_col_from_toml_error() regex function - Add error_with_span() constructor for TOML errors

sgavriil01 · 2026-02-11T17:11:31Z

Thanks for the review! I've addressed the first 4 points:

Using SmallSet for deduplication
Passing DiagnosticSource as a parameter instead of hardcoding it
Simplified the ConfigError API
Using TOML's built-in error information instead of regex parsing

For point 5 about the caching strategy - could you clarify the intended behavior? Should the ConfigFinder track which config files were recently loaded and only publish diagnostics for those, rather than maintaining a separate cache in the Server?

github-actions · 2026-02-11T17:11:39Z

According to mypy_primer, this change doesn't affect type check results on a corpus of open source code. ✅

github-actions · 2026-02-26T00:38:47Z

This pull request has been automatically marked as stale because it has not had recent activity for more than 2 weeks.

If you are still working on this this pull request, please add a comment or push new commits to keep it active. Otherwise, please unassign yourself and allow someone else to take over.

Thank you for your contributions!

connernilsen · 2026-03-09T18:18:18Z

Hey @sgavriil01, sorry for the long delay. I've had to shift my focus internally for a bit to focus on a bug that's been causing a lot of problems. I think it should be resolved soon, so I'll come back and check out your changes here hopefully this week :)

connernilsen

Alright, thanks for making the previous changes. And sorry about the wait. I was pulled into a few internal issues over the past few weeks that took up all of my time.

For point 5 about the caching strategy - could you clarify the intended behavior? Should the ConfigFinder track which config files were recently loaded and only publish diagnostics for those, rather than maintaining a separate cache in the Server?

What I'm thinking is, we have our ConfigFinder implementation that handles loading and caching configs. There are times where a config is re-loaded or we clear the config finder's cache, but it kind of knows the state of all configs at all times, so let's use that as our source of truth.

Your changes above removing anyhow::Error from ConfigError make it so we can add #[derive(Clone)] to ConfigError, so now we don't need to take errors from the config finder, we can just clone them when needed. Those errors will also automatically be cleared when they're not longer useful, so we can just:

clear all config errors that currently exist
clone all errors from the config finder
process the errors to turn them into diagnostics
push those diagnostics to the IDE

Then all we need to do is make sure we know which configs we've pushed diagnostics to, and for that, we can continue to use the config_files_with_diagnostics you created, but not have to manage complicated state in the server with cached_config_errors.

Sorry, I changed a few details from my first comment mentioning the caching changes, so let me know if this makes more sense.

connernilsen · 2026-03-17T20:49:40Z

+                        let error_str = toml_err.to_string();
+                        let line = error_str
+                            .split("line ")
+                            .nth(1)
+                            .and_then(|s| s.split(&[',', ' ', '\n'][..]).next())
+                            .and_then(|s| s.parse::<usize>().ok())
+                            .unwrap_or(1);
+                        let column = error_str
+                            .split("column ")
+                            .nth(1)
+                            .and_then(|s| s.split(&[',', ' ', '\n'][..]).next())
+                            .and_then(|s| s.parse::<usize>().ok())
+                            .unwrap_or(1);
+                        (line, column)


Instead of trying to parse this, can we convert the span to a line, col pair? You can probably do something like that with

connernilsen · 2026-03-17T20:53:20Z

+                    toml_err.span().map(|_span| {
+                        // The span is in bytes, but toml errors also expose line/column in their Display
+                        // For now, we parse the display string which includes "at line X, column Y"
+                        let error_str = toml_err.to_string();
+                        let line = error_str
+                            .split("line ")
+                            .nth(1)
+                            .and_then(|s| s.split(&[',', ' ', '\n'][..]).next())
+                            .and_then(|s| s.parse::<usize>().ok())
+                            .unwrap_or(1);
+                        let column = error_str
+                            .split("column ")
+                            .nth(1)
+                            .and_then(|s| s.split(&[',', ' ', '\n'][..]).next())
+                            .and_then(|s| s.parse::<usize>().ok())
+                            .unwrap_or(1);
+                        (line, column)


Instead of trying to parse this, can we convert the span to a line, col pair? You can probably do something like this

use ruff_source_file::LineIndex; let index = LineIndex::from_source_text(source); let line_col = index.line_column(text_size_offset, source); // line_col.line and line_col.column are both 0-indexed

github-actions · 2026-04-02T00:43:09Z

This pull request has been automatically marked as stale because it has not had recent activity for more than 2 weeks.

If you are still working on this this pull request, please add a comment or push new commits to keep it active. Otherwise, please unassign yourself and allow someone else to take over.

Thank you for your contributions!

meta-cla Bot added the cla signed label Feb 2, 2026

git status

87f17f3

Emit diagnostics for config file parse errors (facebook#2078) Config errors (pyrefly.toml/pyproject.toml) now appear as LSP diagnostics in the Problems panel and clear when fixed.

sgavriil01 force-pushed the emit-config-diagnostics branch from c487ef4 to 5393991 Compare February 2, 2026 14:55

Removed unecessary comments

7e85055

sgavriil01 force-pushed the emit-config-diagnostics branch from 5393991 to 7e85055 Compare February 2, 2026 15:01

sgavriil01 mentioned this pull request Feb 2, 2026

Make some sort of UI signal in VS Code if we fail to parse pyproject.toml #2078

Open

This comment has been minimized.

Sign in to view

github-actions Bot added the needs-triage label Feb 4, 2026

connernilsen self-assigned this Feb 4, 2026

connernilsen self-requested a review February 4, 2026 19:50

connernilsen added configuration language-server Issues specific to our IDE integration rather than type checking and removed needs-triage labels Feb 6, 2026

connernilsen requested changes Feb 9, 2026

View reviewed changes

sgavriil01 added 2 commits February 11, 2026 17:50

github-actions Bot added the stale label Feb 26, 2026

github-actions Bot removed the stale label Mar 11, 2026

connernilsen requested changes Mar 17, 2026

View reviewed changes

github-actions Bot added the stale label Apr 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Emit config diagnostics#2289

Emit config diagnostics#2289
sgavriil01 wants to merge 4 commits into
facebook:mainfrom
sgavriil01:emit-config-diagnostics

sgavriil01 commented Feb 2, 2026

This comment has been minimized.

connernilsen commented Feb 6, 2026

connernilsen left a comment

Uh oh!

Uh oh!

Uh oh!

connernilsen Feb 9, 2026

Uh oh!

Uh oh!

sgavriil01 commented Feb 11, 2026

github-actions Bot commented Feb 11, 2026

github-actions Bot commented Feb 26, 2026

connernilsen commented Mar 9, 2026

connernilsen left a comment

connernilsen Mar 17, 2026

connernilsen Mar 17, 2026

github-actions Bot commented Apr 2, 2026

Labels

2 participants

Uh oh!

Conversation

sgavriil01 commented Feb 2, 2026

Summary

Test Plan

This comment has been minimized.

connernilsen commented Feb 6, 2026

connernilsen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

connernilsen Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sgavriil01 commented Feb 11, 2026

github-actions Bot commented Feb 11, 2026

github-actions Bot commented Feb 26, 2026

connernilsen commented Mar 9, 2026

connernilsen left a comment

Choose a reason for hiding this comment

connernilsen Mar 17, 2026

Choose a reason for hiding this comment

connernilsen Mar 17, 2026

Choose a reason for hiding this comment

github-actions Bot commented Apr 2, 2026

Labels

2 participants