Update/add SCM sensors and thresholds #720

Open
pawagh1 wants to merge 5 commits into facebook:main from pawagh1:scmsensor

@pawagh1 pawagh1 commented Dec 4, 2025

Contributor

This update refreshes the SCM section of sensor_service.json so it matches the latest threshold guidance and exposes the additional telemetry that the platform now reports.

Key highlights of this update:

Threshold Definition Updates:

  • Refreshed upperCriticalVal and lowerCriticalVal settings for all SCM voltages and temperatures.
  • Introduced maxAlarmVal and minAlarmVal where the new guidance specifies warning bands, improving early alarm coverage.
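
A minimal sketch of how the four threshold fields could be sanity-checked for ordering. It assumes the alarm band sits inside the critical band (lowerCriticalVal ≤ minAlarmVal ≤ maxAlarmVal ≤ upperCriticalVal), which the PR does not state explicitly. The helper name `threshold_order_errors`, the sensor name, and the example values are hypothetical.

```python
# Expected ascending order of the threshold fields named in this PR.
# ASSUMPTION: alarm (warning) limits lie inside the critical limits.
ORDERED_KEYS = ["lowerCriticalVal", "minAlarmVal", "maxAlarmVal", "upperCriticalVal"]


def threshold_order_errors(name, thresholds):
    """Return messages for adjacent thresholds that are out of order.

    Only the keys actually present are compared, so a sensor that has
    critical limits but no alarm limits is still checked.
    """
    present = [(key, thresholds[key]) for key in ORDERED_KEYS if key in thresholds]
    errors = []
    for (low_key, low_val), (high_key, high_val) in zip(present, present[1:]):
        if low_val > high_val:
            errors.append(f"{name}: {low_key}={low_val} exceeds {high_key}={high_val}")
    return errors


# Hypothetical values for illustration only; not taken from sensor_service.json.
example_thresholds = {
    "lowerCriticalVal": 0.90,
    "minAlarmVal": 0.95,
    "maxAlarmVal": 1.05,
    "upperCriticalVal": 1.10,
}
problems = threshold_order_errors("EXAMPLE_SENSOR", example_thresholds)
```

An empty `problems` list means the limits are consistently ordered under the stated assumption.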

Additional Sensors:

  • Added SCM_Vp1P0_PCIE4, SCM_I_VP1P05_CPU, and SCM_I_VP1P8_VCCIN so the service can monitor the new power rails and currents delivered by the SCM regulators.

No other fields (slotPath, sysfsPath, pmUnitName, type) were modified; only the thresholds and the new sensor entries were updated.
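
One way to double-check this claim is to compare the old and new sensor entries field by field. The sketch below assumes each sensor is a flat dict with a `name` key and a nested `thresholds` key; that layout, the helper name `non_threshold_changes`, and the sample entries are assumptions for illustration, not the confirmed schema of sensor_service.json.

```python
def non_threshold_changes(old_sensors, new_sensors, threshold_key="thresholds"):
    """Return (sensor name, field) pairs where a non-threshold field differs.

    Sensors that exist only in new_sensors are skipped, since newly added
    entries have nothing to be compared against.
    """
    old_by_name = {sensor["name"]: sensor for sensor in old_sensors}
    changes = []
    for sensor in new_sensors:
        before = old_by_name.get(sensor["name"])
        if before is None:
            continue  # newly added sensor
        for field in sorted(set(before) | set(sensor)):
            if field != threshold_key and before.get(field) != sensor.get(field):
                changes.append((sensor["name"], field))
    return changes


# Hypothetical entries; the paths and values are placeholders.
old = [{"name": "SENSOR_A", "sysfsPath": "/hypothetical/a", "type": 1,
        "thresholds": {"upperCriticalVal": 1.0}}]
new = [{"name": "SENSOR_A", "sysfsPath": "/hypothetical/a", "type": 1,
        "thresholds": {"upperCriticalVal": 1.1, "maxAlarmVal": 1.05}},
       {"name": "SENSOR_B", "sysfsPath": "/hypothetical/b", "type": 1}]
unexpected = non_threshold_changes(old, new)
```

If `unexpected` is empty, only thresholds and new entries differ between the two versions.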

Test Plan:

  • Confirm the sensor_service loads the updated JSON without validation errors.
  • Verify each SCM sensor reports values inside the refreshed threshold ranges on hardware.
  • Spot-check that the newly added sensors emit readings and appear in downstream monitoring dashboards.
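
The second and third checks could be scripted roughly as follows. This is a sketch only: it assumes readings are available as a name-to-value mapping and that each sensor dict carries a nested `thresholds` key. The helper `check_readings` and all sample data are hypothetical, and how readings are actually collected from sensor_service is not shown here.

```python
def check_readings(sensors, readings):
    """Return (missing, out_of_range) lists of sensor names.

    A sensor is 'missing' if it has no reading, and 'out_of_range' if its
    reading falls outside [lowerCriticalVal, upperCriticalVal]. An absent
    limit is treated as unbounded on that side.
    """
    missing, out_of_range = [], []
    for sensor in sensors:
        name = sensor["name"]
        value = readings.get(name)
        if value is None:
            missing.append(name)
            continue
        thresholds = sensor.get("thresholds", {})
        low = thresholds.get("lowerCriticalVal", float("-inf"))
        high = thresholds.get("upperCriticalVal", float("inf"))
        if not low <= value <= high:
            out_of_range.append(name)
    return missing, out_of_range


# Hypothetical sensors and readings for illustration.
sensors = [
    {"name": "SENSOR_A", "thresholds": {"lowerCriticalVal": 0.9, "upperCriticalVal": 1.1}},
    {"name": "SENSOR_B", "thresholds": {"upperCriticalVal": 85.0}},
    {"name": "SENSOR_C", "thresholds": {}},
]
readings = {"SENSOR_A": 1.0, "SENSOR_B": 90.0}
missing, out_of_range = check_readings(sensors, readings)
```

Here `missing` would flag a newly added sensor that emits no reading, and `out_of_range` a sensor whose value violates the refreshed critical limits.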
- Update temperature sensor thresholds (SCM_OUTLET, SCM_INLET, SCM_CPU_CORE_TEMP_0, SCM_SSD)
- Add maxAlarmVal and minAlarmVal thresholds to all voltage sensors
- Update threshold values for better monitoring
- Add three new sensors: SCM_Vp1P0_PCIE4, SCM_I_VP1P05_CPU, SCM_I_VP1P8_VCCIN
@meta-cla meta-cla Bot added the CLA Signed label Dec 4, 2025
@meta-codesync

meta-codesync Bot commented Dec 23, 2025

Contributor

@mikechoifb has imported this pull request. If you are a Meta employee, you can view this in D89735468.

@mikechoifb mikechoifb left a comment

Contributor

Sensor names such as SCM_Vp1P0_PCIE4 should not contain lowercase letters. Please use uppercase.
After renaming SCM_Vp1P0_PCIE4 to SCM_VP1P0_PCIE4, that name appears twice (a duplicate). Please rename the relevant entry so that each sensor name is unique.
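
Both issues could be caught with a small naming check like the sketch below. It assumes the sensor names have already been extracted into a plain list; the helper name `naming_problems` is hypothetical, and the sample list is illustrative (it includes an existing SCM_VP1P0_PCIE4 entry, as the duplicate report implies).

```python
from collections import Counter


def naming_problems(names):
    """Return (names containing lowercase letters, names that collide once uppercased)."""
    has_lowercase = [name for name in names if name != name.upper()]
    counts = Counter(name.upper() for name in names)
    duplicates = sorted(name for name, count in counts.items() if count > 1)
    return has_lowercase, duplicates


# Illustrative list; the real set of SCM sensor names is larger.
names = ["SCM_VP1P0_PCIE4", "SCM_Vp1P0_PCIE4", "SCM_I_VP1P05_CPU", "SCM_I_VP1P8_VCCIN"]
has_lowercase, duplicates = naming_problems(names)
```

Comparing names after uppercasing surfaces the collision before the rename is applied, not only afterwards.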
