feat: pre-call sanitization and post-call tool guardrails #1732
Merged
Conversation
Summary
Salvage of PR #1321 by @alireza78a — the concept was cherry-picked and reimplemented against current main.
Phase 1 — Pre-call message sanitization
`_sanitize_api_messages()` now runs unconditionally before every LLM call. Previously it was gated on `context_compressor` being present (line 4998), so sessions loaded from disk or running without compression could silently accumulate dangling tool_call/tool_result pairs, causing "No tool call found for call_id" API errors.
Phase 2a — Delegate task cap
`_cap_delegate_task_calls()` truncates excess `delegate_task` calls per turn to `MAX_CONCURRENT_CHILDREN`. The existing cap in `delegate_tool.py` only limits the task array within a single call; this catches multiple separate `delegate_task` tool_calls in one turn.
Phase 2b — Tool call deduplication
`_deduplicate_tool_calls()` drops duplicate `(tool_name, arguments)` pairs within a single turn when models stutter. All three are static methods on `AIAgent`, independently testable.
Tests
29 tests in `tests/test_agent_guardrails.py` cover all three phases: orphaned result removal, stub injection, mixed orphans, delegate cap with interleaved ordering, dedup first-occurrence preservation, input mutation safety, empty-list edge cases, and SDK object vs dict format handling.

Closes #626