toolrunner: validate tool inputs against JSON Schema before handler execution by subhashdasyam · Pull Request #305 · anthropics/anthropic-sdk-go

subhashdasyam · 2026-03-26T16:45:10Z

What this fixes

The parse() function in toolrunner/tool.go has a comment that says it "validates and parses the input according to the tool's schema." It doesn't. It just calls json.Unmarshal into a typed struct.

That means the JSON Schema you define for a tool (required fields, enums, patterns, numeric bounds) is sent to the model but never actually checked at runtime. If the model returns bad arguments, whether from a bug, hallucination, or prompt injection, those arguments hit your handler with zero validation. Missing required fields become Go zero values. Enum violations pass through silently. additionalProperties: false is ignored entirely.
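A minimal sketch of the unmarshal-only behavior described above. `WeatherInput` and `decodeWithoutValidation` are hypothetical names used for illustration and are not part of the SDK; the point is only that `json.Unmarshal` on its own reports no error for a missing field, an out-of-enum value, or an unknown key.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// WeatherInput is a hypothetical tool input type used only for illustration.
type WeatherInput struct {
	City  string `json:"city"`
	Units string `json:"units"`
}

// decodeWithoutValidation mirrors an unmarshal-only path: no schema checks run.
func decodeWithoutValidation(raw string) (WeatherInput, error) {
	var in WeatherInput
	err := json.Unmarshal([]byte(raw), &in)
	return in, err
}

func main() {
	// "city" is missing, "units" is not a sensible enum member, "extra" is unknown.
	in, err := decodeWithoutValidation(`{"units":"kelvin-ish","extra":true}`)

	// err is nil, City is the empty string, and the unknown key is dropped silently.
	fmt.Printf("err=%v city=%q units=%q\n", err, in.City, in.Units)
}
```

A handler receiving this struct cannot tell a missing `city` apart from an explicitly empty one, which is the gap the validation step is meant to close.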

This is a problem because developers read that comment and reasonably assume the SDK is doing what it says. They skip handler-level validation because the SDK told them it already happened.

What this changes

The parse() method now validates incoming JSON against the tool's schema before calling json.Unmarshal. If validation fails, the handler never runs and the caller gets a clear error.

Constraints now enforced at runtime:

required fields must be present (not silently zero-valued)
type correctness (string, number, integer, boolean, array, object, null, and union types like ["string", "null"])
enum values must match the allowed set (type-aware comparison via reflect.DeepEqual)
additionalProperties: false rejects unknown keys (including when properties is empty or absent)
additionalProperties: {schema} validates unknown keys against the provided schema
pattern regex is matched against string values (compiled once at creation, cached for reuse)
minLength / maxLength on strings (measured in Unicode codepoints via utf8.RuneCountInString, per JSON Schema spec)
minimum / maximum / exclusiveMinimum / exclusiveMaximum on numbers
minItems / maxItems on arrays
items schema validation on array elements
Nested object/array validation — recursive, not just top-level properties
$ref / $defs resolution (local refs only)
anyOf / oneOf variant matching
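Two of the items above (type-aware enum comparison and code point based string length) can be illustrated in isolation. This is a sketch under the assumption that inputs are decoded into `any` with `encoding/json`; `inEnum`, `decode`, and `runeLen` are hypothetical helpers, not the PR's actual functions.

```go
package main

import (
	"encoding/json"
	"fmt"
	"reflect"
	"unicode/utf8"
)

// inEnum reports whether value equals one of the allowed entries. Using
// reflect.DeepEqual on decoded JSON values means the types must match too.
func inEnum(value any, allowed []any) bool {
	for _, a := range allowed {
		if reflect.DeepEqual(value, a) {
			return true
		}
	}
	return false
}

// decode unmarshals a JSON document into an untyped value.
func decode(raw string) any {
	var v any
	if err := json.Unmarshal([]byte(raw), &v); err != nil {
		panic(err)
	}
	return v
}

// runeLen counts Unicode code points, the unit used for minLength/maxLength.
func runeLen(s string) int {
	return utf8.RuneCountInString(s)
}

func main() {
	allowed := decode(`[1, 2, 3]`).([]any)
	fmt.Println(inEnum(decode(`1`), allowed))   // number 1 matches a numeric enum
	fmt.Println(inEnum(decode(`"1"`), allowed)) // string "1" does not

	s := "h\u00e9llo"
	fmt.Println(len(s), runeLen(s)) // byte length differs from code point count
}
```

A string formatting comparison (for example via `fmt.Sprintf`) would treat `"1"` and `1` as equal, which is why the type-aware comparison matters for enums.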

How it works

The schema is parsed into a raw map[string]any at tool creation time (not per-request). We keep the original JSON bytes from NewBetaToolFromBytes and NewBetaToolFromJSONSchema so that fields like additionalProperties survive the roundtrip through BetaToolInputSchemaParam, which drops them during marshal.

At creation time, prepareSchema walks the entire schema tree to:

Pre-compile regex patterns (so validation doesn't call regexp.Compile on every request)
Detect schema definition errors (invalid patterns, unsupported types, unresolvable $ref)

Validation runs before json.Unmarshal, so invalid inputs never reach the handler and never produce misleading zero-value structs.
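The creation-time walk described above might look roughly like the following. This is a simplified sketch, not the PR's `prepareSchema`: `compilePatterns` and `prepare` are hypothetical names, and the walk naively visits every nested map and slice, so it would also look inside `enum` or `default` values, which a real implementation would likely skip.

```go
package main

import (
	"encoding/json"
	"fmt"
	"regexp"
)

// compilePatterns walks a decoded schema and compiles each "pattern" string
// once, storing the result in cache keyed by the pattern source.
func compilePatterns(node any, cache map[string]*regexp.Regexp) error {
	switch n := node.(type) {
	case map[string]any:
		if p, ok := n["pattern"].(string); ok {
			if _, seen := cache[p]; !seen {
				re, err := regexp.Compile(p)
				if err != nil {
					return fmt.Errorf("invalid pattern %q: %w", p, err)
				}
				cache[p] = re
			}
		}
		for _, child := range n {
			if err := compilePatterns(child, cache); err != nil {
				return err
			}
		}
	case []any:
		for _, child := range n {
			if err := compilePatterns(child, cache); err != nil {
				return err
			}
		}
	}
	return nil
}

// prepare parses the schema once and surfaces pattern errors up front.
func prepare(schemaJSON string) (map[string]*regexp.Regexp, error) {
	var schema map[string]any
	if err := json.Unmarshal([]byte(schemaJSON), &schema); err != nil {
		return nil, err
	}
	cache := map[string]*regexp.Regexp{}
	if err := compilePatterns(schema, cache); err != nil {
		return nil, err
	}
	return cache, nil
}

func main() {
	cache, err := prepare(`{"type":"object","properties":{"id":{"type":"string","pattern":"^[a-z]+$"}}}`)
	fmt.Println(len(cache), err)

	_, err = prepare(`{"type":"string","pattern":"("}`)
	fmt.Println(err != nil) // the unbalanced group is reported at creation time
}
```

Per-request validation can then look up the compiled expression instead of calling `regexp.Compile` again.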

Error handling contract by constructor

The three constructors have different error contracts:

| Constructor | Returns error? | Schema errors |
| --- | --- | --- |
| `NewBetaToolFromBytes` | Yes | Fails fast: returns an error if the schema has invalid patterns, unsupported types, or an unresolvable `$ref` |
| `NewBetaToolFromJSONSchema` | Yes | Fails fast: same as above (the schema is generated from struct tags, so errors indicate developer bugs) |
| `NewBetaTool` | No | Best-effort: extracts the raw schema via a marshal roundtrip; if the roundtrip loses fields, validation is silently reduced; if extraction fails entirely, validation is skipped |

NewBetaToolFromBytes and NewBetaToolFromJSONSchema already returned errors before this change, so surfacing schema definition errors through them is not an API change. However, schemas with previously-ignored issues (e.g., an invalid regex in a pattern field that was never enforced) will now cause construction to fail. This is intentional — these are schema bugs that should be caught early.

NewBetaTool does not return an error by design (unchanged signature), so it falls back gracefully.

What it doesn't change

No new dependencies. The validator uses encoding/json, regexp, unicode/utf8, and reflect from the standard library.
go.mod is untouched.
The public API signatures (NewBetaTool, NewBetaToolFromBytes, NewBetaToolFromJSONSchema) are unchanged.
Raw input types (json.RawMessage, []byte) skip validation, same as before.

Limitations

JSON Schema patterns use Go's RE2 regexp engine, not the ECMA-262 dialect specified by JSON Schema. Most patterns work identically, but lookaheads/lookbehinds are not supported.
format keyword is not enforced (e.g., "format": "email" is accepted but not validated).
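The RE2 limitation noted above can be checked directly with the standard library. In this small sketch, `compiles` is a hypothetical helper that only reports whether `regexp.Compile` accepts a pattern.

```go
package main

import (
	"fmt"
	"regexp"
)

// compiles reports whether Go's regexp package (RE2 syntax) accepts the pattern.
func compiles(pattern string) bool {
	_, err := regexp.Compile(pattern)
	return err == nil
}

func main() {
	fmt.Println(compiles(`^[A-Z]{2}\d{4}$`)) // character classes and counts are accepted
	fmt.Println(compiles(`^(?=.*\d).+$`))    // lookahead is valid ECMA-262 but rejected here
	fmt.Println(compiles(`(?<!foo)bar`))     // lookbehind is rejected as well
}
```

Under the fail-fast constructors, a schema that relies on lookarounds would therefore be reported as a schema definition error at construction time rather than at request time.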

Tests

All existing tests pass. New test suites added:

TestToolRunner_SchemaValidation — required fields, enum, type, empty object, optional omission
TestToolRunner_AdditionalPropertiesRejected — extra keys blocked when schema says no
TestToolRunner_PatternValidation — regex enforcement on strings
TestToolRunner_StringLengthValidation — minLength/maxLength bounds
TestToolRunner_NumericBoundsValidation — minimum/maximum bounds

Each rejection test confirms the handler is never called when validation fails (handlerCalled flag).

The comment fix

Old:

// parse validates and parses the input according to the tool's schema.

New:

// parse validates the input against the tool's JSON Schema and then unmarshals
// it into the target type T. Validation enforces required fields, additionalProperties,
// type correctness, enum constraints, pattern, string length bounds, numeric bounds,
// and array item counts before the handler runs.

…xecution

The parse() function claimed to validate inputs against the tool's schema but only called json.Unmarshal. This meant required fields, enum constraints, additionalProperties, pattern, and numeric bounds defined in the schema were never enforced at runtime.

This commit adds a zero-dependency schema validator (stdlib only: encoding/json and regexp) that checks inputs before they reach the handler. The schema is parsed once at tool creation time. If validation fails, the handler never runs.

Constraints now enforced:
- required: missing fields rejected instead of becoming zero values
- type: string/number/integer/boolean/array/object checked
- enum: values must be in the allowed set
- additionalProperties: false rejects unknown keys
- pattern: regex matched against string values
- minLength/maxLength: string length bounds
- minimum/maximum/exclusiveMinimum/exclusiveMaximum: numeric bounds
- minItems/maxItems: array length bounds

The original JSON bytes are preserved at creation time so fields like additionalProperties survive the BetaToolInputSchemaParam marshal roundtrip. If schema parsing fails for any reason, validation is skipped and behavior falls back to the previous json.Unmarshal-only path. No breaking changes.

Includes 5 new test suites (16 total tests pass):
- TestToolRunner_SchemaValidation (required, enum, type, empty, optional)
- TestToolRunner_AdditionalPropertiesRejected
- TestToolRunner_PatternValidation
- TestToolRunner_StringLengthValidation
- TestToolRunner_NumericBoundsValidation

Copilot

Pull request overview

Adds runtime JSON Schema validation to toolrunner tool input parsing so invalid tool arguments returned by the model are rejected before reaching handlers, aligning behavior with the existing documentation and preventing silent zero-value fallthrough.

Changes:

Introduces a lightweight JSON Schema validator (required, additionalProperties, type, enum, pattern, length/bounds, item counts) and runs it before json.Unmarshal into T.
Preserves a raw schema map alongside the typed BetaToolInputSchemaParam to avoid losing schema fields during marshal/unmarshal roundtrips.
Adds new test suites covering the newly enforced validation constraints and ensuring handlers are not called on invalid input.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

| File | Description |
| --- | --- |
| `toolrunner/tool.go` | Adds schema validation logic, stores raw schema for validation, and updates constructors to initialize the validator. |
| `toolrunner/runner_test.go` | Adds test coverage for required fields, enum/type checks, additionalProperties, patterns, string length, and numeric bounds. |

toolrunner/tool.go

…ties, enum, regex

Fixes 5 issues from code review:

1. newSchemaValidator now recognizes object schemas when type is an array containing "object", or when type is absent but properties, required, or additionalProperties is present.
2. additionalProperties:false now rejects unknown keys even when the properties field is absent entirely (not just empty).
3. Enum comparison uses reflect.DeepEqual instead of fmt.Sprintf, so string "1" no longer matches numeric enum value 1.
4. Invalid regex patterns in schemas now produce a validation error instead of silently passing all inputs.
5. Regex patterns are pre-compiled once at tool creation time and cached in the validator. Compile errors are stored separately and surfaced during validation.

New tests cover all 5 fixes:
- TestMissingTypeInference (3 subtests)
- TestAdditionalPropertiesNoPropsField (2 subtests)
- TestEnumCrossTypeMismatch (2 subtests)
- TestInvalidRegexPattern

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

toolrunner/tool.go

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

toolrunner/tool.go

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

toolrunner/tool.go

subhashdasyam · 2026-03-27T05:04:09Z

@stainless-ci-bot and @claude do review

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

Copilot · 2026-03-27T05:19:16Z

toolrunner/tool.go

+		case "integer":
+			if f, ok := value.(float64); ok && f == float64(int64(f)) {
+				return nil
+			}


Integer validation can be bypassed due to float64 rounding during encoding/json unmarshaling (e.g., large numbers with fractional parts can round to an integer), causing non-integer JSON inputs to incorrectly pass the integer check. A more robust approach is to decode validation input using json.Decoder with UseNumber() and validate integers from the original json.Number string representation (or at least guard against precision loss by rejecting values outside the safe integer range).
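A sketch of the `UseNumber` approach suggested in this comment, shown next to the float64 check it would replace. `floatLooksInteger` and `isJSONInteger` are hypothetical helpers; using `math/big` to test for a fractional part is one possible choice, not something the PR is known to do, and a production validator would probably also want to bound very large exponents.

```go
package main

import (
	"encoding/json"
	"fmt"
	"math/big"
	"strings"
)

// floatLooksInteger reproduces the float64-based check from the diff.
func floatLooksInteger(raw string) bool {
	var v any
	if err := json.Unmarshal([]byte(raw), &v); err != nil {
		return false
	}
	f, ok := v.(float64)
	return ok && f == float64(int64(f))
}

// isJSONInteger decodes with UseNumber and inspects the original number text,
// so no precision is lost before the integer test.
func isJSONInteger(raw string) bool {
	dec := json.NewDecoder(strings.NewReader(raw))
	dec.UseNumber()
	var v any
	if err := dec.Decode(&v); err != nil {
		return false
	}
	num, ok := v.(json.Number)
	if !ok {
		return false
	}
	r, ok := new(big.Rat).SetString(num.String())
	return ok && r.IsInt()
}

func main() {
	// 2^53 + 1.5 rounds to an even integer when stored as float64.
	const tricky = "9007199254740993.5"
	fmt.Println(floatLooksInteger(tricky)) // the float64 check accepts it
	fmt.Println(isJSONInteger(tricky))     // the text-based check rejects it
	fmt.Println(isJSONInteger("42"), isJSONInteger("1.5"))
}
```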

Copilot · 2026-03-27T05:19:16Z

toolrunner/tool.go

+func (v *schemaValidator) validateObject(path string, obj map[string]any, schema map[string]any, refStack map[string]bool) error {
+	if req, ok := schema["required"].([]any); ok {
+		for _, r := range req {
+			name, _ := r.(string)
+			if name == "" {
+				continue
+			}
+			if _, exists := obj[name]; !exists {
+				return fmt.Errorf("missing required property '%s'", joinPath(path, name))
+			}
+		}
+	}
+
+	props, _ := schema["properties"].(map[string]any)
+	additional := schema["additionalProperties"]
+	for key, val := range obj {
+		propPath := joinPath(path, key)


Two related issues make validation errors harder to consume/debug: (1) missing-required errors use joinPath(path, name) which yields city at the root rather than consistently prefixing with input like other errors (via displayPath), and (2) iterating for key, val := range obj produces nondeterministic error selection/order when multiple properties are invalid (Go map iteration is randomized). Consider normalizing required error paths (e.g., include displayPath(path) consistently) and iterating object keys in sorted order when validating to produce stable, predictable error messages.
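The sorted-iteration suggestion can be sketched as follows. `sortedKeys` and `firstNonString` are hypothetical helpers, the string-only check stands in for real per-property validation, and the `input.` prefix is an assumption about how `displayPath` formats paths.

```go
package main

import (
	"fmt"
	"sort"
)

// sortedKeys returns the object's keys in ascending order so that callers
// iterate deterministically instead of relying on Go's randomized map order.
func sortedKeys(obj map[string]any) []string {
	keys := make([]string, 0, len(obj))
	for k := range obj {
		keys = append(keys, k)
	}
	sort.Strings(keys)
	return keys
}

// firstNonString returns the path of the first property, in sorted key order,
// whose value is not a string. It stands in for "first validation failure".
func firstNonString(obj map[string]any) (string, bool) {
	for _, k := range sortedKeys(obj) {
		if _, ok := obj[k].(string); !ok {
			return "input." + k, true
		}
	}
	return "", false
}

func main() {
	obj := map[string]any{"zip": 1, "city": 2, "name": "x"}
	path, found := firstNonString(obj)
	fmt.Println(path, found) // always reports the same property across runs
}
```

With two invalid properties, ranging over the map directly could report either one on a given run; sorting first makes the reported error stable, which also keeps test assertions on error text reliable.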

Copilot · 2026-03-27T05:19:17Z

toolrunner/tool.go

+	if variants, ok := schema["oneOf"]; ok {
+		if err := v.validateVariants(path, value, variants, true, refStack); err != nil {
+			return err
+		}
+	}


The implementation adds oneOf support, but the new tests only cover anyOf (and not oneOf edge cases). Add tests that assert: (a) exactly one variant matches succeeds, (b) zero matches fails, and (c) multiple matches fails—ideally also verifying the handler is not called when oneOf validation fails.
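The three cases requested here follow from the "exactly one" rule of `oneOf`. The sketch below is not the PR's `validateVariants`; `matcher` and `validateOneOf` are hypothetical stand-ins where each variant is reduced to a predicate, which is enough to show the zero, one, and multiple match outcomes a test table would cover.

```go
package main

import (
	"errors"
	"fmt"
)

// matcher stands in for "validate the value against one sub-schema".
type matcher func(value any) bool

// validateOneOf succeeds only when exactly one variant accepts the value.
func validateOneOf(value any, variants []matcher) error {
	matches := 0
	for _, m := range variants {
		if m(value) {
			matches++
		}
	}
	switch {
	case matches == 0:
		return errors.New("value matches no oneOf variant")
	case matches > 1:
		return fmt.Errorf("value matches %d oneOf variants, expected exactly one", matches)
	}
	return nil
}

// exampleVariants returns overlapping variants: string, number, non-negative number.
func exampleVariants() []matcher {
	return []matcher{
		func(v any) bool { _, ok := v.(string); return ok },
		func(v any) bool { _, ok := v.(float64); return ok },
		func(v any) bool { f, ok := v.(float64); return ok && f >= 0 },
	}
}

func main() {
	variants := exampleVariants()
	fmt.Println(validateOneOf("x", variants))  // one match: no error
	fmt.Println(validateOneOf(true, variants)) // zero matches: error
	fmt.Println(validateOneOf(3.0, variants))  // two matches: error
}
```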

Copilot · 2026-03-27T05:19:17Z

toolrunner/tool_test.go

+// TestSchemaValidation verifies that the tool runner validates inputs
+// against the JSON Schema before executing the handler. This prevents missing
+// required fields, enum violations, and type mismatches from reaching handlers.
+func TestSchemaValidation(t *testing.T) {


The PR description lists new test suites named TestToolRunner_SchemaValidation, TestToolRunner_AdditionalPropertiesRejected, etc., but the added tests are named TestSchemaValidation, TestAdditionalPropertiesRejected, etc. Either update the PR description to match the actual test names or rename the tests to match what's documented.

subhashdasyam requested a review from a team as a code owner March 26, 2026 16:45

Copilot AI review requested due to automatic review settings March 26, 2026 16:45

Copilot started reviewing on behalf of subhashdasyam March 26, 2026 16:45

Copilot AI reviewed Mar 26, 2026

Subhash Dasyam added 2 commits March 26, 2026 20:51

toolrunner: move schema validation tests to tool_test.go

b594ab0

Validation tests belong in tool_test.go (tests tool.go logic), not runner_test.go (tests runner orchestration loop). Same package, same directory, just the right file.

subhashdasyam requested a review from Copilot March 26, 2026 17:08

Copilot started reviewing on behalf of subhashdasyam March 26, 2026 17:09

Copilot AI reviewed Mar 26, 2026

toolrunner: tighten string length and pattern validation

3e75c6f

subhashdasyam requested a review from Copilot March 26, 2026 17:19

Copilot started reviewing on behalf of subhashdasyam March 26, 2026 17:19

Copilot AI reviewed Mar 26, 2026

toolrunner: recurse through nested schemas

fdde13f

subhashdasyam requested a review from Copilot March 26, 2026 21:17

Copilot started reviewing on behalf of subhashdasyam March 26, 2026 21:18

Copilot AI reviewed Mar 26, 2026

subhashdasyam requested a review from Copilot March 27, 2026 04:54

Copilot started reviewing on behalf of subhashdasyam March 27, 2026 05:10

Copilot AI reviewed Mar 27, 2026
