VERDICT WEIGHT - Confidence Scoring for Autonomous AI

Overview

Scorer is the canonical entry point for scoring a single decision. It composes all eight streams, returns a calibrated confidence score, and writes the scoring event to the audit chain.

from verdict_weight import Scorer

scorer = Scorer()
result = scorer.score(prediction=..., evidence={...})

Constructor

Scorer(
    config: ScorerConfig | None = None,
    audit_log_path: str | Path | None = None,
)

Parameter	Type	Description
`config`	`ScorerConfig`	Optional configuration object. If omitted, conservative defaults are used. See Hyperparameters.
`audit_log_path`	`str` or `Path`	Path to the audit chain log file. If omitted, an in-memory chain is used (not recommended for production).

`score`

The primary scoring method.

result = scorer.score(
    prediction,
    evidence: dict,
    *,
    context: dict | None = None,
)

Parameter	Type	Description
`prediction`	any	The decision being scored. The framework is prediction-type-agnostic; treat this as an opaque label for audit purposes.
`evidence`	`dict`	The evidence payload. Keys are source identifiers; values are the evidence values. See Evidence aggregation for accepted types.
`context`	`dict`	Optional caller-supplied context (request id, user id, etc.). Recorded in the audit chain; not used for scoring.

Returns: `ScoreResult`

Field	Type	Description
`confidence`	`float`	Calibrated confidence in `[0, 1]`.
`should_act`	`bool`	Gate decision against the configured threshold.
`outcome`	`str`	One of `"score"`, `"abstain"`, `"abort"`.
`stream_breakdown`	`dict[str, StreamContribution]`	Per-stream contributions.
`audit_id`	`str`	Hash-chain identifier for this event.
`reason`	`str` \| `None`	Populated on `abstain` or `abort` outcomes.

`outcome` values

score

Normal output. confidence is meaningful and should_act is set.

abstain

The framework declined to produce a confidence value. Evidence was contradictory or insufficient.

abort

A hardening stream raised a veto. Confidence is forced to zero and reason identifies the trigger.

Example: handling all three outcomes

result = scorer.score(prediction, evidence)

match result.outcome:
    case "score":
        if result.should_act:
            execute(prediction)
        else:
            escalate_to_review(prediction, result)
    case "abstain":
        log_abstention(result.reason)
        escalate_to_review(prediction, result)
    case "abort":
        log_security_event(result.reason)
        # Do NOT continue to other scorers - investigate first.
        raise FrameworkAbort(result.reason)

An "abort" outcome means a hardening stream detected a condition where the scoring layer itself may be compromised. Do not silently retry. Do not switch to a different scorer instance and continue. Investigate per your incident response plan.

Thread safety

Each Scorer instance is intended to be used from a single thread or an async event loop. For concurrent scoring, instantiate one scorer per worker and configure them to share an audit log path with appropriate file locking, or run them with separate logs that are reconciled offline. See Pipeline for the supported concurrency patterns.

Python SDK

Configuration

Scorer

Overview

Constructor

`score`

Returns: `ScoreResult`

`outcome` values

score

abstain

abort

Example: handling all three outcomes

Thread safety

Python SDK

Configuration

Documentation Index

​Overview

​Constructor

​score

​Returns: ScoreResult

​outcome values

score

abstain

abort

​Example: handling all three outcomes

​Thread safety

Overview

Constructor

`score`

Returns: `ScoreResult`

`outcome` values

Example: handling all three outcomes

Thread safety