How SIFTR works

This page explains how SIFTR processes documents and maps explicit, quoted evidence to UK Cyber Assessment Framework (CAF) outcomes.

The approach is designed to be accurate, repeatable and defensible, while ensuring that final judgement remains with the organisation.

Overview

At a high level, SIFTR:

Ingests selected organisational documents
Extracts and prepares text for analysis
Applies privacy and sanitisation controls
Uses a two-stage analysis process to avoid inference and hallucination
Maps explicit statements to CAF outcomes
Aggregates evidence across documents
Presents quote-backed results for human review

Each stage is designed to minimise assumption and maximise traceability.

Step 1: Document ingestion

Users upload existing governance and assurance artefacts relevant to CAF assessment.

At this stage, SIFTR supports PDF documents only.

This constraint is intentional. PDF files preserve document integrity, pagination and structure, which supports accurate evidence quoting and traceability during CAF review.

Only documents explicitly selected by the user are included in scope.

Step 2: Text extraction and preparation

Uploaded PDFs are processed to extract text content for analysis.

During this stage:

Document structure is normalised
Headers, footers and layout artefacts are reduced
Non-text elements (such as images or diagrams) are ignored
Original wording and meaning are preserved

No content is added, rewritten or enhanced.

Step 3: Privacy and PII handling

SIFTR is designed to support data minimisation and purpose limitation.

Where documents contain personal or identifying information, sanitisation controls are applied so that analysis focuses on governance-relevant content only.

This may include:

Masking or removing direct personal identifiers
Removing incidental personal detail not relevant to CAF evidence
Preserving role-based accountability (for example job titles or committees)

Step 4: Two-stage analysis (sanitise → map)

To reduce the risk of hallucination, bias or misattribution, SIFTR uses a two-stage analysis approach.

Stage 1 — Sanitisation

The first stage produces a clean, governance-focused representation of the document by:

Reducing irrelevant contextual noise
Removing non-essential personal information
Preserving only content relevant to CAF evidence mapping

Stage 2 — CAF evidence mapping

The second stage evaluates the sanitised text against the wording and intent of the relevant CAF outcomes.

For each outcome, SIFTR looks only for explicit statements relating to:

Roles, responsibilities and accountability
Governance structures and oversight
Risk management approaches
Decision-making and escalation arrangements

SIFTR does not infer intent or fill gaps where evidence is not stated.

Step 5: Evidence identification and quoting

Where relevant evidence is found, SIFTR:

Extracts the minimum necessary quotation
Preserves the original wording
Records the source document and, where available, page reference

If no explicit evidence is found in a document, no quote is generated.

Step 6: Evidence aggregation and coverage classification

Evidence is identified at document level and assessed across all uploaded documents.

For each CAF outcome:

If any document provides Strong evidence, the outcome is reported as Strong
If no Strong evidence exists but one or more documents provide Partial evidence, the outcome is reported as Partial
If no document provides relevant evidence, the outcome is reported as None

Coverage labels are defined as:

Strong — clear, explicit evidence directly supports the outcome
Partial — some relevant evidence exists, but gaps or ambiguity remain
None — no explicit evidence was identified

These labels reflect evidence presence only. They do not represent maturity, effectiveness or compliance.

Step 7: Output and review

Results are presented in a structured table showing:

CAF outcomes in scope
Evidence coverage classification
Primary supporting quote
Source document reference
A short rationale explaining the classification
For Partial outcomes, what key aspects appear to be missing

Outputs are intended to be reviewed, challenged and supplemented as part of normal governance and assurance activity.

Human judgement remains essential

SIFTR is a decision-support tool.

Users remain responsible for:

Interpreting evidence in organisational context
Weighing proportionality and risk
Making CAF judgements and assertions
Deciding what additional evidence or action is required

SIFTR does not make compliance or assurance decisions.

Transparency and repeatability

For each analysis, SIFTR records:

The CAF version used
The outcomes included in scope
The documents analysed
The date and time of analysis

This supports consistent reassessment as documentation or governance arrangements change.

Last updated: January 2026

Scope & Limitations Security & Data Protection