Data and Evidence

How Fund Analyst Intelligence captures sources, links claims to evidence, enforces provenance, and keeps outputs reproducible.

Data and Evidence

Fund Analyst Intelligence is evidence-first by design.
It treats sources as production artefacts.
It treats provenance as a core data structure.

This is what enables trust.
It is also what enables scale.
Evidence captured once can be reused across cycles and reports.

Objectives

A production evidence system must ensure:

every key claim is traceable to a source
sources are versioned and time-stamped
evidence is preserved across monthly cycles
outputs are reproducible from stored artefacts
conflicts and gaps are explicit, not hidden

Source types

Fund Analyst Intelligence supports controlled source classes.

1. User-provided artefacts

These are the primary source set for most partners.

Examples

DDQ / ODD documents
factsheets and pitch decks
manager letters and periodic updates
fee schedules, term sheets, share class details
internal notes and committee decisions (optional)

User artefacts are stored with version metadata.
They are attached to a specific monthly cycle.

2. Approved online sources

Online sources are policy-defined.
Allow-lists are the default approach.

Examples

manager website publications
official announcements and notices
regulated filings where applicable
policy-approved third-party sources

Online sources are recorded with retrieval timestamps.
Where required, snapshots are retained for reproducibility.

Source policy

A source policy defines:

what domains and source types are permitted
recency requirements for critical fields
rules for conflicting sources
retention and archival expectations
roles and permissions for adding sources

Source policy is a governance decision.
It should be approved by the owning team.
It should be stable across cycles.

Artefact versioning

Every artefact should be treated like production input.

Fund Analyst Intelligence records:

artefact identifier
file name and type
ingestion timestamp
cycle identifier
source origin and metadata
version relationship to prior artefacts

This prevents “latest file ambiguity”.
It also supports reproducibility and audit.

Claims and evidence mapping

The platform distinguishes two things:

Fields: structured fund attributes (e.g., fees, liquidity terms).
Claims: statements extracted from sources (e.g., “monthly dealing with 30 days notice”).

Every field and claim can be linked to evidence:

source artefact id
location reference (page, section, excerpt where possible)
extraction timestamp and method
confidence indicators
reviewer notes if edited or overridden

This mapping is the credibility layer of the system.
It is also what enables fast follow-up.

Evidence packs

An evidence pack is a structured output.
It is produced alongside reports.
It is designed for scrutiny.

A standard evidence pack contains:

source inventory for the cycle
claim-to-evidence mapping for key statements
citations and location references
conflicts and gaps, with status
reviewer decisions and sign-off references

Evidence packs can be delivered as a document bundle or a structured export.
They are the fastest way to answer “where did this come from”.

Handling uncertainty and gaps

Production systems must handle imperfect inputs.

Fund Analyst Intelligence uses explicit states:

supported: evidence exists and is consistent
conflicted: evidence exists but contradicts
missing: required evidence is absent
stale: evidence exists but violates recency policy
needs verification: evidence is weak or ambiguous

These states appear in exceptions and reports.
They prevent silent assumptions.
They also guide follow-up work.

Conflict resolution rules

When sources conflict, the system does not guess.
It surfaces the conflict as an exception.

Resolution can follow a policy such as:

prefer primary artefacts over secondary references
prefer newer versions where provenance is clear
require reviewer decision for high-severity fields
record rationale and evidence links on resolution

The decision becomes part of the audit trail.
The conflict remains searchable historically.

Reproducibility

A report is reproducible if:

the input artefacts are stored or referenced reliably
extracted fields and claims are versioned by cycle
validation and delta logic are deterministic
reviewer decisions are recorded
report generation uses the stored cycle record

Fund Analyst Intelligence is built around cycle records and snapshots.
This is what makes “re-run and reproduce” possible.

Data minimisation and controlled scope

Evidence-first does not mean collecting everything.
It means collecting what is necessary and defensible.

Partners should define:

required artefacts for baseline and monthly cycles
the fund fields that must be evidence-backed
retention periods and allowed storage locations
who can add sources and when

This keeps operations clean.
It also reduces compliance exposure.

Definition of done

Data and evidence governance is production-grade when:

source policy is explicit and enforced
artefacts are versioned and cycle-linked
claims map to evidence consistently
conflicts and gaps are explicit exceptions
evidence packs support scrutiny without manual reconstruction
reports are reproducible from stored cycle records

This is the foundation of trust in Fund Analyst Intelligence.