Discussion about this post

sadok's avatar

Your analysis of provenance as a structural condition of knowledge—rather than a mere archival technique—touches a deeper ontological fault line in current AI systems. The problem, as you frame it, is not simply that sources are lost, but that the generative architecture itself dissolves the relational fabric through which knowledge acquires legitimacy, continuity, and intelligibility.

In my own philosophical work on event and temporality, I approach a related question from a different angle: knowledge is not a static object that carries provenance as an external attribute, but an event emerging within a structured field of relations. When those relations are reduced to probabilistic patterning without preserving their formative dynamics, what decays is not only attribution, but the very ontological coherence of what counts as knowledge.

I believe that part of the solution may lie in rethinking knowledge infrastructures in explicitly event-relational terms—where temporal emergence, structural dependency, and generative linkage are modeled as intrinsic rather than supplementary features. Provenance, in such a framework, would not be appended metadata but a trace of the constitutive processes that bring knowledge into presence.

The challenge, then, may not be merely to reattach sources to outputs, but to redesign our systems so that the dynamics of formation remain structurally legible.

Eugenio's avatar

Jessica, this piece articulates with remarkable precision a problem we've been working to solve architecturally in southern Brazil.

We're a small team building what we call the PATOS–Lector–PRISMA (PLP) infrastructure — a normative information architecture for pharmaceutical knowledge management. Your diagnosis that "where provenance ends, knowledge decays" maps almost exactly to the gap we identified: most existing systems (RAG pipelines, clinical decision support, conversational agents) collapse three epistemologically distinct operations into a single layer — document preservation, semantic interpretation, and contextual presentation. The result is precisely what you describe: provenance becomes irrecoverable.

Our response is a three-layer architecture:

- PATOS — a sovereign document preservation layer (17,000+ regulatory documents with explicit versioning and bidirectional traceability). No compression, no blending, no statistical distribution across parameters. The source remains the source.

- Lector — machine-assisted reading with explicit human curation, producing what we call Evidence Packs: typed assertions anchored to primary sources, with epistemic boundaries and curatorial decisions documented. This is where we operationalize your point about knowledge requiring "processing through verification systems" — the interpretive act itself becomes traceable.

- PRISMA — contextual presentation through what we call the RPDA framework (Regulatory, Prescription, Dispensing, Administration), which refracts the same informational core into distinct professional views governed by context, risk, and responsibility.

The key insight we share with your analysis: provenance is not metadata you append — it's an architectural property you either preserve structurally or lose irreversibly. What we add is that preserving provenance is necessary but insufficient; you also need the assertions derived from sources to be fully traceable to the interpretive act that produced them, the curator who validated them, and the epistemic boundaries that constrain them.

Your observation about "vibe citing" resonates deeply in the pharmaceutical domain — where a hallucinated drug interaction or a fabricated contraindication isn't an academic inconvenience but a patient safety risk.

We're currently preparing a paper formalizing this architecture (grounded in OAIS, Buckland's tripartite information, hermeneutic theory, speech act theory, and evidence-based medicine). Your earlier piece on context graphs vs. semantic layers was already part of our theoretical framework — this new one strengthens the provenance argument considerably.

From Porto Alegre, Brazil — glad to see the same structural diagnosis emerging independently across continents.

16 more comments...

No posts

Ready for more?