A Persistent 8–15% Failure Rate Across Domains: Evidence for an Epistemic Boundary

elly99 · March 14, 2026, 6:19pm

LLMs can’t fix their own epistemic blind spots.

Not with better prompting.
Not with stricter verification.
Not with more sophisticated agents.

The MarCognity benchmark reveals a persistent 8–15% failure rate across 8 domains,
even when the system is equipped with a full metacognitive layer.

This residual error is not noise.

It may be an emergent property of autoregressive generation itself —
a structural boundary where the epistemic space accessible to a probabilistic model
is narrower than the space of claims requiring justification.

I call this the Epistemic Boundary.
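One way to probe the "not noise" claim above is to put a confidence interval on each domain's failure rate and check whether the lower bound stays clearly above zero. The sketch below uses a Wilson score interval. The function name `wilson_interval` and the counts in `placeholder_counts` are assumptions made up for illustration; they are not MarCognity results, and this is not necessarily how the benchmark computes its figures.

```python
import math


def wilson_interval(failures: int, trials: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial failure rate (z=1.96 is roughly 95%)."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    if not 0 <= failures <= trials:
        raise ValueError("failures must be between 0 and trials")
    p = failures / trials
    denom = 1 + z**2 / trials
    centre = (p + z**2 / (2 * trials)) / denom
    half = z * math.sqrt(p * (1 - p) / trials + z**2 / (4 * trials**2)) / denom
    return centre - half, centre + half


# Hypothetical (failures, trials) per domain: placeholder numbers only,
# NOT taken from the MarCognity benchmark.
placeholder_counts = {
    "domain_a": (12, 100),
    "domain_b": (9, 100),
}

for domain, (failures, trials) in placeholder_counts.items():
    low, high = wilson_interval(failures, trials)
    print(f"{domain}: rate={failures / trials:.2%}, 95% CI=({low:.2%}, {high:.2%})")
```

If the intervals for all domains sit well above zero and overlap the same band, that is consistent with a stable residual rather than sampling noise, though it does not by itself show that the cause is structural.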

Resources:

• Zenodo: https://doi.org/10.5281/zenodo.19020581

• GitHub: elly99-AI/MarCognity-AI (a research framework for structured LLM evaluation, claim verification and reflective reasoning mechanisms)
