·

Clinical Documentation

Primary Care

Clinician

How interruptions cause clinical coding errors in GP consultations

Discover how mid-consultation interruptions fragment GPs' clinical thinking and lead to inaccurate SNOMED CT coding, affecting patient safety and QOF payments

General practitioners in the UK lose more than they realise every time a consultation is interrupted. The immediate cost is visible: a disrupted conversation, a patient who has to repeat themselves, a thought that evaporates mid-sentence. What is less visible is the downstream cost: a clinical code that gets skipped, approximated, or applied to the wrong encounter. Clinical coding is not an administrative afterthought that happens to be imperfect under pressure. It is a direct record of what occurred clinically, and when interruptions fragment the mental model a GP builds during a consultation, that record becomes unreliable in ways that compound across time, across care settings, and across patients.

What counts as an interruption during a GP consultation

Not all interruptions are equal, and not all of them come from outside the room. A useful distinction separates external interruptions (phone calls, urgent patient requests from reception, medical record system alerts, and colleague queries arriving mid-consultation) from internal interruptions, which include the self-directed attention shifts that occur when a GP pauses a conversation to type a note, search a code, or respond to a pop-up within the clinical system.

Research into physician interruptions in outpatient settings identifies history-taking as a particularly high-risk phase, because it is precisely when the GP is building a coherent clinical picture from fragmented information. An interruption at this stage means questions may be forgotten entirely, or asked twice, and neither produces a reliable clinical record.

A qualitative study on phone-call interruptions in primary care found that healthcare workers explicitly raised concerns about medical errors caused by interruptions to their clinical thinking. The phone call is perhaps the most familiar example, but the same mechanism applies to any event that redirects attention, including the medical record system itself, which can function as a source of internal interruption when its interface demands engagement at the wrong moment.

A large NHS mixed-methods study of 61 GPs across 28 practices, involving 238 hours of direct observation, found that interruptions introduced risks of divided cognition and attention, were sometimes safety-critical, and that even goal-driven interruptions left GPs struggling to refocus. Operational failures, including those caused by interruptions, accounted for approximately 5 per cent of all GP tasks observed.

How cognitive switching degrades coding accuracy

The mechanism linking interruptions to coding errors runs through working memory. When a GP's attention is redirected mid-consultation, the mental model they have been building (the patient's presenting complaint, relevant history, examination findings, and the clinical reasoning connecting them) is partially cleared. Resuming the consultation does not restore that model intact. The GP reconstructs it, and reconstruction is imperfect.

This matters for SNOMED CT (Systematised Nomenclature of Medicine Clinical Terms) and Read code selection in a specific way. Accurate coding is not a mechanical lookup task. Selecting the right code for a nuanced presentation, distinguishing for example between 'mixed anxiety and depressive disorder' and 'generalised anxiety disorder', or between 'type 2 diabetes mellitus with diabetic nephropathy' and a more generic diabetes code, requires contextual recall of the full clinical picture. When that picture has been fragmented by an interruption, the GP is more likely to select a broader, less precise code, or to omit a code entirely.

A 2024 study applying cognitive load theory to inpatient consultations found that trainees agreed interruptions were distracting from the very first consultation of a shift, and suggested, in a secondary care training context, that cognitive overload risks may emerge after approximately four consultations. While the researchers conducted this study in a secondary care context, the underlying cognitive mechanism (interruption increasing extraneous load and reducing the capacity for accurate clinical reasoning) applies equally in general practice.

A 2022 analysis of interruptions in clinical consultations distinguished between cooperative interruptions (those that advance the clinical encounter) and intrusive ones (those that disrupt it). The latter category was associated with greater disruption to clinical thinking, and it is intrusive interruptions (the phone call, the colleague at the door, the alert that demands an immediate response) that most directly threaten coding accuracy.

The role of time pressure in compounding coding errors

Interruptions do not occur in a vacuum. In UK general practice, the standard appointment slot is 10 to 15 minutes, a duration that, even without interruption, leaves limited margin for thorough documentation. When time is lost to an interruption, it cannot usually be recovered within the same appointment. The GP must choose between extending the consultation (disrupting the rest of the session) or compressing what remains.

Compression typically means that documentation and coding are the first things to be abbreviated. A GP who has lost two minutes to a phone call mid-consultation is unlikely to spend an additional two minutes carefully selecting the most accurate SNOMED CT code at the end of it. The default is to use an approximate code, one that is close enough to satisfy the immediate need but not precise enough to be clinically reliable over time.

Research on why clinical coding in general practice is inconsistently performed highlights that coding accuracy tends to improve when codes are directly tied to financial incentives, such as Quality and Outcomes Framework (QOF) indicators. This is a revealing finding: it suggests that in the absence of an immediate, visible consequence, time-pressured GPs will deprioritise coding precision. The problem is not a lack of knowledge. It is a structural one, in which the conditions of the consultation make accurate coding difficult to sustain.

Delayed documentation: when coding happens after the consultation

Many GPs do not code at all during the consultation. Notes and codes are deferred to the end of a session, or, particularly in high-volume practices, to the end of the working day. This is a rational response to the demands of the appointment itself, but it introduces a distinct category of coding risk: recall degradation.

Human memory for clinical detail is not stable over even short periods. The specific language a patient used, the severity qualifier they mentioned, the comorbidity that surfaced briefly during a medication review: these details can fade rapidly, often within the same session. What remains is a general impression, and general impressions produce general codes.

A pilot study examining GP consultation performance under interruption conditions, including scenarios involving red-flag cancer presentations, found that interruptions during the consultation affected the completeness of clinical recall. When documentation is deferred beyond the consultation, that already-degraded recall is the only source from which codes are generated.

The consequences are specific and cumulative: omitted codes for conditions that were discussed but not formally documented; incorrect severity markers because the GP cannot recall whether the patient described their symptoms as 'occasional' or 'persistent'; missed comorbidity flags that would have been captured if coding had occurred in real time.

Systematic coding gaps and their effect on QOF payments

The Quality and Outcomes Framework rewards practices for achieving defined clinical standards across their registered population. Achieving those standards requires that patients are correctly coded onto the relevant disease registers in the first place. A patient with type 2 diabetes who is not coded as such does not appear on the diabetes register. A patient with hypertension who is coded with a non-specific blood pressure code may not trigger the relevant QOF indicators. The practice loses the payment. More significantly, the patient loses the structured monitoring that the framework is designed to deliver.

A quality improvement project examining cancer-related clinical coding in primary care in North Central London found substantial variation in coding quality across practices, with gaps in cancer pathway documentation that had direct implications for care coordination and performance reporting. While focused on oncology, the mechanisms identified (inconsistent code selection, missed codes, and reliance on free text rather than structured codes) are not specific to cancer care. They reflect systemic patterns in how coding is performed under real-world conditions.

Research on SNOMED CT coding in general practice also identifies duplicate codes within the coding system itself as a source of ambiguity, situations where multiple codes exist for the same clinical concept, and where a GP under time pressure may select whichever appears first rather than the most appropriate one. This is not a failure of clinical knowledge. It is a predictable consequence of performing a cognitively demanding task in suboptimal conditions.

How coding inaccuracy affects audit trails and clinical safety

Beyond QOF, the longitudinal patient record is the substrate on which safe ongoing care depends. Clinical decision support tools (the alerts that flag drug interactions, the prompts that identify patients overdue for a review) are only as reliable as the coded data that triggers them. An under-coded encounter creates a gap in that substrate.

Tandem Health's analysis of clinical coding errors and patient safety traces how SNOMED CT codes travel across care settings, noting that an inaccuracy recorded in primary care does not remain in primary care. It follows the patient into secondary care referrals, discharge summaries, and any system that draws on the shared record. Research on coding error propagation, cited in that analysis, found that errors introduced at the point of care compound as they move through the system.

During Care Quality Commission (CQC) inspections, significant event reviews, or medico-legal proceedings, the coded record is treated as the authoritative account of what occurred clinically. Where that record is incomplete or inaccurate (because an interruption prevented accurate coding, or because documentation was deferred and recall was imperfect) the gap becomes a liability. The clinical encounter happened; the record does not reflect it.

Which clinical areas are most vulnerable to interruption-driven coding errors

Some clinical domains are more exposed than others, because they depend on accurate, complete coding for safe ongoing care rather than simply for administrative completeness.

  • Mental health: Coding distinctions between anxiety disorders, depressive episodes, and mixed presentations carry direct implications for prescribing, referral pathways, and safeguarding decisions. These distinctions are among the most vulnerable to the cognitive compression that interruptions produce.

  • Chronic disease management: Diabetes, hypertension, asthma, and chronic obstructive pulmonary disease (COPD) are managed through structured programmes that depend on disease register accuracy. A missed or approximate code at one encounter can affect the patient's care trajectory for years.

  • Safeguarding: Safeguarding flags require precise, unambiguous coding. An interruption that causes a safeguarding concern to be noted in free text rather than coded, or not documented at all, creates a gap that may not be visible until a serious incident review.

  • Medication reviews: Medication reviews often surface comorbidities, side effects, and adherence issues that require coding. Under time pressure following an interruption, these secondary findings are the most likely to be omitted.

  • Cancer pathways: As the North Central London quality improvement project demonstrated, cancer-related coding in primary care is inconsistently performed, with consequences for referral accuracy and pathway tracking.

How ambient voice technology can reduce coding errors at the point of care

Ambient voice technology (AVT) addresses the interruption-to-coding-error chain at its most vulnerable point: the moment of the consultation itself. Rather than requiring the GP to divide attention between the patient and the clinical system, an AI medical assistant using AVT captures clinical content in real time as the conversation unfolds. The GP does not need to type, click, or code mid-consultation. The assistant generates a structured note and, in more advanced implementations, suggests relevant clinical codes based on the content of the encounter.

A registered clinical trial examining ambient voice technology in general practice notes that over half of GPs spend more than 20 per cent of their working time on administration, and that documentation burden negatively affects patient-provider communication. By reducing the cognitive load associated with real-time documentation, AVT reduces the degree to which medical record system interaction itself functions as an internal interruption.

A scoping review of digital scribes in primary care found that automatic speech recognition and natural language processing technologies supporting clinical documentation are now sufficiently mature for deployment in primary care environments, though the review also noted variability in accuracy across accents, clinical specialties, and recording conditions, a limitation that practices should account for when evaluating specific tools.

A comparative analysis of AI scribes versus human documentation in simulated general practice consultations found that AI-generated documentation was competitive with human documentation in terms of clinical content capture, though the study acknowledged that simulated conditions may not fully replicate the complexity of real-world GP consultations. Evidence from medical oncology similarly found that AI scribes reduced documentation time and improved clinician satisfaction, with physicians reporting that they could focus more fully on the patient during the encounter.

The coding benefit is indirect but significant. When a structured note is generated from a complete, uninterrupted capture of the consultation (rather than from a GP's post-hoc recall of a fragmented encounter) the clinical content available for code selection is more complete, more precise, and more likely to reflect what actually occurred.

What practices can do to protect SNOMED CT accuracy under real-world conditions

No single intervention eliminates coding errors, and it would be misleading to suggest that AVT or any other tool resolves the structural pressures of UK general practice. The evidence points to a combination of systemic changes rather than any single fix.

Reduce avoidable interruptions at the workflow level. The NHS mixed-methods study found that many interruptions reaching GPs mid-consultation could be managed by reception staff or triaged before they reached the clinician. Practices that have implemented clear protocols for what constitutes a genuine mid-consultation emergency (and what can wait) report fewer intrusive interruptions without compromising patient safety.

Use structured templates that prompt complete coding. Templates that include mandatory fields for condition severity, comorbidity flags, and relevant SNOMED CT categories reduce the likelihood that a time-pressured GP will omit a code simply because the interface does not prompt for it. Research on clinical decision support in primary care confirms that workflow-integrated prompts improve documentation completeness, though uptake depends on usability. Poorly designed prompts are ignored or dismissed.

Code at the point of care where possible. The evidence on recall degradation is consistent: the longer the gap between the clinical encounter and the documentation of it, the less accurate the record. Practices that have moved towards real-time or near-real-time coding (supported by AVT or by protected time immediately after each consultation) report fewer coding gaps than those that batch documentation at the end of a session.

Invest in coding training that addresses SNOMED CT ambiguity. Research on GP coding inconsistency identifies training gaps as a contributing factor, particularly around SNOMED CT duplicate codes and the selection of the most specific applicable code. This is a solvable problem, but it requires investment in education that goes beyond basic medical record system training.

Treat coding accuracy as a practice-level quality indicator. Where coding quality is monitored through regular audits of disease register completeness, QOF achievement rates, and significant event reviews that include documentation quality, practices are better positioned to identify systematic gaps before they become patient safety issues. Individual clinician responsibility is not a sufficient frame. The conditions that produce coding errors are structural, and the response needs to be structural too.

The commentary on interruptions in medical education makes a point that applies equally to established practice: real-world clinical work is 'fraught with interruptions and competing tasks', and systems that assume otherwise will consistently underperform. Designing for interruption, rather than assuming it can be eliminated, is the more honest and more effective approach.

Frequently asked questions

▶ How do interruptions during a GP consultation affect clinical coding accuracy?

Interruptions disrupt the mental model a GP builds during a consultation. When attention is redirected, working memory is partially cleared, and the GP reconstructs the clinical picture on resuming. That reconstruction is imperfect. Selecting the right Systematised Nomenclature of Medicine Clinical Terms (SNOMED CT) code for a nuanced presentation requires full contextual recall. When that context has been fragmented, GPs are more likely to select a broader, less precise code or omit a code entirely.

▶ What types of interruptions pose the greatest risk to coding quality?

Interruptions fall into two categories. External interruptions include phone calls, colleague queries, and reception requests arriving mid-consultation. Internal interruptions are self-directed attention shifts, such as pausing to type a note, search a code, or respond to a medical record system alert. A 2022 analysis distinguished between cooperative interruptions, which advance the clinical encounter, and intrusive ones, which disrupt clinical thinking. Intrusive interruptions pose the greater risk to coding accuracy.

▶ Why does time pressure compound coding errors in general practice?

Standard GP appointment slots of 10 to 15 minutes leave limited margin for thorough documentation even without interruption. When time is lost to an interruption, it can't usually be recovered within the same appointment. Documentation and coding are typically the first things abbreviated. A GP who has lost two minutes to a phone call mid-consultation is unlikely to spend additional time carefully selecting the most accurate SNOMED CT code afterwards. Research on coding in general practice confirms that coding precision tends to improve only when it's directly tied to financial incentives, such as Quality and Outcomes Framework (QOF) indicators.

▶ What are the risks of deferring clinical coding until after the consultation?

Many GPs defer notes and codes to the end of a session or the end of the working day. Human memory for clinical detail fades rapidly, sometimes within the same session. Specific language a patient used, severity qualifiers, and comorbidities that surfaced briefly during a medication review can all be lost. What remains is a general impression, and general impressions produce general codes. A pilot study examining GP consultation performance under interruption conditions found that interruptions during the consultation affected the completeness of clinical recall, meaning deferred documentation relies on an already-degraded source.

▶ How do coding gaps affect QOF payments and patient care?

The QOF rewards practices for achieving defined clinical standards across their registered population. Achieving those standards requires that patients are correctly coded onto the relevant disease registers. A patient with type 2 diabetes who isn't coded as such doesn't appear on the diabetes register. A patient with hypertension coded with a non-specific blood pressure code may not trigger the relevant QOF indicators. The practice loses the payment, and the patient loses the structured monitoring the framework is designed to deliver.

▶ Which clinical areas are most vulnerable to interruption-driven coding errors?

Mental health coding is particularly exposed, because distinctions between anxiety disorders, depressive episodes, and mixed presentations carry direct implications for prescribing, referral pathways, and safeguarding decisions. Chronic disease management, including diabetes, hypertension, asthma, and chronic obstructive pulmonary disease (COPD), depends on disease register accuracy that a missed or approximate code can undermine for years. Safeguarding flags, medication reviews, and cancer pathways are also high-risk areas, as a quality improvement project in North Central London found substantial variation and gaps in cancer-related coding with direct consequences for care coordination.

▶ How does ambient voice technology help reduce coding errors at the point of care?

Ambient voice technology (AVT) captures clinical content in real time as the consultation unfolds, so the GP doesn't need to type, click, or code mid-consultation. The AI medical assistant generates a structured note and, in more advanced implementations, suggests relevant clinical codes based on the encounter content. A registered clinical trial examining AVT in general practice notes that over half of GPs spend more than 20 per cent of their working time on administration, and that documentation burden negatively affects patient-provider communication. By reducing the cognitive load of real-time documentation, AVT reduces the degree to which the medical record system itself functions as an internal interruption.

▶ How accurate is AI-generated documentation compared with human documentation in GP consultations?

A comparative analysis of AI scribes versus human documentation in simulated general practice consultations found that AI-generated documentation was competitive with human documentation in terms of clinical content capture. The study acknowledged that simulated conditions may not fully replicate the complexity of real-world GP consultations. A scoping review of digital scribes in primary care found that automatic speech recognition and natural language processing technologies supporting clinical documentation are now sufficiently mature for deployment in primary care, though accuracy varies across accents, clinical specialties, and recording conditions.

▶ What practical steps can GP practices take to protect SNOMED CT coding accuracy?

The evidence points to a combination of approaches. Practices can reduce avoidable interruptions by implementing clear protocols for what constitutes a genuine mid-consultation emergency. Structured templates with mandatory fields for condition severity and comorbidity flags reduce the likelihood that a time-pressured GP omits a code. Coding at the point of care, rather than deferring to the end of a session, reduces recall degradation. Investing in training that addresses SNOMED CT ambiguity and duplicate codes helps clinicians select the most specific applicable code. Treating coding accuracy as a practice-level quality indicator, monitored through regular audits, positions practices to identify systematic gaps before they become patient safety issues.

▶ Why do coding inaccuracies in primary care matter beyond the GP practice?

SNOMED CT codes travel with the patient across care settings. An inaccuracy recorded in primary care follows the patient into secondary care referrals, discharge summaries, and any system that draws on the shared record. Clinical decision support tools, including alerts that flag drug interactions and prompts that identify patients overdue for review, are only as reliable as the coded data that triggers them. During Care Quality Commission (CQC) inspections, significant event reviews, or medico-legal proceedings, the coded record is treated as the authoritative account of what occurred clinically. Where that record is incomplete or inaccurate, the gap becomes a liability.

Get started with Tandem today

Join thousands of clinicians enjoying stress-free documentation.

Get started with Tandem today

Join thousands of clinicians enjoying stress-free documentation.

Get started with Tandem today

Join thousands of clinicians enjoying stress-free documentation.