·

Clinical Documentation

Primary Care

Healthcare IT / CIO

Free-text notes & population health data in GP practices

How unstructured clinical notes undermine disease registers, QOF reporting, and resource allocation in primary care – and how AI can bridge the gap

Clinical documentation in general practice has always involved a trade-off. The narrative note captures uncertainty, context, and nuance in the clinician’s own words. But as GP practices face growing pressure to contribute accurate, coded data to national registries, Quality and Outcomes Frameworks (QOF), and integrated care system (ICS) dashboards, that same narrative note quietly undermines the quality of population health data at scale. The gap between how clinicians document care and what reporting systems can actually read is not a minor technical inconvenience. It is a structural problem with measurable consequences for disease registers, resource allocation, and health equity.

What unstructured free-text documentation means in practice

In the context of GP consultations, free-text notes are narrative entries written directly into the medical record system in the clinician’s own language, as opposed to structured, coded fields that assign standardised identifiers to diagnoses, symptoms, medications, and outcomes. A GP might type “patient reports feeling low in mood, sleep disrupted, appetite reduced — likely depression, discussed watchful waiting” into a notes field. That entry is clinically meaningful. To a reporting system, it is invisible.

Free-text remains the dominant mode of clinical documentation in primary care for understandable reasons. It is faster than navigating structured templates during a ten-minute consultation. It accommodates clinical uncertainty: conditions that are suspected but not confirmed, symptoms that don’t yet fit a diagnostic category, or patient-reported experiences that resist standardised labels. It preserves the relational texture of a consultation in a way that a dropdown menu cannot. Over 80% of digital healthcare data is unstructured, and primary care is no exception.

How population health reporting works in GP practices

Population health reporting in general practice depends almost entirely on structured, machine-readable data. When a clinician assigns a SNOMED CT (Systematized Nomenclature of Medicine Clinical Terms) code to a diagnosis or clinical finding during or after a consultation, that code is recorded in the medical record system in a way that can be extracted, aggregated, and reported. Disease registers, for conditions including diabetes, hypertension, chronic obstructive pulmonary disease, and serious mental illness, are built from these coded entries. QOF indicators are calculated against them. ICS dashboards and integrated care board (ICB) planning tools draw from them.

National audit programmes, including the National Diabetes Audit and the Primary Care Network (PCN) dashboard metrics used by NHS England, rely on the same structured data layer. So do the risk stratification tools that ICBs use to identify high-need cohorts and direct resources toward underserved populations. None of these systems can interrogate a free-text narrative. Clinical notes written in free text may not be easily translated to structured data fields, resulting in missing information on symptoms, exposures, and outcomes, which is a direct barrier to population-level surveillance.

Where free-text notes break the reporting chain

The failure points are specific and well-documented. When a clinician records a new diagnosis in a consultation note without assigning a corresponding SNOMED CT code, that diagnosis does not appear on the disease register. When a risk factor such as smoking status, alcohol use, or family history of cardiovascular disease is documented only in narrative text, it is not counted in the structured fields that drive QOF indicators or risk scores. When an outcome such as a referral decision or a medication change is described in free text rather than coded, it is lost to any downstream reporting.

Research using 2.9 million patient records from a Dutch GP database has quantitatively demonstrated that structured codes and unstructured clinical notes are highly complementary rather than redundant. Most concepts recorded in one data type do not appear in the other. Depression consultations were found to rely heavily on unstructured data, with relatively few concepts captured in structured codes alone. This means that for entire clinical domains, the data feeding population health reports may be systematically incomplete.

A systematic review of 43 UK studies using clinical free text found that studies including free-text data showed improved accuracy compared to those relying on structured codes alone. It also found that free-text data is routinely stripped out before records are made available for research, leaving a substantial source of population health intelligence untapped. The review highlighted that UK GP clinic notes and mental health records are particularly reliant on free-text narratives.

The problem extends to how new diagnoses enter the record. New diagnoses are often only recorded in letters filed under administrative codes such as “letter from specialist”, rather than under the relevant diagnostic code. The clinical information is present in the medical record system, but it is stored in a location and format that reporting systems cannot access.

A study comparing diagnostic codes with natural language processing (NLP) analysis of free-text clinical notes in primary care found substantially different prevalence estimates depending on which data source was used. The coded data systematically undercounted cases. Applied to a population health context, this means that disease registers built solely from structured codes will routinely underestimate true prevalence, and the degree of underestimation will vary by condition, clinician, and practice.

The real-world consequences for GP practices and ICBs

The downstream effects operate at multiple levels simultaneously. At practice level, incomplete disease registers reduce the population eligible for QOF achievement, which can lower overall achievement scores and associated income. If patients with a relevant diagnosis are recorded only in free text rather than on the disease register, they fall outside the denominator used to calculate QOF performance, even though the clinical care may have been delivered appropriately.

At ICB level, the consequences are more diffuse but potentially more significant. Risk stratification tools that assign patients to high, medium, or low need categories depend on the completeness of coded data. Skewed or incomplete structured data leads to inaccurate population risk stratification and, by extension, to resource allocation decisions that do not reflect actual population need. Cohorts that should be prioritised for targeted interventions, including people with undiagnosed long-term conditions, those with multiple comorbidities, or those from communities with historically lower rates of coded diagnosis, may not appear in the data at all.

Social determinants of health and quality-of-life measures are rarely captured in structured medical record fields, and are instead documented, if at all, in free-text notes. This creates a particular problem for health inequality monitoring. If the data used to identify underserved populations is concentrated in a format that reporting systems cannot read, those populations remain invisible to planning processes, compounding existing inequity rather than addressing it.

Disease registers and quality-of-care assessments using medical records will be misleading if free-text information is not taken into account, a finding that has been replicated across multiple clinical domains, including geriatric syndromes and post-surgical outcomes documented almost exclusively in narrative notes.

Why clinicians default to free text

The prevalence of free-text documentation is not primarily a behaviour problem. It is a system design problem.

Structured templates and coding interfaces in most medical record systems are not designed around the pace and cognitive demands of a GP consultation. Navigating dropdown menus, searching for the correct SNOMED CT code, and populating structured fields takes time that is not available in a standard appointment. Clinical uncertainty, stigma, time pressures, and poor clinician training in coding are all documented reasons why structured fields remain incomplete, even when clinicians understand their importance.

The documentation burden in primary care is real and well-evidenced. Asking clinicians to choose between giving full attention to the patient in front of them and ensuring their notes are correctly coded is not a reasonable ask. Systems that frame this as a compliance issue rather than a workflow design issue are unlikely to produce sustainable improvement.

Manual coding of free-text clinical data is both time-consuming and costly, and the cognitive load of accurate clinical coding during or immediately after a consultation is significant. Free-text is not a workaround. For many clinicians, it is the only realistic option given the tools available.

How ambient voice technology and AI assistants are changing the equation

The structural tension between documentation speed and data quality is not inevitable. Ambient voice technology (AVT), which uses artificial intelligence (AI) to listen to a consultation in real time and generate both narrative notes and structured clinical data simultaneously, changes the terms of the trade-off.

Rather than requiring the clinician to choose between writing a comprehensive note and assigning the correct code, an AI medical assistant using AVT can do both in the background. The clinician conducts the consultation as they normally would. The assistant listens, generates a draft clinical note, and surfaces suggested SNOMED CT codes for review. The clinician reviews and confirms, a process that takes seconds rather than minutes and does not interrupt the consultation itself.

This approach addresses the root cause of free-text overreliance: the fact that structured documentation currently requires more time and cognitive effort than narrative documentation. Innovative methods to improve the structured capture of clinical data are needed to facilitate the use of routinely collected clinical data for patient phenotyping and population health reporting. AVT-based AI assistants represent one of the most practical implementations of this principle in primary care.

Automated classification of unstructured free-text primary care data for disease prevalence estimation is technically feasible, but applying it retrospectively to existing records is resource-intensive. The more efficient intervention is to prevent the data gap from forming in the first place, by supporting structured capture at the point of consultation.

What good documentation looks like: structured notes without slowing down care

A documentation workflow that genuinely supports population health reporting has several identifiable characteristics:

  • Automatic code suggestions surfaced post-consultation, based on what was said during the appointment, not requiring the clinician to search for codes manually during or after the encounter

  • Structured data captured in the background, so that disease register entries, QOF-relevant indicators, and referral decisions are recorded in machine-readable format without additional clinician input

  • Templates populated without manual data entry, drawing on the consultation transcript to pre-fill structured fields that would otherwise be left blank or completed in free text

  • Medical record system integration at the data layer, so that structured outputs flow directly into the relevant fields in the clinical system rather than requiring a separate documentation step

The distinction between this approach and legacy bolt-on coding tools is significant. Earlier coding assistance tools required clinicians to engage with a separate interface, search for codes manually, or review long lists of suggestions generated from billing data rather than clinical content. AI-native approaches that operate from the consultation transcript and surface a small number of high-confidence code suggestions in context are meaningfully different in terms of workflow impact.

Combining structured and unstructured medical record data consistently produces more accurate patient identification and population health insights than either data type alone. The goal of AI-assisted documentation is not to eliminate free-text notes, which carry genuine clinical value, but to ensure that the structured data layer is populated consistently and accurately alongside them.

Key considerations when evaluating AI documentation tools for GP practices

For healthcare decision-makers assessing AI documentation tools in the context of population health reporting, several criteria are particularly relevant.

Accuracy of SNOMED CT code suggestions. The clinical value of an AI assistant depends on the accuracy and specificity of its coding suggestions. Tools should be evaluated against real primary care consultation data, with transparency about false positive and false negative rates for clinically significant codes.

Medical record system integration depth. A tool that generates structured outputs in a proprietary format rather than writing directly to the relevant structured fields in the practice’s medical record system does not solve the population health reporting problem. Integration at the data layer, not just the interface layer, is the relevant standard.

Data security and privacy. GP consultation data is among the most sensitive personal data processed in any sector. Tools must comply with UK General Data Protection Regulation (GDPR), with clear documentation of data residency, processing agreements, and access controls. ISO 27001 (the international standard for information security management) certification is a baseline expectation.

Medical device classification. AI tools that influence clinical documentation and coding decisions may be classified as medical devices under the UK Medical Device Regulation (MDR). Practices and ICBs should confirm the regulatory status of any tool under evaluation and understand what clinical safety obligations that classification entails.

Evidence of real-world performance in primary care. Peer-reviewed evidence, or at minimum independently validated performance data from UK primary care settings, should be available before deployment at scale. Performance in secondary care or US health system contexts does not reliably predict performance in UK general practice.

Documentation quality is a population health issue

The quality of population health data is inseparable from how individual clinicians document individual consultations. Every free-text entry that contains a clinically significant finding, such as a new diagnosis, an uncontrolled risk factor, or a deteriorating long-term condition, but is not accompanied by a structured code represents a gap in the data that feeds disease registers, QOF calculations, and ICB planning tools.

This is not a problem that can be solved by asking clinicians to code more carefully. The documentation burden in primary care is already substantial, and adding to it is neither sustainable nor effective. The practical path forward is to change the relationship between consultation and documentation, using AI assistance to ensure that structured data capture happens as a by-product of clinical care rather than as an additional task layered on top of it.

Predictive models using both structured data and unstructured narrative notes consistently outperform those using either data type alone. The same principle applies to population health reporting: the most accurate and complete picture of population health emerges when structured and unstructured data are captured together, consistently, at the point of care. Achieving that, at scale, across a GP practice or an ICB, is one of the most consequential infrastructure decisions available to healthcare decision-makers today.

Frequently asked questions

▶ Why does free-text clinical documentation cause problems for population health reporting?

Population health reporting depends on structured, machine-readable data. When a clinician records a diagnosis or risk factor in a narrative note without assigning a corresponding SNOMED CT (Systematized Nomenclature of Medicine Clinical Terms) code, that information is invisible to disease registers, Quality and Outcomes Framework calculations, and integrated care system dashboards. The clinical detail is present in the record, but reporting systems can’t read it.

▶ How much clinical data in primary care is unstructured?

Research suggests that over 80 per cent of digital healthcare data is unstructured, and primary care is no exception. Free-text narrative notes are the dominant mode of clinical documentation in GP settings, meaning a significant proportion of clinically meaningful information never enters the structured data layer that feeds reporting systems.

▶ What are the practical consequences for GP practices when documentation is incomplete?

At practice level, incomplete disease registers mean missed Quality and Outcomes Framework points and lost income. If a diagnosis is recorded only in free text, the patient won’t appear on the relevant register, and the practice can’t claim the associated payments, even if the clinical care was delivered. At integrated care board level, skewed structured data leads to inaccurate risk stratification and resource allocation decisions that don’t reflect actual population need.

▶ Why do clinicians default to free-text notes rather than structured coding?

The prevalence of free-text documentation is a system design problem, not a behaviour problem. Navigating dropdown menus and searching for the correct SNOMED CT code takes time that isn’t available in a standard ten-minute consultation. Clinical uncertainty, stigma, time pressure, and limited training in coding are all documented reasons why structured fields remain incomplete. For many clinicians, free text is the only realistic option given the tools available.

▶ Which clinical areas are most affected by reliance on free-text documentation?

Research using 2.9 million patient records from a Dutch GP database found that depression consultations rely almost exclusively on unstructured data. A systematic review of 43 UK studies found that UK GP clinic notes and mental health records are particularly reliant on free-text narratives. Social determinants of health and quality-of-life measures are also rarely captured in structured fields, which creates specific problems for health inequality monitoring.

▶ How does ambient voice technology help with structured clinical documentation?

Ambient voice technology (AVT) uses artificial intelligence to listen to a consultation in real time and generate both a narrative note and suggested SNOMED CT codes simultaneously. The clinician conducts the consultation as normal. The AI assistant drafts the note and surfaces code suggestions for review, a process that takes seconds and doesn’t interrupt the appointment. This means structured data capture happens alongside the consultation rather than as a separate task afterwards.

▶ Does using AI assistance mean removing free-text notes from clinical records?

No. The goal of AI-assisted documentation isn’t to eliminate free-text notes, which carry genuine clinical value. Research consistently shows that combining structured and unstructured data produces more accurate patient identification and population health insights than either data type alone. The aim is to ensure the structured data layer is populated consistently and accurately alongside narrative notes, not to replace one with the other.

▶ What should GP practices and integrated care boards look for when evaluating AI documentation tools?

Key criteria include the accuracy of SNOMED CT code suggestions, the depth of integration with the practice’s medical record system, and compliance with UK General Data Protection Regulation (GDPR) including clear data residency and access controls. ISO 27001 (the international standard for information security management) certification is a baseline expectation. Practices should also confirm whether a tool is classified as a medical device under the UK Medical Device Regulation (MDR), and look for independently validated performance data from UK primary care settings specifically.

▶ How does incomplete structured data affect health equity?

Risk stratification tools that identify high-need cohorts depend on complete coded data. When clinically significant information sits only in free-text notes, communities with historically lower rates of coded diagnosis may not appear in planning data at all. Social determinants of health are rarely captured in structured fields, so the populations that most need targeted interventions can remain invisible to the processes designed to reach them.

▶ Is automated coding of existing free-text records a practical solution?

Automated classification of unstructured free-text data using natural language processing is technically feasible, but applying it retrospectively to existing records is resource-intensive. Research also shows that free-text data is routinely stripped out before records are made available for analysis, limiting what retrospective tools can access. The more efficient approach is to prevent the data gap from forming in the first place by supporting structured capture at the point of consultation.

Empieza a usar Tandem hoy

Únete a miles de facultativos que disfrutan de una documentación sin estrés.

Empieza a usar Tandem hoy

Únete a miles de facultativos que disfrutan de una documentación sin estrés.

Empieza a usar Tandem hoy

Únete a miles de facultativos que disfrutan de una documentación sin estrés.