The Evidence Gap Problem: What the Data Shows About FDA AI Device Summaries

The FDA has authorized a cumulative total of 1,451 AI/ML-enabled medical devices as of the end of 2025, a figure that has grown from just 6 devices in 2015 to a record 295 clearances in 2025 alone. This rapid expansion, however, has not been matched by a corresponding increase in the quality of clinical evidence underpinning these authorizations. A 2025 cross-sectional study published in JAMA analyzed 691 FDA-cleared AI devices authorized through July 2023 and found that less than 2% were supported by randomized clinical trials. More troubling for regulatory affairs professionals: 46.7% of the FDA 510(k) summaries reviewed did not describe the study design used to validate the device, and 53.3% omitted the sample size entirely.

These transparency deficits are not merely academic concerns. They create real operational risk for health systems evaluating AI tools for procurement, for clinicians trying to assess whether a device will perform reliably in their patient population, and for manufacturers who must navigate an increasingly demanding regulatory environment. The same JAMA study found that only 5.2% of authorized AI devices had any adverse event reports on file, and 5.8% were ever recalled — mostly due to software bugs rather than clinical performance failures. The low adverse event reporting rate may reflect underreporting rather than genuine safety, given that post-market surveillance infrastructure for AI devices remains immature.

The implications for manufacturers are direct and consequential. A 510(k) submission that omits study design or sample size information may have passed FDA review under earlier standards, but the agency is now signaling clearly that such omissions will no longer be acceptable. The FDA's response to these documented evidence gaps takes the form of three interconnected regulatory modernization tools: the Total Product Life Cycle (TPLC) framework, Predetermined Change Control Plans (PCCPs), and new transparency and labeling requirements. Each tool addresses a specific dimension of the evidence problem, and together they represent a fundamental shift from a pre-market review paradigm to continuous lifecycle oversight.

Timeline infographic showing FDA-cleared AI/ML medical device growth from 6 devices in 2015 to 1,451 devices in 2025 with specialty breakdown and regulatory milestone callouts.
Growth of FDA-authorized AI/ML medical devices from 2015 to 2025, with specialty distribution and key regulatory milestones.

The Total Product Life Cycle (TPLC) Framework: January 2025 Draft Guidance

On January 7, 2025, the FDA issued draft guidance titled Artificial Intelligence-Enabled Device Software Functions: Lifecycle Management and Marketing Submission Recommendations (docket number FDA-2024-D-4488). This document represents the agency's most comprehensive statement to date on how it expects manufacturers to manage AI-enabled devices across the entire product lifecycle — not just at the point of initial market authorization.

The guidance identifies nine documentation areas that the FDA recommends manufacturers address in their marketing submissions. These are not optional checkboxes; they form the basis for what the agency considers a complete and defensible submission for an AI-enabled device.

Nine recommended documentation areas in the FDA's January 2025 TPLC draft guidance (FDA-2024-D-4488).
Documentation AreaWhat It Covers
Device DescriptionIntended use, indications for use, device overview, and AI-specific functionality description
User InterfaceDesign considerations, human factors testing, and how AI outputs are presented to users
LabelingInstructions for use, warnings, precautions, and AI-specific disclosure statements
Risk AssessmentHazard identification, risk analysis, and risk control measures specific to AI behavior
Data ManagementData sources, collection methods, labeling processes, and data quality assurance
Model Description and DevelopmentAlgorithm architecture, training methodology, hyperparameters, and training data characteristics
ValidationVerification and validation activities, including test datasets, metrics, and acceptance criteria
Device Performance MonitoringPost-market monitoring plan, performance metrics, and processes for detecting performance degradation
CybersecuritySecurity risk management, vulnerability assessment, and update mechanisms

What distinguishes the TPLC framework from earlier FDA guidance is its emphasis on continuous lifecycle management rather than a single pre-market review event. The guidance explicitly addresses the reality that AI models can change over time — through retraining, fine-tuning, or deployment in new clinical contexts — and that these changes require ongoing oversight. The framework also introduces specific recommendations for bias mitigation, including testing device performance across demographic subgroups defined by age, sex, race, and ethnicity.

For regulatory affairs teams, the TPLC guidance creates immediate documentation obligations. Even if a device was cleared before January 2025, the guidance signals what the FDA will expect in future submissions and, increasingly, in post-market reviews. Manufacturers should begin auditing their existing device documentation against the nine TPLC areas to identify gaps before they become compliance issues.

Predetermined Change Control Plans (PCCPs): Enabling Iterative Improvement

The Predetermined Change Control Plan (PCCP) is arguably the most consequential regulatory innovation in the FDA's AI device toolkit. Finalized in December 2024, the PCCP framework allows manufacturers to pre-specify planned modifications to an AI-enabled device — including algorithm updates, retraining protocols, and performance threshold adjustments — and receive FDA authorization for those changes without submitting a new 510(k) or De Novo request each time.

A PCCP must include three required components:

  • Description of Modification: A detailed specification of the types of changes the manufacturer intends to make, including the scope, frequency, and triggers for each modification. This section must be precise enough that the FDA can evaluate the potential impact of changes that have not yet occurred.
  • Assessment Protocol: The methodology the manufacturer will use to validate and verify each modification before implementation. This includes pre-defined performance metrics, acceptance criteria, test datasets, and validation procedures.
  • Impact Assessment: An analysis of how each type of modification could affect device safety and effectiveness, including risk assessment updates and any necessary labeling changes.

Adoption of PCCPs has been measured but meaningful. In 2025, 10% of all AI/ML device clearances included a PCCP — a notable figure for a framework that was only finalized in December 2024. This adoption rate suggests that early-adopter manufacturers, particularly those with mature quality management systems, are already operationalizing the PCCP pathway.

Three-panel diagram of an FDA Predetermined Change Control Plan showing description of modification, assessment protocol, and impact assessment components.
The three required components of an FDA Predetermined Change Control Plan (PCCP).

A landmark case study is Aidoc's CARE1 foundation model, which received FDA clearance in February 2025 as the first foundation model-powered clinical AI device. The CARE1 clearance included a PCCP, reflecting the reality that foundation models — which can be adapted to multiple clinical tasks through fine-tuning — require a regulatory framework that accommodates iterative improvement without requiring de novo review for each new application.

International harmonization is also advancing. In August 2025, the FDA, Health Canada, and the UK's Medicines and Healthcare products Regulatory Agency (MHRA) jointly published five guiding principles for PCCPs in machine learning-enabled medical devices. These principles cover:

  • Transparency and documentation of planned changes
  • Risk management throughout the device lifecycle
  • Validation and performance monitoring
  • Continuous improvement processes
  • Stakeholder communication and labeling

For manufacturers, the decision to pursue a PCCP is strategic. Devices with predictable, well-characterized update cycles — such as radiology AI tools that require periodic retraining on new imaging data — are strong candidates. Devices with less predictable change patterns, or those where the impact of modifications is difficult to assess prospectively, may be better served by traditional submission pathways until the PCCP framework matures.

New Transparency and Labeling Requirements: Model Cards, Bias Evaluation, and AI Disclosure

The third pillar of the FDA's regulatory modernization effort addresses the transparency deficits that the JAMA study documented. The January 2025 TPLC draft guidance recommends that manufacturers provide structured, plain-language information about their AI devices through several mechanisms.

The most significant recommendation is the use of model cards — structured reports that describe an AI model's characteristics, intended use, performance metrics, limitations, and known biases. Model cards are not new to the AI industry — Google pioneered the concept in 2019 — but their incorporation into FDA guidance represents a formal regulatory endorsement. The guidance recommends that model cards include:

  • Model input and output specifications
  • Performance measures across relevant clinical subgroups
  • Known sources of bias and their potential impact
  • Demographic subgroup testing results (age, sex, race, ethnicity)
  • Limitations and contraindications
  • PCCP monitoring plans where applicable

The FDA also expects manufacturers to provide plain-language AI disclosure statements that are understandable and accessible to the intended users of the device. This goes beyond simply stating that a device uses AI — it requires explaining what the AI does, how it was validated, and what its limitations are in terms that clinicians and patients can act on.

Summary of new transparency and labeling requirements for AI-enabled medical device manufacturers.
Transparency RequirementWhat Manufacturers Must ProvideRegulatory Basis
AI Disclosure StatementPlain-language description of AI functionality, validation approach, and limitationsTPLC draft guidance (Jan 2025)
Model CardStructured report of model characteristics, performance, bias sources, and subgroup testingTPLC draft guidance (Jan 2025)
Bias Evaluation DocumentationTesting results across demographic subgroups; data representativeness analysisTPLC draft guidance (Jan 2025)
PCCP Monitoring PlanPre-specified performance metrics and acceptance criteria for planned modificationsPCCP final guidance (Dec 2024)
Post-Market Performance DataReal-world performance tracking results and data drift analysisTPLC draft guidance; Sept 2025 RFI

Bias evaluation is a particularly demanding requirement. The guidance recommends that manufacturers test device performance across demographic subgroups and document any performance disparities. This is not a one-time exercise — the expectation is that bias monitoring continues throughout the device lifecycle, particularly when models are retrained or deployed in new populations.

Post-Market Monitoring Expectations: Data Drift and Real-World Performance

The FDA's post-market monitoring expectations for AI devices represent a significant departure from traditional medical device surveillance. Unlike a physical implant or a diagnostic reagent, an AI model's performance can degrade over time without any physical change to the device — a phenomenon known as data drift or model drift. Changes in patient populations, imaging equipment, clinical workflows, or documentation practices can all cause a model's performance to shift, sometimes dramatically.

The TPLC draft guidance addresses this by recommending that manufacturers establish ongoing device performance monitoring plans that include:

  • Pre-defined performance metrics and thresholds that trigger investigation when breached
  • Processes for detecting and quantifying data drift in input data distributions
  • Mechanisms for collecting and analyzing real-world performance data
  • Protocols for communicating performance changes to users and, where appropriate, to the FDA

In September 2025, the FDA issued a Request for Public Comment seeking evidence-based methods for measuring real-world AI device performance. This RFI signals that the agency recognizes the current state of post-market surveillance for AI devices is inadequate and is actively seeking input on how to improve it. The RFI covers topics including:

  • Methods for collecting real-world performance data without imposing excessive burden on clinical sites
  • Statistical approaches for detecting performance degradation in deployed AI models
  • Standards for reporting real-world evidence to the FDA and to the clinical community
  • Methods for evaluating device performance across diverse patient populations in real-world settings

The post-market monitoring expectations create practical infrastructure requirements for manufacturers. Teams need systems for collecting and analyzing performance data from deployed devices, processes for distinguishing between expected performance variation and genuine degradation, and clear escalation pathways when performance thresholds are breached. For manufacturers with devices deployed across multiple sites, this requires investment in data aggregation and analytics infrastructure that many do not currently have.

Practical Compliance Roadmap for Manufacturers in 2026

For regulatory affairs and quality assurance teams, the transition from the pre-market review paradigm to lifecycle oversight is not theoretical — it creates concrete actions that should be underway now. The following roadmap is organized by priority and timeline.

Priority compliance roadmap for AI medical device manufacturers in 2026.
PriorityActionTimelineKey Consideration
HighAudit existing device documentation against the nine TPLC areasQ1–Q2 2026Identify gaps in device description, validation, and performance monitoring documentation
HighEvaluate whether planned algorithm updates qualify for PCCP submissionQ1–Q2 2026Focus on devices with predictable, well-characterized update cycles
HighPrepare model cards and bias evaluation documentation for active submissionsQ1–Q2 2026Include demographic subgroup testing results; document data representativeness
MediumEstablish post-market monitoring infrastructureQ2–Q3 2026Define performance metrics, data drift detection methods, and escalation protocols
MediumReview labeling for AI disclosure complianceQ2–Q3 2026Ensure plain-language AI descriptions are accurate and accessible
LowMonitor international harmonization developmentsOngoingTrack FDA-Health Canada-MHRA convergence; prepare for potential alignment

The most urgent action is the documentation audit. Many manufacturers have device descriptions and validation documentation that were prepared under earlier FDA expectations. The TPLC guidance makes clear that the FDA now expects comprehensive documentation across all nine areas. Gaps in areas like data management, model description, and device performance monitoring are particularly common and will require significant effort to address.

The PCCP evaluation is equally urgent but more strategic. Not every device is a good candidate for a PCCP. The framework works best for devices where the manufacturer can clearly describe the types of changes they intend to make and can pre-specify the validation protocols for those changes. Devices with less predictable change patterns, or where the clinical impact of changes is difficult to assess prospectively, may be better served by traditional submission pathways for now.

Model card preparation should begin immediately, even for devices that are not currently under active submission. The model card format is becoming a de facto industry standard, and having one ready demonstrates regulatory maturity. The bias evaluation component is particularly important — manufacturers should be able to show that they have tested their device across relevant demographic subgroups and can document any performance disparities.

Comparison with the EU AI Act: Converging but Distinct Frameworks

Manufacturers seeking to market AI-enabled medical devices in both the United States and the European Union must navigate two distinct but increasingly convergent regulatory frameworks. The EU AI Act, which entered into force in August 2024, classifies AI systems that are safety components of medical devices — or devices themselves — as high-risk. This classification applies to devices classified under MDR as class IIa, IIb, or III, and under IVDR as class A through D.

The timeline for compliance is accelerating. Most high-risk obligations under the AI Act take effect in August 2026, with full compliance required by August 2027. This means manufacturers have roughly 12 to 24 months to bring their AI-specific quality management systems into alignment with the Act's requirements.

Two-column comparison diagram showing FDA and EU AI Act regulatory frameworks with common ground row.
Comparison of FDA and EU AI Act regulatory frameworks for AI-enabled medical devices.
Comparison of FDA and EU AI Act regulatory frameworks for AI-enabled medical devices.
DimensionFDA (United States)EU AI Act (European Union)
Risk ClassificationDevice-specific (510(k), De Novo, PMA)High-risk classification for MDR class IIa–III and IVDR class A–D devices
Lifecycle ApproachTPLC framework (draft guidance Jan 2025)Lifecycle monitoring required under AI Act
Change ManagementPCCP framework (final guidance Dec 2024)Notified body involvement for substantial modifications
TransparencyModel cards, AI disclosure, bias documentationTransparency obligations, human oversight, record-keeping
Post-Market SurveillancePerformance monitoring, data drift detection, Sept 2025 RFIPost-market surveillance under MDR; AI Act record-keeping
Bias EvaluationDemographic subgroup testing recommendedData quality and governance requirements; bias mitigation expected
Conformity AssessmentFDA review (510(k), De Novo, PMA)Single conformity assessment possible where notified bodies are accredited under both MDR and AI Act
Key Effective DatePCCP final Dec 2024; TPLC draft Jan 2025High-risk obligations Aug 2026; full compliance Aug 2027

The common ground between the two frameworks is substantial. Both require lifecycle monitoring, transparency in how AI systems function and are validated, post-market surveillance, and bias evaluation. The EU AI Act layers AI-specific requirements — including data quality governance, human oversight, and record-keeping — on top of existing MDR/IVDR obligations. The Medical Device Coordination Group's MDCG 2025-6 guidance, published in June 2025, clarifies the interplay between MDR and the AI Act, noting that a single conformity assessment is possible where notified bodies are accredited under both frameworks.

The key differences lie in risk classification approach and conformity assessment pathways. The FDA's device-specific classification system (510(k), De Novo, PMA) differs fundamentally from the AI Act's high-risk classification, which applies automatically to certain device categories. Manufacturers should not assume that a device classified as low-risk under one framework will be similarly classified under the other.

For manufacturers operating in both markets, the most efficient approach is to build a quality management system that satisfies the most demanding requirements of each framework. The areas of convergence — lifecycle monitoring, transparency, post-market surveillance, and bias evaluation — should be treated as baseline requirements. The areas of divergence — particularly around risk classification and change management — require jurisdiction-specific planning.

The FDA's regulatory modernization through PCCPs, the TPLC framework, and transparency requirements represents a fundamental shift in how AI medical devices are regulated. For manufacturers, the message is clear: the era of minimal-evidence 510(k) submissions for AI devices is ending. The agencies are converging on a lifecycle oversight model that demands comprehensive documentation, ongoing performance monitoring, and proactive bias management. Manufacturers that invest in these capabilities now will be better positioned for both FDA and EU AI Act compliance in the years ahead.