ChatPPG Editorial

PPG Wearable Testing Standards: IEEE, ANSI, ISO, and What They Actually Require

A practical guide to the key standards governing PPG and wearable heart rate sensor testing: IEEE 60601, ANSI/AAMI, ISO 80601, and what researchers need to know about validation protocol design.

ChatPPG Research Team
9 min read
PPG Wearable Testing Standards: IEEE, ANSI, ISO, and What They Actually Require

Validating a PPG wearable requires navigating a patchwork of international standards, FDA guidance documents, and evolving best practices. For researchers designing validation studies, product developers seeking regulatory clearance, and clinicians evaluating accuracy claims, understanding what these standards actually require — and what they don't — is essential context.

The Regulatory Landscape for PPG Wearables

No single universal standard governs all PPG wearable devices. The applicable standards depend on:

  • What the device claims to measure (HR, SpO2, AFib, blood pressure)
  • Whether it is positioned as medical, wellness, or research equipment
  • The regulatory jurisdiction (FDA in US, CE marking in EU, PMDA in Japan)
  • Device classification tier (Class I, II, or III)

The most relevant standards form a layered stack:

  1. ISO 80601-2-61 — Pulse oximeter accuracy requirements (SpO2)
  2. ANSI/AAMI EC57 — Heart rate monitor testing and performance criteria
  3. IEEE 11073 (10420 sub-standard) — Health informatics data communication for wearables
  4. IEC 60601-1 — General medical electrical equipment safety
  5. FDA Guidance: De Novo Request for PPG Wearables (2020, updated 2022)

ISO 80601-2-61: The Pulse Oximeter Standard

This is the primary accuracy standard for pulse oximeters, including wearable devices claiming to measure SpO2.

Key Requirements

Accuracy range: The device must demonstrate SpO2 accuracy (ARMS ≤3.5%) across the range 70-100%. Historical standards only required testing at 90-100% — clinical reality (hypoxia, respiratory failure, altitude) demanded the 70-80% range be included.

Minimum data requirements:

  • ≥200 data pairs comparing device SpO2 to co-oximetry reference across the 70-100% range
  • At least 10% of data pairs must be at saturation levels 70-80%
  • Testing in a minimum of 10 subjects; at least 10% of subjects in each Fitzpatrick skin tone group (I-III, IV-VI)

Motion testing: Devices claiming accuracy during motion must demonstrate ARMS ≤3.5% during a specified motion protocol. The motion protocol involves arm elevation and lowering at defined angles and frequencies.

Multi-ethnic testing requirement (added via FDA guidance, not original ISO standard): As of 2023 FDA expectations, devices must test across skin tone distribution including ≥15% dark skin tones (Fitzpatrick V-VI). This was a significant change driven by evidence of skin tone bias in consumer and medical devices.

What ISO 80601-2-61 Does Not Cover

  • Performance below 70% SpO2
  • Specific motion artifact types beyond the standardized protocol (e.g., exercise, tremor)
  • Continuous monitoring accuracy over extended wear durations (>8 hours)
  • Performance in disease states (anemia, methemoglobinemia, low perfusion)

Devices that claim only "wellness" SpO2 measurement can position themselves outside this standard's scope, which is exactly what Apple Watch and most consumer wearables do.

ANSI/AAMI EC57: Heart Rate Monitor Testing

ANSI/AAMI EC57 is the primary US standard for cardiac rate and rhythm monitors. It covers:

  • Electrocardiograph monitors (ECG)
  • Pulse rate monitors (including PPG-based devices)
  • Arrhythmia detection devices

Accuracy Requirements for Heart Rate

EC57 specifies that heart rate monitors must achieve within ±5 BPM or ±5% (whichever is larger) of the true heart rate across the range 20-300 BPM under standardized conditions. The standard defines test signals and reference methods.

For PPG-based devices, EC57 requires testing against ECG reference across defined signal types and rates. This is a lower bar than many clinical users expect — ±5 BPM at 100 BPM means ±5% error is acceptable. At 50 BPM resting, ±5 BPM represents ±10% relative error.

Motion Testing in ANSI/AAMI

EC57 specifies motion test conditions using a standardized motion fixture that moves the device at defined frequencies and amplitudes. The test is designed for bedside/hospital use cases (patient turning in bed, arm repositioning) rather than vigorous exercise.

This matters for consumer wearable comparisons: devices validated against EC57 motion testing may not perform within specification during running or HIIT, where motion profiles differ substantially from the standardized test.

IEEE 11073 Standards for Wearable Health Data

The IEEE 11073 Personal Health Device (PHD) standard family governs data communication for medical and health monitoring devices. Key sub-standards for PPG wearables:

IEEE 11073-10417: Pulse oximeter device specialization. Defines data types, encoding, and communication protocols for pulse oximeter devices in point-of-care and connected health environments.

IEEE 11073-10406: Basic electrocardiograph (for devices combining ECG + PPG).

IEEE 11073-10420 (2020): Wearable cardiovascular fitness device specialization. Addresses activity monitors, heart rate monitors, and wearable biosensors for wellness/fitness applications. This is the standard most relevant to consumer smartwatches and fitness bands — it defines how heart rate, HRV, and step count data should be structured and communicated.

IEEE 11073 compliance is increasingly required for healthcare interoperability — wearable data flowing into EHR systems via HL7 FHIR often references IEEE 11073 data semantics.

FDA Guidance on PPG-Based Wearables

The FDA has issued several guidance documents specifically addressing PPG wearables:

2020 De Novo Decision: Apple Watch Irregular Rhythm Notification: Established a regulatory pathway for PPG-based irregular heart rhythm detectors, creating a "new type" device classification. The predicate framework now allows subsequent submitters to use this decision as a starting point.

2022 Digital Health Center of Excellence Guidance: Clarifies the wellness/medical device boundary for heart rate and SpO2 monitoring, providing specific language on what claims move a device from wellness to medical.

2022 Pulse Oximeter Accuracy and Limitations Guidance: Addresses consumer device performance gaps identified during COVID-19, calling for:

  • Better labeling of accuracy limitations across skin tones
  • Clear statements about the range of saturation values where accuracy is validated
  • Discouragement of using consumer wellness devices for clinical monitoring decisions

The Voluntary vs. Mandatory Divide

One important nuance: many relevant standards (ISO 80601-2-61, ANSI/AAMI EC57) are voluntary in the US. The FDA references them as "recognized consensus standards" — following them creates a presumption of safety and effectiveness, but is not legally required unless FDA makes it mandatory for a specific device type.

For 510(k) submissions, the FDA recommends following recognized consensus standards. Deviating from them is permissible with adequate justification — but in practice, deviating from ISO 80601-2-61 for a medical pulse oximeter would face substantial regulatory scrutiny.

Designing a Good PPG Validation Study

For researchers validating a new PPG wearable, the emerging best practices draw on ISO 80601-2-61 methodological principles even for non-SpO2 applications:

For Heart Rate Validation

  1. Reference standard: ECG (12-lead or Holter) for every study. Chest strap ECG (Polar H10) is acceptable for exercise studies.
  2. Subjects: Minimum 30 subjects, diverse skin tones, both sexes, BMI range, age range relevant to intended use
  3. Activity protocol: Rest, light walking, moderate exercise (60-70% max HR), vigorous exercise (>80% max HR), and recovery transitions
  4. Statistical reporting: MAE, MAPE, RMSE, Bland-Altman plot, 95% limits of agreement, data count per activity level
  5. Temporal alignment: Careful synchronization of wearable and ECG timestamps — even 500 ms misalignment significantly affects IBI comparison

For HRV Validation

  1. Reference: ECG-derived RR intervals (not chest strap PPG-derived IBI)
  2. IBI agreement metrics: MAE (ms), RMSE (ms), false positive/negative beat detection rates
  3. HRV metrics comparison: RMSSD, SDNN, pNN50, LF/HF power via frequency domain analysis — compare device-derived versus ECG-derived values
  4. Artifact handling: Specify ectopic beat rejection strategy in both reference and device algorithms

For SpO2 Validation

Follow ISO 80601-2-61 methodology: induced hypoxia via nitrogen dilution (in ethics-approved controlled settings), co-oximetry reference, ≥200 data pairs across 70-100%, ≥15% dark skin tones.

For wearable-specific validation at normal saturation ranges (90-100%), alternative protocols using exercise (natural desaturation to ~92-94%) or altitude chambers are used when ethics committees restrict induced hypoxia.

The Evolving Skin Tone Equity Standards

Perhaps the most significant ongoing change in PPG device standards is the skin tone equity requirements. The 2023 FDA guidance and updated ISO committee drafts require:

  • Quantitative skin tone measurement (Melanin Index or Individual Typology Angle, not subjective Fitzpatrick scale)
  • Mandatory dark skin tone representation in SpO2 validation cohorts
  • Reporting of accuracy stratified by skin tone group
  • Investigation of systematic skin-tone-related bias

This is a meaningful shift from pre-2022 practice, where most validation studies enrolled predominantly light-skinned volunteers and did not report skin-tone-stratified results.

Researchers designing new PPG validation studies should incorporate ITA (Individual Typology Angle) measurement of all subjects and plan for skin-tone-stratified analysis from the start.

Internal Resources

For related technical content, see PPG ISO standards overview, PPG signal quality assessment, PPG skin tone bias and accuracy, and clinical-grade vs consumer PPG wearables.

FAQ

What standard governs pulse oximeter accuracy in wearables? ISO 80601-2-61 is the primary international standard for pulse oximeter accuracy, requiring ARMS ≤3.5% across 70-100% SpO2 in diverse populations. The FDA references this standard in 510(k) submissions. Consumer wellness devices often position themselves outside this standard's scope to avoid its requirements.

Does ANSI/AAMI EC57 apply to consumer smartwatch heart rate monitors? ANSI/AAMI EC57 applies to heart rate monitors used for medical monitoring. Consumer smartwatches with wellness-only heart rate claims are generally not required to meet EC57. However, manufacturers seeking to make clinical claims (e.g., AFib screening clearance) must meet the appropriate performance standard as part of FDA 510(k) or De Novo submission.

What is the FDA De Novo pathway for PPG devices? The De Novo pathway allows novel device types to receive FDA clearance without an existing predicate device. Apple Watch's irregular rhythm notification used this pathway in 2018 (De Novo K172947), creating a new device type that subsequent applicants can use as a predicate for 510(k) submissions. It typically takes 12-24 months and requires a performance study meeting FDA-specified accuracy thresholds.

How should researchers report PPG wearable validation data? Best practice includes: device model + generation + firmware version, subject demographics (n, age range, sex, BMI, skin tone distribution), reference standard method, activity protocol details, synchronization method, statistical metrics (MAE, RMSE, Bland-Altman), and data quantity at each activity level. Skin-tone-stratified accuracy reporting is increasingly expected in peer review.

Are there standards specifically for wearable HRV measurement? There are guidelines (not formal standards) from the Task Force of the European Society of Cardiology and North American Society of Pacing and Electrophysiology (1996, updated by subsequent expert groups) that define HRV metrics. For wearable PPG-derived HRV, no dedicated ISO standard exists. Researchers typically follow the consensus guidelines for HRV metric definitions while acknowledging the additional measurement uncertainty introduced by PPG-derived IBI compared to ECG-derived RR intervals.

What is the difference between FDA cleared and FDA registered for a pulse oximeter? FDA clearance (510(k), De Novo, or PMA) means the device has undergone premarket review and demonstrated safety and effectiveness for its claimed indication. FDA registration simply means the manufacturer has registered their production facility — it involves no device performance evaluation. "FDA registered" on marketing materials for a consumer pulse oximeter says nothing about the device's accuracy or validation status.