Skip to main content
Have a personal or library account? Click to login
Essential Tremor and Digital Biomarkers: A Scoping Review Using the TRACE Framework to Map Readiness for Clinical Trials and Routine Practice Cover

Essential Tremor and Digital Biomarkers: A Scoping Review Using the TRACE Framework to Map Readiness for Clinical Trials and Routine Practice

Open Access
|Jun 2026

Full Article

Introduction

Essential Tremor (ET) is the most prevalent movement disorder in adults [1, 2, 3], yet it remains substantially underrepresented in biomarker and digital health research relative to Parkinson’s disease (PD) [4, 5]. Characterised by kinetic and postural tremor [3], ET presents specific challenges for longitudinal monitoring: symptoms fluctuate with fatigue, emotional state, caffeine, and medication timing [6]; traditional clinical rating scales are episodic, subjective, and insensitive to the intra-individual variability that is central to the patient experience [7, 8]. In addition, many digital biomarker studies to date have enrolled ET patients primarily as a comparator group for PD rather than as the primary population of interest [9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19]. As novel pharmacological agents and neuromodulatory interventions enter clinical development for ET [20, 21], objective sensor-derived tremor metrics offer a promising alternative to subjective clinical scales, but their validation for use as trial endpoints or in routine practice remains incomplete [22, 23].

Wearable inertial sensors, digitised drawing tasks, computer vision, surface EMG, and acoustic voice analysis have each been applied to tremor quantification in ET. However, technical feasibility in a supervised clinic setting does not automatically confer utility as a clinical trial endpoint: a metric must also demonstrate ambulatory validity, temporal stability across days, sensitivity to clinically meaningful change, and interpretability in terms of patient experience before it can serve as a primary or secondary outcome measure in a longitudinal interventional study [23, 24].

Given the methodological heterogeneity of the field and our aim to map evidence breadth across multiple sensing modalities, rather than synthesizing effect sizes, we conducted a scoping review [25]. This scoping review has three objectives: to map the landscape of digital biomarkers in ET by sensor modality and context of use; to evaluate their clinical trial readiness using the TRACE framework, a novel five-tier validation maturity model developed for this review; and to identify the critical evidence gaps and propose a roadmap for advancing the most promising modalities toward deployment as endpoints in future ET trials and routine clinical practice.

Methods

Study Design and Registration

This scoping review was conducted in accordance with PRISMA-ScR [25]. A study protocol was developed a priori by consensus among all authors, defining eligibility criteria, search strategy, data charting fields, and tier assignment rules prior to screening. The protocol was not prospectively registered. No ethical approval was required.

Eligibility Criteria

Studies were eligible if they met all the following: publication from 1 January 2000 onwards; original peer-reviewed research reporting quantitative digital measurement of tremor in human participants; inclusion of ET patients as a primary population or identifiable subgroup of ten or more participants; use of a digital sensor-based technology; and availability in English. Studies in which ET patients served as a comparator group remained eligible provided the ET subgroup comprised ten or more participants with extractable ET-specific data; such studies were flagged accordingly. Digital biomarkers were operationally defined as quantitative tremor metrics derived from body-worn inertial sensors, accelerometers, gyroscopes, goniometers, digitising tablets, surface electromyography, computer vision or smartphone camera systems, or acoustic recording devices, deployed for the purpose of tremor assessment or monitoring. Studies using these technologies solely for neurophysiological characterisation of tremor mechanisms without reporting extractable clinical measurement endpoints were excluded.

Studies were excluded if they were reviews, editorials, book chapters, conference abstracts without full data, or case reports; if they focused exclusively on non-digital biomarkers; or if published prior to 2000. Studies evaluating devices solely for tremor suppression were excluded unless extractable tremor signal metrics were reported before or after intervention, in which case they were included and flagged as therapeutic context studies.

An a priori exception to the ten-participant threshold was specified for repeated-measures or interventional designs in which the number of independent observations substantially exceeded the number of participants, provided the design was pre-specified. Full eligibility criteria and decision logic for borderline cases are provided in Supplementary File 1.

Information Sources and Search Strategy

Searches were conducted across PubMed, Web of Science, Scopus, and IEEE Xplore; the last was included to capture the engineering and signal processing literature systematically underrepresented in biomedical databases. Searches were limited to 1 January 2000 to August 2025; the final search was executed on 8 September 2025. Search terms combined Essential Tremor and digital assessment technology domains using Boolean operators, deliberately excluding clinical validation or biomarker performance terms to avoid overly restrictive filtering. Reference lists of all included studies and relevant prior reviews were hand-searched. In response to peer review, a supplementary search was conducted on 27 April 2026 across the same four databases using expanded sensor-specific terminology (accelerometry, electromyography, goniometry, and related MeSH terms) to capture studies indexed under device-specific rather than digital biomarker headings. The supplementary search was restricted to the same date window (1 January 2000 to August 2025) as the original search. Full search strings are provided in Supplementary File 2.

Study Selection

Screening was conducted in two sequential stages. DL screened all titles and abstracts, retaining uncertain records for full-text review. DL then performed full-text assessment with reasons for exclusion documented. Borderline decisions were reviewed by AM or NP, with final determination by discussion. Study selection is summarised in the PRISMA-ScR flow diagram (Figure 1).

Figure 1

PRISMA-ScR flow diagram for the scoping review of digital biomarkers in Essential Tremor.

Records were identified through systematic searching of four electronic databases and supplemented by manual reference list searching of included studies and relevant reviews. Deduplication was performed prior to title and abstract screening. Full-text articles were assessed against pre-specified eligibility criteria; reasons for exclusion are reported. The total number of unique studies included in the scoping review is 165. PRISMA-ScR, Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews; ET, Essential Tremor; IEEE, Institute of Electrical and Electronics Engineers.

Data Charting

A structured 27-field data charting instrument was developed a priori, covering study identity, participant characteristics, technology characterisation, context of use, outcome metrics, and TRACE tier evidence variables. It is provided in full as Supplementary File 3. Unreported fields were recorded explicitly; absence of reporting is itself informative in a field where many validation gaps are gaps of omission rather than negative findings. Data charting was performed by DL across all 165 studies. AM and NP independently reviewed all studies assigned Tier 3 or above, and all Tier 2 studies flagged as Tier 2+; disagreements were resolved by discussion. Consistent with scoping review methodology, formal critical appraisal was not performed; TRACE tier assignment captures validation maturity, not methodological rigour.

The TRACE Framework

To evaluate the clinical trial readiness of identified digital biomarkers, we developed the TRACE (Technology Readiness And Clinical Evidence) framework, a five-tier validation maturity model in which each tier corresponds to a discrete and progressively more demanding domain of evidence. Existing frameworks, most notably the Digital Medicine Society V3 model, provide a strong foundation for sensor verification and analytical validation but group all clinical evidence into a single stage [23]. A key translational bottleneck, the’ Ecological Gap’, exists between technologies that demonstrate correlation with clinical rating scales in a controlled setting and those with proven utility in longitudinal, ambulatory, real-world contexts. TRACE addresses this by deconstructing the clinical validation phase into two distinct tiers, Referenced validity and Ambulatory utility, and extending the pathway to encompass the operational and economic evidence required for integration into clinical trials and ultimately clinical practice.

The five tiers are defined as follows. Tier 1 (Technical Verification) requires bench or phantom testing demonstrating sensor accuracy against a mechanical or electrical reference standard, independent of patient data. Tier 2 (Referenced Clinical Validity) requires demonstration of correlation with an established tremor rating scale, such as The Essential Tremor Rating Scale (TETRAS), the Fahn-Tolosa-Marin Tremor Rating Scale (FTM-RS)/the Clinical Rating Scale for Tremor (CRST) [26, 27], or reliable discrimination between ET and a comparator group, in a controlled clinical or laboratory setting; both confirm that the signal captures clinically meaningful tremor variation under supervised conditions. Tier 3 (Ambulatory and Longitudinal Utility) requires evidence of sensor performance in a home or ambulatory setting across at least two separate calendar days, with demonstration of temporal stability, test-retest reliability, or sensitivity to change over time. Tier 4 (Clinical Trial Readiness) requires the digital metric to be pre-specified as a primary or secondary endpoint in a registered trial, with operational delivery demonstrated through adherence and data completeness reporting. Tier 5 (Economic and Implementation Readiness) requires formal evidence of patient acceptability alongside at least one of the following: validated health economic modelling or demonstrated integration into a reimbursed clinical workflow. Although less immediately applicable than the preceding tiers, its relevance will grow as metrics validated at Tier 4 begin transitioning from trial endpoints toward tools for guiding treatment decisions in routine practice. Each study was assigned to the highest tier for which all core criteria were satisfied. TRACE was deliberately calibrated to the current maturity of the ET field; formal FDA or EMA qualification, which represents post-hoc endorsement of an established evidence base rather than a prerequisite for trial deployment, constitutes the strongest possible evidence at Tier 4 but is not required for assignment. Studies meeting Tier 2 criteria through multi-visit or test-retest clinic designs with calendar-separated sessions but without home deployment were flagged as Tier 2+; Tier 3 studies additionally demonstrating operational features of Tier 4 readiness, such as multi-site data collection or quantified adherence, without a pre-registered digital endpoint were flagged as Tier 4 signal. Full tier definitions, core criteria, the Tier 2/3 boundary decision flowchart, and decision logic for borderline cases are provided in Supplementary File 1.

Synthesis

Results are presented as a narrative evidence map by sensor modality and TRACE tier. No meta-analysis was performed given the heterogeneity of technologies, signal processing approaches, outcome metrics, and study designs. Quantitative summaries of study characteristics are presented alongside modality-specific narrative subsections and a structured endpoint readiness synthesis drawing on TRACE tier assignments and binary validation flags.

Results

Study Selection

The original database search identified 2,789 records with a further 53 from citation hand-searching. After deduplication and screening, 64 studies were included. The supplementary search retrieved 2,127 records; after deduplication and removal of overlap with the original search, 1,007 new unique records were screened, of which 102 met eligibility criteria. After reconciliation of one overlapping dataset, the combined total was 165 studies. The complete screening pathway and exclusion reasons are reported in Figure 1.

Characteristics of Included Studies

The 165 included studies span 2000 to 2025, with publication volume accelerating markedly from 2018 onwards (62 studies before 2018; 103 from 2018 onwards), reflecting broader expansion of wearable and mobile sensing technology. Studies originated from 35 countries; the United States contributed the largest share (n = 49), followed by Germany (n = 15), Spain (n = 14), China (n = 11), and the Czech Republic (n = 7), with the remainder distributed across Europe, Asia, Australasia, the Middle East, and the Americas.

By sensing modality, wearable inertial measurement units (IMUs) and smartwatches account for 114 studies (69%), digitised handwriting and drawing for 16 (10%), computer vision for 15 (9%), surface EMG for 10 (6%), acoustic voice analysis for six (4%), and gait wearables for four (2%). Twenty-four studies deployed sensors spanning two modality categories and are listed under their primary modality. Across all modalities, most tasks and sensor placements target postural or kinetic upper-limb tremor, the cardinal manifestation of ET. Voice tremor (n = 6), gait disturbance (n = 4), and head tremor are sparsely represented, leaving a substantial portion of the ET clinical spectrum without comparable digital characterisation. Study design was predominantly single-session (122 studies); RCTs accounted for 15, multi-visit clinic designs for 14, longitudinal designs for 11, and test-retest designs for three. Studies enrolling ET as a comparator group or within mixed cohorts, and studies conducted in a therapeutic context, are flagged in Table 1.

Table 1

Characteristics of included studies (n = 165).

AUTHOR, YEARSTUDY DESIGNET nDEVICE/PLATFORMCONTEXTCLINICAL ANCHORTRACE TIERNOTES/FLAGS
Wearable IMUs (n = 114)
Bilodeau, 2000 [50]Longitudinal13Miniature accelerometer, force transducer, and surface EMGLab/clinicBFRSTier 2Therapeutic context
Louis, 2000 [51]Single-session19Modified Klove-Matthews Motor Steadiness Battery (Groove-Type Steadiness Tester and Nine-Hole Steadiness Tester by Lafayette Instrument, with impulse counters) and Quantitative Computerized Tremor Analysis Platform using triaxial accelerometers and surface EMGHybridwTRSTier 2
Ondo, 2000 [52]RCT25Unnamed triaxial accelerometerLab/clinicBFRSTier 2Therapeutic context
Wharrad, 2000 [53]Single-session20Bruel and Kjaer type 4367 piezo-electric accelerometerLab/clinicNoneTier 2
O’Suilleabhain, 2001 [54]Single-session283Space Fastrak (Polhemus, Inc. Colchester, VT)Lab/clinicClinician visual estimateTier 2Contains Tier 1 component
Obwegeser, 2001 [55]Multi-visit31MM-1 Movement Monitor (Axon Instruments, Foster City, CA)Lab/clinicFTM-TRSTier 2ET subgroup. Therapeutic context
Brennan, 2002 [56]Single-session31Ultra-light piezoresistive miniature accelerometers (±25 g, 0.5 g)Lab/clinicwTRSTier 2
Gironell, 2002 [57]RCT10Single-plane accelerometer transducer (Grass Instruments Division, Astro-Med Inc.)Lab/clinicTCRSTier 2Therapeutic context
Zeuner, 2003 [58]Single-session11Four-gram triaxial piezo-resistive accelerometers (Kistler Instrument Corp.) and tin surface EMG electrodesLab/clinicNoneTier 2ET comparator
Caligiuri, 2004 [59]Test-retest35Tremorometer (FlexAble Systems, Inc.) comprising a pair of dual-axis accelerometersLab/clinicUPDRSTier 2Contains Tier 1 component
Gironell, 2004 [60]Single-session223Piezoresistive single-plane Grass SPA accelerometer transducer and silver-silver chloride surface EMG electrodesLab/clinicNoneTier 2
Ushe, 2004 [61]Single-session16T1 Tremor Analysis System (Neurokinetics, Edmonton, Alberta, Canada)Lab/clinicNoneTier 2Therapeutic context
Bushara, 2005 [62]RCT24FlexAble-Systems triaxial accelerometerLab/clinicBFRSTier 2Therapeutic context
Frima, 2006 [63]RCT10ENTRAN Egax monoaxial accelerometer with Cardiff tremor acquisition and system analysisLab/clinicNoneTier 2Therapeutic context
Gironell, 2006 [64]RCT16Tri-axial accelerometer transducer (Biopac Systems, Inc.)Lab/clinicTCRSTier 2Therapeutic context
Herzog, 2007 [65]Single-session10Piezoelectric accelerometer, CED 1401 system for EMG, and 3SPACE FASTRAK magnetic six-degree-of-freedom measurement system (Polhemus Inc)Lab/clinicFTM-TRSTier 2Therapeutic context
Zesiewicz, 2007 [66]RCT20Catsys System (portable PC-based test system with an accelerometry device, Danish Product Development, Ltd)Lab/clinicFTM-TRSTier 2Therapeutic context
Zesiewicz, 2007 [67]RCT22Catsys System (portable PC-based test system with an accelerometry device, Danish Product Development, Ltd)Lab/clinicFTM-TRSTier 2Therapeutic context
Shaikh, 2008 [9]Single-session35Unnamed three-axis accelerometerLab/clinicNoneTier 2ET comparator
Matsumoto, 2009 [68]Single-session.17Unnamed custom system comprising an HP Compaq tc4400 tablet PC and JCM AW-100(RX) 3-axis accelerometersLab/clinicClinician visual estimateTier 2ET comparator
Costa, 2010 [69]Single-session18Uniaxial accelerometer (ADXL105, Aircraft Medical/Morpheus Medical)Lab/clinicFTM-TRS; WEMOVE tremor scaleTier 2ET subgroup
Mostile, 2010 [70]Single-session20Kinesia (CleveMed)Lab/clinicTETRASTier 2
Heldman, 2011 [71]Single-session10Kinesia (CleveMed)Lab/clinicwTRSTier 2
Muthuraman, 2011 [72]Single-session41Unnamed accelerometerLab/clinicNoneTier 2ET comparator
Šprdlík, 2011 [73]Single-session30MTx (Xsens)Lab/clinicFTM-TRSTier 2
Uchida, 2011 [74]Single-session11Unnamed biaxial micro-accelerometer (recorded via NEC Sanei Charge Amplifiers and C-LOGGER software)Lab/clinicClinician visual estimateTier 2ET subgroup
de Haas, 2012 [75]RCT9Three miniature linear piezo-electric accelerometers (Nihon Kohden, MT-3T) combined with Grass 15LT surface EMGLab/clinicPerformance-based tremor evaluationTier 2Therapeutic context
Hossen, 2014 [10]Single-session41Unnamed system comprising a 2-gram piezoelectric accelerometer and bipolar surface-EMG with silver-silver-chloride electrodesLab/clinicNoneTier 2ET comparator
Wastensson, 2013 [76]Single-session22CATSYS Tremor Pen and eurythmokinesimeter (EKM)Lab/clinicFTM-TRSTier 2Therapeutic context
Bhidayasiri, 2014 [77]Single-session10Unnamed low-cost 3-dimension inertial sensor prototypeLab/clinicFTM-TRSTier 2ET subgroup
Gironell, 2014 [78]RCT10Unnamed accelerometry systemLab/clinicTCRSTier 2Therapeutic context
Pathak, 2014 [79]Single-session15Active Cancellation of Tremor (ACT) device prototype (Lift Labs)Lab/clinicFTM-TRSTier 2Therapeutic context
Pulliam, 2014 [28]Test-retest20Kinesia HomeView (Great Lakes NeuroTechnologies)HomeNoneTier 3*Tier 4 signal
Ruonala, 2014 [80]Single-session17ME6000 biosignal monitor with Medicotest M-00-S Ag/AgCl surface electrodes and MEAC-X triaxial accelerometersLab/clinicNoneTier 2ET subgroup
Wile, 2014 [12]Single-session14WIMM One smartwatch (and ENTRAN EGAS analog accelerometer for Tier 1 comparison)Lab/clinicClinician visual estimateTier 2ET comparator; Contains Tier 1 component
Woods, 2014 [11]Single-session18HTC Desire model 8181 smartphone (Bosch BMA150 Triaxial digital acceleration sensor)Lab/clinicNoneTier 2ET comparator
Heo, 2015 [81]Single-session183-D gyrosensor (L3G4200D, STMicroelectronics)Lab/clinicNoneTier 2
Thanawattano, 2015 [13]Single-session226-DOF inertial measurement unit (IMU) with an in-house transmission unit and softwareLab/clinicNoneTier 2ET comparator
Atashzar, 2016 [82]Single-session13Biometrics Ltd. kinematic measurement systemLab/clinicNoneTier 2
Chockalingam, 2016 [83]Single-session13Lift Pulse (smartphone application on iOS and Android devices). The study utilized a “smart hat” (a modified baseball cap with a smartphone case) to measure head tremor via the smartphone’s accelerometersLab/clinicFTM-TRSTier 2
Ghassemi, 2016 [14]Single-session11Schwarzer Topas EMG system (Natus, USA) integrating two calibrated accelerometers and bipolar Ag/AgCl surface EMG electrodesLab/clinicNoneTier 2ET comparator
Barrantes, 2017 [84]Single-session16iPhone 5S (using SensorLog application)Lab/clinicNoneTier 2
Chakraborty, 2017 [85]Single-session34Xsens MTx integrated inertial measurement unitsLab/clinicNoneTier 2
di Biase, 2017 [15]Single-session33APDM Opal triaxial accelerometer (test cohort, Rome); Brainvision acceleration sensor (test cohort, Cologne); multiple devices in validation cohort (previously published datasets)Lab/clinicNoneTier 2ET comparator
Samotus, 2017 [82]Longitudinal10Biometrics Ltd. goniometers (SG150) and torsiometer (Q150) with a Noraxon TeleMyo 2400T transmitterLab/clinicFTM-TRSTier 2Therapeutic context
Vittal, 2017 [29]RCT40Kinesia system (Great Lakes NeuroTechnologies)HomewTRS; CGI-s and CGI-c.Tier 3*Tier 4 signal; Therapeutic context
Zheng, 2017 [30]Test-retest8Pebble smartwatch and Android smartphoneHybridFTM-TRSTier 3 *Feasibility-grade Tier 3 (n < 15); Repeated-measures exception
Bove, 2018 [86]Single-session20Triaxial accelerometers (SOMNOwatch; SOMNOmedics, Randersacker, Germany)Lab/clinicNoneTier 2ET subgroup
De Jesus, 2018 [87]Single-session11Kinesia (Great Lakes NeuroTechnology) and GaitRite walkway systemLab/clinicFTM-TRSTier 2Therapeutic context
Jombík, 2018 [88]Single-session133Unnamed custom-made miniature tri-axial accelerometer (two ADXL320 dual-axis analog devices embedded in synthetic resin)Lab/clinicNoneTier 2ET comparator
López-Blanco, 2018 [38]Multi-visit34NetMD system (Sony Smartwatch 3 paired with an ASUS Android smartphone via an investigational Android Wear OS application)Lab/clinicFTM-TRSTier 2Tier 2+
Molparia, 2018 [31]Longitudinal24Pebble Smart-watch paired with LG G2 smartphone running the Fox Insight mobile application/Intel Pharma Analytics PlatformHomeNoneTier 3*ET comparator
Berbakov, 2019 [89]Single-session78Node+ sensor platform (paired with TremorSense Android application)Lab/clinicNoneTier 2ET subgroup
Kroneberg, 2019 [90]Single-session12Mobility Lab (APDM)Lab/clinicFTM-TRSTier 2ET subgroup
Lora-Millán, 2019 [91]Single-session18TechMCS inertial sensors (Technaid, SP)Lab/clinicFTM-TRSTier 2Therapeutic context
Pan, 2019 [92]Single-session20Smartwatch (specific brand unnamed)Lab/clinicFTM-TRSTier 2
Paschen, 2019 [93]longitudinal20Viking, Nicolet EDX System (Natus Medical Inc/Fa. Jäger–Tönnies GmbH/VIASYS Healthcare Inc)Lab/clinicFTM-TRSTier 2Therapeutic context
Zheng, 2019 [94]Single-session20Portable human movement monitoring system (smartwatch with Android smartphone) utilizing an IoTA blockchain ledgerLab/clinicFTM-TRSTier 2
Bruno, 2020 [95]Single-session10Kinesia motion sensor (Great Lakes NeuroTechnologies Inc.)Lab/clinicTETRAS; FTM-TRSTier 2Therapeutic context
Casamento-Moran, 2020 [96]Single-session29Trigno Wireless Sensors (Delsys) and custom low-friction potentiometers (Mouser Electronics)Lab/clinicFTM-TRS; TETRASTier 2Therapeutic context
Loaiza Duque, 2020 [97]Single-session20Built-in triaxial gyroscope of an iPhone 5S using the SensorLog applicationLab/clinicNoneTier 2ET subgroup
Jog, 2020 [98]RCT30TremorTek investigational deviceLab/clinicFTM-TRSTier 2Therapeutic context
Isaacson, 2020 [21]RCT263Cala Health wrist-worn TAPS neuromodulation deviceHomeTETRASTier 4**Therapeutic context
Jombík, 2020 [88]Single-session75Custom-made tri-axial linear accelerometer (made from two ADXL320 dual-axis analog devices)Lab/clinicFTM-TRSTier 2
Kim, 2020 [99]Single-session9Custom wireless wearable tremor modulation system (wrist device with LSM303D accelerometer and constant voltage stimulator)Lab/clinicTETRASTier 2
Kwon, 2020 [100]Single-session18Unnamed wearable gyro-sensor based measurement systemLab/clinicFTM-TRSTier 2
McGurrin, 2020 [101]Single-session13APDM Opal inertial sensorsLab/clinicTETRASTier 2Contains Tier 1 component
Robertson, 2020 [102]Single-session18ETSense™ inertial sensor tremorography device along with Kinesia Home View™Lab/clinicFXTAS-RSTier 2ET comparator
van Brummelen, 2020 [103]Single-session10Seven Consumer Product Accelerometers (Apple iPhone 7, Apple iPod Touch 5, Apple Watch 2, Huawei Nexus 6P, Huawei Watch, mbientlab MetaWear watch, mbientlab MetaWear clip) compared against a reference Laboratory-Grade Accelerometer (Biometrics ACL300)Lab/clinicNoneTier 2ET subgroup
Wilkes, 2020 [104]Single-session20Delsys Trigno Wireless SystemLab/clinicTETRAS; FTM-TRSTier 2Therapeutic context
Yu, 2020 [105]Single-session15Tri-axial accelerometer (APDM Wearable Technologies, Portland, OR)Lab/clinicFTM-TRSTier 2Therapeutic context
Zajki-Zechmeister, 2020 [106]Single-session16TREMITAS-System (TREM) pen-shaped sensorLab/clinicTETRASTier 2ET subgroup
Fuchs, 2021 [107]Single-session20iPhone 5s running the TREMOR12 applicationLab/clinicFTM-TRSTier 2
Kwon, 2021 [39]Single-session18Triaxial gyro sensors (L3G4200D, STMicroelectronics)Lab/clinicFTM-TRSTier 2
Papapetropoulos, 2021 [108]RCT95Kinesia ONELab/clinicTETRASTier 2Therapeutic context
Pokhabov, 2021 [109]Single-session30Kolibri wireless electrophysiological signal monitoring systemLab/clinicNoneTier 2ET subgroup
Brillman, 2022 [32]Longitudinal216Cala Trio (Cala Health)HomeNoneTier 3*Tier 4 signal; Therapeutic context
Everlo, 2022† [110]Single-session90Brain RT software utilizing accelerometers and surface electromyography (polymyography)Lab/clinicClinician visual estimateTier 2ET subgroup
Gauthier-Lafreniere, 2022 [111]Single-session25GENEActiv Original wristwatch (Activinsights, UK) and a custom MATLAB applicationLab/clinicCRSTTier 2
Kovalenko, 2022 [112]Single-session13SensorTile wearable platform (STMicroelectronics) and Logitech BRIO 4k video cameraLab/clinicnoneTier 2ET comparator
Ma, 2022 [113]Single-session98Unnamed miniature IMU device (utilizing a QFN packaged multi-chip MPU-9250)Lab/clinicCRSTTier 2
Ma, 2022 [114]Single-session54IMU-based wearable device (JY901, BMI160)Lab/clinicCRSTTier 2
McGurrin, 2022 [33]Longitudinal13Unnamed wearable inertial sensorsHybridTETRASTier 3*Feasibility-grade Tier 3
Ni, 2022 [115]Single-session20Unnamed smartwatch embedded with a tri-axis accelerometerLab/clinicFTMTRSTier 2
Purrer, 2022 [116]Multi-visit37Dantec Keypoint.NET utilizing uniaxial accelerometry and surface electromyographyLab/clinicCRSTTier 2Tier 2+
Sahin, 2022 [117]Single-session17Apple iPhone 8 smartphone (using Medotemic’s app Medoclinic) and LPMS-B2 STD high-resolution miniature inertia sensorLab/clinicFTM-TRSTier 2ET subgroup
Xing, 2022 [118]Single-session141Dantec Keypoint G4 (Natus Medical Inc.), combining distal finger accelerometers and forearm surface electromyographyLab/clinicNoneTier 2ET comparator
Ali, 2023 [119]Single-session17Delsys Trigno wearable IMULab/clinicFTM-TRSTier 2
Dai, 2023 [37]RCT276Cala Trio (Cala Health)HomeBFRSTier 4**Therapeutic context
Li, 2023 [120]Single-session12Unnamed wearable multi-sensor measurement system (utilizing MPU9250 units)Lab/clinicMDS-UPDRSTier 2ET comparator
Lin, 2023 [17]Single-session80MATRIX (GYENNO SCIENCE)Lab/clinicNoneTier 2ET comparator
Loaiza Duque, 2023 [121]Single-session76TremorSoft app (using built-in smartphone 6-axis inertial sensors or an Xsens DOT wearable sensor)Lab/clinicNoneTier 2ET subgroup; Tier 4 signal.
Lu, 2023 [34]Longitudinal808Cala Trio™ (Cala Health)HomeNoneTier 3*Therapeutic context; Tier 4 signal
Metzner, 2023 [122]Single-session20Trigno IM Sensors and Trigno EMG Sensors (Delsys Inc.)Lab/clinicTETRASTier 2Therapeutic context
Pascual-Valdunciel, 2023 [123]Single-session12Technaid S.L IMUs and Quattrocento OT Bioelettronica bio-signal amplifierLab/clinicNoneTier 2
Piarroux, 2023 [124]Single-session18Unnamed polymyographic system (four-way surface electromyogram coupled with accelerometry)Lab/clinicNoneTier 2None
Smid, 2023 [125]Single-session13MMA8452Q tri-Axis accelerometer (Freescale Semiconductor, Inc.) with LabVIEWLab/clinicFTM-TRSTier 2None
van der Linden, 2023 [126]Single-session16BrainProducts triaxial accelerometer (FingerACC) and GENEActiv Original wristwatch accelerometer (WristACC)Lab/clinicMDS-UPDRSTier 2ET subgroup; Therapeutic context
Vescio, 2023 [127]Single-session40RT-Ring (utilizing an ST Microelectronics LSM6DSL 6-axis IMU)Lab/clinicNoneTier 2ET subgroup
Ameer, 2024 [128]Single-session45Unidirectional piezoelectric accelerometer analyzed via Natus KEYPOINT.NET Software v. 2.40Lab/clinicTETRASTier 2
Cabral, 2024 [129]Single-session18Custom wearable device (incorporating inertial measurement units and piezoelectric actuators)Lab/clinicFTM-TRSTier 2Therapeutic context
Hollý, 2024 [130]Single-session40MTw Awinda triaxial accelerometers (Xsens, the Netherlands)Lab/clinicTETRASTier 2
Hubená, 2024 [131]Single-session25MTw Awinda (Xsens, the Netherlands)Lab/clinicTETRASTier 2ET comparator
Tang, 2024 [132]Single-session53Dantec Keypoint signal acquisition systemLab/clinicFTM-TRSTier 2ET subgroup
Zhang, 2024 [133]Single-session20Unnamed smartwatch monitoring systemLab/clinicFTM-TRSTier 2
Aladro, 2025 [35]Longitudinal12Unnamed portable human movement monitoring system (comprising a smartwatch with a tri-axis accelerometer and a paired Android smartphone app)HomeFTM-TRSTier 3*Feasibility-grade Tier 3; Therapeutic context
Bártová, 2025 [134]Multi-visit12Unnamed accelerometer and camera setup (Note: A URIS I neuromodulation system was used for the therapeutic intervention)Lab/clinicTETRASTier 2Therapeutic context
Buonocore, 2025 [18]Single-session21RT-ringLab/clinicNoneTier 2ET comparator
Häring, 2025 [135]Single-session188Unnamed triaxial and monoaxial accelerometersLab/clinicNoneTier 2
Mugge, 2025 [136]Single-session24MTw Awinda (Xsens Technologies B.V.)Lab/clinicTETRASTier 2Therapeutic context
Purrer, 2025 [137]Multi-visit35SOMNOwatch™ plus® (SOMNOmedics, Randersacker, Germany)Lab/clinicCRSTTier 2ET subgroup. Therapeutic context
Samiee, 2025 [138]RCT88Wearable wristband (Pishgaman Rah Salamat Pars)Lab/clinicTETRASTier 2Therapeutic context
Tarlaci, 2025 [139]Single-session14Vibration Meter app (EXA Tools, Poland) installed on a mobile phone (Vestel Venus, Android 4.0)Lab/clinicBFRSTier 2Therapeutic context
Tsuboi, 2025 [140]Single-session27Triaxial accelerometers (Logical Product, Fukuoka, Japan)Lab/clinicCRSTTier 2ET subgroup
Digitised Handwriting (n = 16)
Haubenberger, 2011 [40]Single-session9Wacom Intuos 3 Model PTZ-930 digitizing tablet + Neuroglyphics softwareLab/clinicFTM-TRSTier 2Contains Tier 1 component
Louis, 2012 [141]Single-session145Wacom Intuos 3 digitizing tabletLab/clinicClinician visual estimateTier 2
Kragelj, 2014 [142]Single-session15Unnamed programmed graphical tablet with tablet pen/Computer Assisted Spirography (CAS) systemLab/clinicNoneTier 2
Elble, 2017 [41]Test-retest18Wacom Intuos 3 digitizing tabletLab/clinicFTM-TRSTier 2Tier 2+
Legrand, 2017 [143]Multi-visit13Wacom, Bamboo Fun Medium Pen&Touch digitising tabletLab/clinicBFRSTier 2Tier 2+
Schuhmayer, 2017 [144]Single-session40Neuroglyphics software on a Windows-based tablet-PC, and an unnamed uniaxial accelerometerLab/clinicTETRASTier 2
Tam, 2017 [145]Single-session12MRI-compatible touch tablet and stylusLab/clinicNoneTier 2
Kim, 2018 [146]Single-session161Wacom Intuos4 graphics tablet with a wireless inking penLab/clinicNoneTier 2
Lin, 2018 [147]Single-session12Wacom Cintiq 13HD graphic tablet with a custom-made computer programLab/clinicClinician visual estimateTier 2ET subgroup
Merchant, 2018 [148]Single-session19Wacom Intuos 2-4 graphics tablet with wireless inked penLab/clinicFTM-TRSTier 2
Sanderson, 2020 [19]Single-session12Touchscreen tablet (iPad running iOS v.11.4 with a custom Swift application) paired with a pressure-sensing stylusLab/clinicNoneTier 2ET comparator
López-Blanco, 2021 [149]Single-session31BQ Aquaris E4.5 Android smartphone running an under-development Android applicationLab/clinicFTM-TRSTier 2
Motin, 2021 [150]Single-session19Wacom Intuos Pro Large digital tablet with a pressure-sensor mounted ink-penLab/clinicFTM-TRSTier 2
Kim, 2022 [151]Single-session113D gyration mouse (Air Mouse Go Plus, Gyration Inc.) and a custom-designed wrist device (LSM303D three-axis accelerometer)Lab/clinicTETRASTier 2
Rajan, 2023 [43]Single-session25Unnamed in-house automated analysis algorithm processing images from a commercial scannerLab/clinicBFRS; FTM-TRSTier 2ET subgroup
Figura, 2024 [152]Multi-visit11Digitising tablet (Intuos series, Wacom) running custom-acquisition softwareLab/clinicFTM-TRSTier 2ET subgroup. Therapeutic context
Computer Vision (n = 15)
Deuschl, 2000 [153]Single-session26MacReflex version 3.2 passive infrared movement analysis system (Qualisys, Sweden)Lab/clinicFTM-TRSTier 2
Uhríková, 2011 [154]Single-session26TremAn (a software tool for computer analysis of video sequences) and Xsens MTx inertial measurement unitsLab/clinicFTM-TRSTier 2
Geiger, 2018 [155]Single-session10trakSTAR electromagnetic motion capture system (Ascension Technologies)Lab/clinicFTM-TRSTier 2
Ishii, 2020 [156]Single-session24Unnamed smartphone-based application with a server-side convolutional neural network (CNN)Lab/clinicTETRASTier 2ET subgroup
Seedat, 2020 [157]Single-session669ResNet-32 convolutional neural network (CNN)Lab/clinicNoneTier 2ET subgroup
Kovalenko, 2021 [16]Single-session13Logitech BRIO 4K PRO camera with OpenPose library for keypoint extractionLab/clinicNoneTier 2ET comparator
Ismail, 2022 [158]Multi-visit12Smartphone camera with a publicly available Instagram filter (“steady-hand filter”)Lab/clinicTETRASTier 2Tier 2+
Ma, 2023 [159]Single-session61Portable video camera/smartphone camera utilizing a Transformer-based HRNet-DARK architecture (HRTNet-Dark) and a Tremor Detection Transformer (TDT) algorithmLab/clinicCRSTTier 2
Wang, 2023 [160]Single-session50Scanned paper drawings analyzed via an optimized Convolutional Neural Network (CNN).Lab/clinicNoneTier 2
Baek, 2024 [36]Longitudinal37Scanned conventional paper-and-pen drawings analyzed via custom IDL softwareHybridCRSTTier 3*ET subgroup; Tier 4 signal: adherence and data completeness quantified for home follow-up phase
Costa, 2024 [161]Longitudinal36SMART motion system (BTS Engineering)Lab/clinicFTM-TRSTier 2Tier 2+
Friedrich, 2024 [162]Multi-visit66Mediapipe and DeepLabCut (DLC-RCNN) computer vision pose-tracking algorithmsLab/clinicFTM-TRSTier 2Therapeutic context
Lee, 2024 [163]Single-session59In-house tremor measurement app utilizing an Android-based smartphone camera (Samsung Galaxy S20)Lab/clinicCRSTTier 2Contains Tier 1 component
Shin, 2025 [164]Single-session22DeepLabCut (marker-less deep learning pose estimation) alongside an Xsens DOT gyroscope used for convergent validationLab/clinicTETRAS; TWSTRS-2Tier 2ET comparator
Wolke, 2025 [165]Single-session13Mediapipe (by Google) and Vision (by Apple) computer vision frameworks utilizing a smartphone camera (Apple iPhone 12 mini)Lab/clinicTETRASTier 2Contains Tier 1 component
Surface EMG (n = 10)
Akbostanci, 2000 [166]Single-session100San-ei EEG 1A96 electroencephalograph with Ag-AgCl surface electrodesLab/clinicBFRSTier 2None
Breit, 2008 [167]Single-session24Unnamed long-term ambulatory surface EMG systemHomeNoneTier 2
Nisticò, 2011 [168]Single-session15Unnamed system comprising two pairs of surface electrodes and needle electrodesLab/clinicNoneTier 2ET subgroup. Tier 2+
Ohye, 2012 [169]Multi-visit13Unnamed surface electromyography (EMG)Lab/clinicUPDRSTier 2ET subgroup. Therapeutic context
Ruonala, 2013 [170]Single-session17ME6000 biosignal monitor with Medicotest M-00-S Ag/AgCl surface electrodesLab/clinicNoneTier 2ET subgroup
Tavakkoli, 2014 [171]Single-session20Power-Lab systemLab/clinicNoneTier 2ET subgroup
Cernera, 2021 [172]Single-session10Trigno Acquisition Unit wearable sensors (Delsys, Inc) and Nexus-D telemetry wand (Medtronic)Lab/clinicFTM-TRSTier 2
Muruzheva, 2022 [173]Single-session90Not reported (described only as a pair of surface electrodes for electromyographic recordings)Lab/clinicFTM-TRSTier 2None
Gulati, 2024 [174]Single-session28Natus Elite – Viking six-channel surface electromyography system and a triaxial accelerometerLab/clinicNoneTier 2ET subgroup
Salazar, 2025 [175]Multi-visit18Unnamed surface electromyography (EMG) and accelerometerLab/clinicTETRASTier 2Therapeutic context
Acoustic Voice Analysis (n = 6)
Akkunje, 2021 [176]Single-session69Computerized Speech Labs 4500 (KAYPENTAX) with high-fidelity microphone (SHURE SM-48) and electroglottography (EGG) model 6103 (KAYPENTAX)Lab/clinicVTRSTier 2
Suppa, 2021 [42]Multi-visit58High-definition audio recorder H4n Zoom with a Shure WH20 Dynamic Headset Microphone.Lab/clinicFTM-TRSTier 2Tier 2+
Rao, 2023 [177]Single-session20CSL4500 microphone with MDVP software (KAYPENTAX) and a custom Probabilistic Source-Filter Model (PSFM) algorithmLab/clinicNoneTier 2
Larner, 2024 [178]Multi-visit12Voice Evaluation Suite (Estill Voice International) using a handheld Ultralink ULM300M wireless microphone (Behringer) and Praat Speech Analysis programLab/clinicClinician visual estimateTier 2Therapeutic context
Borders, 2025 [179]Multi-visit18The Computerized Speech Lab system (CSL Model 4500B, PENTAX Medical) with an AKG P220 microphone and Praat softwareLab/clinicQUEST-VoiceTier 2ET subgroup. Therapeutic context.
Rusz, 2025 [180]Single-session54Head-mounted condenser microphone (Beyerdynamic Opus 55) and MATLAB softwareLab/clinicTETRASTier 2ET subgroup
Gait & Whole-Body (n = 4)
Moon, 2020 [181]Single-session43APDM Opal IMU sensors (analyzed via Mobility Lab software),Lab/clinicNoneTier 2ET subgroup
Kroneberg, 2022 [182]Single-session20Mobility Lab V1 hardware (APDM, Oregon USA)Lab/clinicSARA; FTM-TRSTier 2Therapeutic context
Robertson-Dick, 2023 [183]Single-session20APDM Mobility Lab™ six inertial sensor system (APDM™; version 1)Lab/clinicFXTAS-RSTier 2ET comparator
Kroneberg, 2024 [184]Single-session12Mobility Lab™ system (APDM, V1 hardware)Lab/clinicNoneTier 2ET comparator

[i] Studies are grouped by sensor modality and ordered chronologically within each group. TRACE tier assignments reflect the highest level of validation evidence achieved by each study. * Denotes Tier 3 (Ambulatory and Longitudinal Validation), defined as studies meeting all three criteria of calendar day separation, home or ambulatory data collection, and temporal performance evidence; ** Denotes Tier 4 (Clinical Trial Readiness), defined as a digital metric pre-specified as a primary or secondary endpoint in a registered clinical trial, or multi-site deployment across three or more independent sites with adherence and data completeness reporting. No study achieved Tier 5 (Economic and Implementation Readiness). Denotes studies deploying sensors spanning two modality categories; listed under primary modality.

ET n refers to the number of Essential Tremor participants only; where studies enrolled mixed diagnostic cohorts, participants with other diagnoses are excluded from this count. The Clinical Anchor column records the validated clinical rating scale used as the reference standard for digital metric validation. None indicates no formal correlation with a validated clinical scale was reported. Informal estimate indicates that a clinician visual rating was used without a formally validated instrument. Where a scale was used solely for patient characterisation or group assignment without direct correlation against the digital metric, this is also recorded as None.

The Notes and Flags column records the following methodological designations. ET comparator: ET patients were recruited as a reference group for a study whose primary focus was another movement disorder, most commonly Parkinson’s disease; ET-specific digital biomarker performance was not a primary study aim. ET subgroup: ET patients were included within a larger mixed-diagnosis cohort without a dedicated ET-focused analysis. Tier 2+: a multi-visit or test-retest clinic design providing reliability, MDC, or longitudinal validation data across calendar-separated sessions without a home or ambulatory component; these studies are assigned Tier 2 in all analyses but represent the strongest candidates for Tier 3 promotion given the addition of a home deployment component. Tier 4 signal: a Tier 3 study that additionally demonstrates one or more operational features consistent with Tier 4 readiness, such as multi-site data collection or quantified adherence reporting, but does not meet the full Tier 4 threshold due to absence of a pre-registered endpoint or defined measurement context. Feasibility-grade Tier 3: all Tier 3 criteria are met but the ET sample size is fewer than 15, limiting the generalisability and statistical precision of the longitudinal and ambulatory evidence. Repeated-measures exception: the study was included despite an ET sample size below the standard eligibility threshold of ten participants based on a strong repeated-measures design providing substantial within-subject measurement. Therapeutic context: the device serves a primary therapeutic function; the digital biomarker is a secondary or co-primary endpoint rather than the sole purpose of the device. Contains Tier 1 component: the study includes both bench-level or phantom device validation and ET clinical patient data; the tier assigned reflects the highest tier achieved in the clinical component.

ET, Essential Tremor; TRACE, Technology Readiness and Clinical Evidence framework; IMU, inertial measurement unit; EMG, electromyography; TETRAS, Tremor Research Group Essential Tremor Rating Assessment Scale; FTM-TRS, Fahn-Tolosa-Marín Tremor Rating Scale; CRST, Clinical Rating Scale for Tremor; CRST-C, Clinical Rating Scale for Tremor Part C; MDS-UPDRS, Movement Disorder Society Unified Parkinson’s Disease Rating Scale; MDC, Minimum Detectable Change; NR, not reported; TAPS, Transcutaneous Afferent Patterned Stimulation; wTRS, Washington Heights-Inwood Genetic Study of Essential Tremor (WHIGET) tremor rating scale; TCRS, Tremor Clinical Rating Scale, a modified Fahn-Tolosa-Marin scale; FXTAS-RS, Fragile X-associated tremor/ataxia syndrome Rating Scale; BFRS, Bain & Findley Rating Scale; VTRS, Vocal Tremor Rating Scale; SARA, Scale for the assessment and rating of ataxia; QUEST-Voice, a 5-point patient-reported severity scale from the Quality of Life in Essential Tremor Questionnaire.

Against the TRACE framework, 154 of 165 studies (93%) are classified at Tier 2 (Table 1), demonstrating construct or discrimination validity in a supervised clinical setting. Nine reach Tier 3 [28, 29, 30, 31, 32, 33, 34, 35, 36], all IMU-based except one using remote optical analysis of posted drawing sheets [36]; two achieve Tier 4, both deploying the Cala Health TAPS wristband [21, 37]. No study meets Tier 5 criteria. Within the Tier 2 evidence base, eight studies employed multi-visit or test-retest clinic designs providing reliability or longitudinal validation data across calendar-separated sessions without a home component (Tier 2+); these are highlighted in Table 1. Five Tier 3 studies additionally met operational prerequisites for Tier 4 through multi-site delivery or quantified adherence reporting but lacked a pre-registered tremor endpoint and are flagged as Tier 4 signal studies [28, 32, 34]. The modality-by-tier distribution is visualised in Figure 2.

Figure 2

Distribution of included studies (n = 165) by sensor modality and TRACE validation tier.

Each bubble represents the number of studies at a given modality and tier intersection; bubble area is proportional to the number of studies. No studies achieved Tier 5. Twenty-four studies deployed sensors spanning two modality categories and are counted once under their primary modality; had these been dual-counted, Tier 2 totals would increase by 16 for surface EMG, 6 for wearable IMUs, and 2 for computer vision. IMU, inertial measurement unit; TRACE, Technology Readiness and Clinical Evidence framework; Tier 1, Technical Verification; Tier 2, Referenced Clinical Validation; Tier 3, Ambulatory and Longitudinal Utility; Tier 4, Clinical Trial Readiness; Tier 5, Economic and Implementation Readiness.

Modality Findings

Wearable IMUs and Smartwatches (n = 114)

The IMU evidence base is the largest and most technically heterogeneous. Construct validity against clinical rating scales is demonstrated in 42 studies; discrimination validity between diagnostic groups in 49 (Table 1; Figure 3). Study design is predominantly single-session; 14 RCTs and eight longitudinal designs provide multi-visit data, though most were conducted in a therapeutic rather than validation context (Table 1). López-Blanco and colleagues reported MDC values for a smartwatch gyroscope in a two-visit clinic design (Tier 2+) [38]; PRO correlation is present in six studies (Figure 3).

Figure 3

Evidence gap matrix for digital biomarker validation in Essential Tremor across six sensor modalities (n = 165).

Rows represent the six sensor modalities identified in this review. Columns represent ten binary validation criteria extracted from each included study. Cell colour indicates the proportion of studies within each modality meeting that criterion, scaled from yellow (low proportion) to dark red (high proportion); cell values display the raw count. Twenty-four studies deployed sensors spanning two modality categories and are counted once under their primary modality; validation flags are reported from the primary modality only. MID, minimum important difference; MDC, minimum detectable change; PRO, patient-reported outcome; IMU, inertial measurement unit; EMG, electromyography.

Eight studies reach Tier 3, all demonstrating ambulatory home monitoring or longitudinal data collection across calendar-separated days (Table 1; Figure 2). The Cala Health TAPS platform accounts for five of these [21, 29, 32, 34, 37]. Three carry Tier 4 signal flags [28, 32, 34]; two reach Tier 4 [21, 37]. The two Tier 4 studies, both deploying the Cala Health TAPS wristband, represent the strongest validation evidence in the review. Isaacson and colleagues evaluated 263 patients in a registered multi-site trial, capturing pre- and post-stimulation tremor power twice daily over three months of home use; 21,806 valid sessions were recorded and tremor power tracked clinical rating scale improvement longitudinally (TETRAS r = 0.67) [21]. Dai and colleagues conducted a randomised pragmatic trial (n = 276 completing one-month follow-up) in which tremor power was pre-specified as the primary digital endpoint (NCT05540626), with multi-site recruitment, quantified adherence, and anchoring against the BF-ADL activities of daily living scale [37]. Together these studies demonstrate that a digital tremor metric can be operationalised as a pre-registered endpoint in a registered trial with adherence and data completeness reporting at scale, the defining requirements of Tier 4. Kwon and colleagues found that wrist roll and yaw outperformed finger and forearm placements across three simultaneous sites, with direct implications for sensor positioning in trial protocols [39]. Despite the size of the evidence base, home deployment is present in only 11 of 114 studies, longitudinal responsiveness in 17, and MDC data in one (Figure 3).

Digitised Handwriting and Drawing (n = 16)

Construct validity is demonstrated in ten studies, predominantly against FTM-TRS or TETRAS; discrimination in eight (Table 1; Figure 3). The Wacom Intuos tablet and Archimedes spiral task are the dominant platform (Table 1). Haubenberger and colleagues established foundational validation in a pre-registered ethanol challenge design [40]. Elble and Ellenbogen derived MDC values comparable to FTM-TRS from a three-day clinic trial, noting a day-one practice effect with direct implications for trial baseline measurement (Tier 2+) [41]. No study has deployed in the home setting and no minimum important difference exists; available MDC values quantify measurement precision, not patient-perceived meaningful change.

Computer Vision (n = 15)

Construct validity is demonstrated in seven studies; discrimination in seven (Table 1; Figure 3). One study reaches Tier 3: Baek and colleagues analysed drawings completed at home and returned by post with follow-up to 12 months post-thalamotomy (CRST Part C R² = 0.49, 35 of 39 eligible patients returning sheets), demonstrating longitudinal responsiveness, PRO correlation, and quantified adherence (Tier 4 signal) [36]. No commercially validated contactless platform currently exists.

Surface EMG (n = 10), Voice (n = 6), and Gait (n = 4)

These three modalities contribute 20 studies, all at Tier 2 and predominantly single-session in design (Table 1; Figure 3). sEMG studies are exclusively lab-based; construct validity is demonstrated in three, discrimination in six (Figure 3). Voice studies include the broadest single-study validation profile in the review outside IMU: Suppa and colleagues demonstrated discrimination, treatment responsiveness across separated visits, and PRO correlation via the Voice Handicap Index (Tier 2+) [42]. Gait studies demonstrate construct validity in two and discrimination in four but are limited to single-session designs (Figure 3). The digital characterisation of voice tremor, head tremor, and gait disturbance remains disproportionately sparse relative to their clinical significance in ET and ET-plus.

Cross-Cutting Evidence Gaps

Figure 3 maps ten validation fields against all six modalities. Evidence concentrates in construct and discrimination validity (the Tier 2 criteria) while the ecological and interpretability fields are sparse. Home or ambulatory data collection is present in 13 of 165 studies, all IMU or computer vision; longitudinal responsiveness to change in 26; adherence or data completeness reporting in 27; multi-site deployment in 18; MID or MDC data in three, all quantifying measurement precision rather than patient-perceived meaningful change; and PRO correlation in 11. Stratified analysis within the Tier 2 classification reveals an important internal gradient: eight Tier 2+ studies provide test-retest reliability or MDC data across calendar-separated clinic visits without a home component; five Tier 4 signal studies, all at Tier 3, combine ambulatory deployment with multi-site data collection or quantified adherence reporting but lack a pre-registered digital tremor endpoint (Table 1).

Two further cross-cutting limitations warrant explicit statement. First, diagnostic criteria for ET are inconsistently reported, particularly in engineering publications where enrolment is frequently described only as “diagnosed by a neurologist,” without specifying consensus criteria or whether ET-plus variants were included. Second, the phenotypic scope of the literature is narrow: upper-limb postural and kinetic tremor is addressed across all modalities, while voice, head, gait, and the non-tremor features of ET-plus are digitally under-characterised. Table 2 summarises endpoint readiness by modality.

Table 2

Endpoint readiness summary by sensor modality for digital biomarker validation in Essential Tremor (n = 165).

MODALITYSTUDIES (n)HIGHEST TRACE TIERBEST-EVIDENCED DEVICE(S)/PLATFORM(S)PRIMARY GAP FOR TRIAL INTEGRATIONMID REPORTED (STUDY)PRO CORRELATION (STUDY)
Wearable IMUs & Smart-watches114Tier 4Cala Trio/TAPS wristband (Cala Health) Tier 4: Isaacson 2020 [21] (n = 263, 26 sites, home, RCT), Dai 2023 [37] (n = 276, multi-site, home, RCT).
Kinesia HomeView (Great Lakes NeuroTechnologies) Tier 3: Pulliam 2014 [28] (n = 19, multi-site, home).
Multiple Tier 3 studies also used smartwatches (Zheng 2017 [30]), unnamed wearables (McGurrin 2022 [33], Aladro 2025 [35]), and Cala Trio in therapeutic context (Brillman 2022 [32], Lu 2023 [34]).
22 studies used dual modalities (See Table 1).
Formal MID absent for all platforms; MDC reported only by López-Blanco 2018 [38]. PRO correlation established in six studies (Fuchs 2021 [107], Louis 2000 [51], McGurrin 2022 [33], Brillman 2022 [32], Holly 2024 [130], Aladro 2025 [35]) but none pre-specified as a primary endpoint.
105 of 114 studies are Tier 2 studies.
Yes: López-Blanco 2018 [38] (MDC only, not formally derived MID)Yes: Fuchs 2021 [107] (QUEST); Louis 2000 [51] (Tremor Disability Questionnaire); McGurrin 2022 [33] (TETRAS-ADL); Brillman 2022 [32] (PGI-I); Holly 2024 [130] (TETRAS-ADL); Aladro 2025 [35] (self-rated tremor VAS)
Digitised Handwriting & Drawing16Tier 2Wacom Intuos series (digitising tablet): Multiple Tier 2 studies including Haubenberger 2011 [40], Elble 2017 [41], Legrand 2017 [143], Motin 2021 [150].
Two studies used dual modalities (Schuhmayer 2017 [144], Kim 2022 [151]).
No home deployment using a digital tablet; no longitudinal disease-tracking study; MDC available (Elble 2017 [41], Rajan 2023 [43]) but no formally derived MID; no PRO correlation.
All 16 studies are Tier 2.
Yes: Elble 2017 [41] (MDC, 51% of baseline); Rajan 2023 [43] (MDC, 19.9% for Meijer severity index)No
Computer Vision & Contactless15Tier 3No commercially available or clinically validated platform. Studies use bespoke smartphone applications (Ishii 2020 [156]; Lee 2024 [163]), repurposed social media filters (Ismail 2022 [158]; Instagram “steady-hand filter”), scanned paper drawing pipelines (Baek 2024 [36], Tier 3; Wang 2023 [160]), or open-source computer vision frameworks (Wolke 2025; Google MediaPipe/Apple Vision).
Two studies used dual modalities (Uhríková 2011 [154], Shin 2025 [164]).
No continuous passive ambulatory monitoring; small evidence base (n = 15); no MID reported; Tier 3 evidence method-specific (mailed paper drawings, Baek 2024 [36]). Tier 4 signal present (Baek 2024 [36]: adherence quantified, longitudinal responsiveness demonstrated) but no commercial platform or pre-specified endpoint.
Fourteen of 15 studies are Tier 2.
NoYes: Baek 2024 [36] (CRST-C self-report)
Surface EMG10Tier 2Delsys Trigno wearable sensors (Cernera 2021 [172]); Natus Elite Viking (Gulati 2024 [174]); ME6000 biosignal monitor (Ruonala 2013 [170]); unnamed long-term ambulatory EMG (Breit 2008 [167]).
Two studies used dual modalities (Gulati 2024 [174], Salazar 2025 [175]).
No home or ambulatory deployment; no longitudinal disease-tracking data; no MID reported. Construct validity typically anchored against accelerometry rather than a clinical rating scale (6 of 10 studies report discrimination validity).
All studies are Tier 2.
NoYes: Akbostanci 2000 [166] (ADL inventory); Muruzheva 2022 [173] (ADL questionnaire)
Acoustic Voice Analysis6Tier 2Zoom H4n recorder with Shure WH20 headset microphone: (Suppa 2021 [42])
CSL 4500B (PENTAX Medical): used in multiple studies (Akkunje 2021 [176]; Rao 2021 [177]; Borders 2025 [179]).
No home or ambulatory recording; no continuous monitoring paradigm; evidence base remains small (n = 6). Three studies provide longitudinal data (Suppa 2021 [42], Larner 2024 [178], Borders 2025 [179]). Borders 2025 [179] applies an externally derived perceptual threshold of 4 Hz to interpret acoustic change, but this does not constitute a formally derived MDC or MID.
All studies are Tier 2.
NoYes: Suppa 2021 [42] (VHI); Borders 2025 [179] (QUEST-Voice)
Gait & Whole-Body Wearables4Tier 2APDM Opal/Mobility Lab (Clario): used in all four studies (Moon 2020 [181]; Kroneberg 2022 [182]; Robertson-Dick 2023 [183]; Kroneberg 2024 [184]).No home or ambulatory deployment; no longitudinal data; no MID reported; no PRO correlation. ET recruited primarily as a comparator group for PD or ataxia rather than as the target population. No ET-specific tremor metric; outcomes focus on gait and balance parameters.
All studies are Tier 2.
NoNo

[i] For each modality, the highest TRACE validation tier achieved by any included study is reported alongside the best-evidenced device(s) or platform(s), the primary gap preventing trial integration, and the availability of minimum important difference and patient-reported outcome correlation data. Studies marked with † used sensors spanning two modalities and are listed under their primary modality; see Figure 2 for dual-counted modality totals. MID values reported in López-Blanco 2018, Elble 2017 and Rajan 2023 represent Minimum Detectable Changes or anchor-derived thresholds rather than formally derived Minimum Important Differences. ET, Essential Tremor; TRACE, Technology Readiness and Clinical Evidence framework; MID, Minimum Important Difference; PRO, Patient-Reported Outcome; MDC, Minimum Detectable Change; TAPS, Transcutaneous Afferent Patterned Stimulation; QUEST, Quality of Life in Essential Tremor Questionnaire; VHI, Voice Handicap Index; CRST-C, Clinical Rating Scale for Tremor Part C (patient self-report); IMU, inertial measurement unit; EMG, electromyography.

Discussion

This scoping review maps 165 studies of digital tremor biomarkers in ET against a novel five-tier validation framework. The principal finding is that the field has generated a technically credible body of Tier 2 construct validity evidence, however the ecological, interpretability, and patient-centred evidence required to deploy any of these metrics as clinical trial endpoints is largely absent.

The included studies are heterogeneous by design, as expected in a scoping review mapping a broad technological landscape. Twenty three studies enrolled ET patients primarily as a comparator group for another disorder, 35 included ET within mixed-diagnosis cohorts, and one entered via the repeated-measures exception (Table 1). Diagnostic criteria varied: most recent studies applied or referenced the 2018 MDS consensus classification, but older studies frequently relied on unspecified clinical diagnosis, predating the formal distinction between ET and ET-plus. This heterogeneity does not undermine the principal finding, the concentration of evidence at Tier 2 and the near-absence of ecological validation is consistent across all study subgroups, but it does mean that pooled prevalence estimates for specific validation features should be interpreted with caution, and that future studies should specify which consensus criteria were applied.

Several frameworks exist for evaluating the maturity of digital health technologies, most notably the DiMe V3 model [23]. TRACE was developed for this review to address a gap: the need to formally separate single-session supervised validity from ambulatory and longitudinal utility, which remains the key translational bottleneck in many fields. By making the ecological transition an explicit, independently scorable tier rather than an implicit component of clinical validation, TRACE renders visible a gap that other frameworks obscure. The framework is visualised as an Archimedes spiral (Figure 4). Although derived for ET, TRACE was developed as a pragmatic framework for this review. Its tier criteria were designed to be transferable across sensing modalities and neurological conditions, and it may be applied prospectively as a design framework or retrospectively as an evidence mapping tool; however, formal external validation, inter-rater reliability testing, and application beyond ET remain priorities for future work.

Figure 4

The TRACE Framework — A Five-Tier Maturity Roadmap for Digital Biomarkers in Neurological Disease.

The TRACE framework provides a structured pathway for evaluating the validation maturity of digital biomarkers, progressing from controlled laboratory settings (centre) to real-world clinical implementation (outer loop). Each tier represents a distinct and cumulative evidence threshold: Technical Verification (Tier 1) establishes sensor accuracy against a reference standard in bench or phantom testing; Referenced Clinical Validation (Tier 2) demonstrates correlation with validated rating scales or diagnostic group discrimination in a supervised clinical setting; Ambulatory and Longitudinal Utility (Tier 3) requires reliability, stability, and responsiveness to change demonstrated across separate calendar days in unsupervised home or ambulatory settings; Clinical Trial Readiness (Tier 4) requires the digital metric to be pre-specified as a primary or secondary endpoint in a registered trial, with operational delivery demonstrated through adherence and data completeness reporting; Economic and Implementation Readiness (Tier 5) requires formal evidence of patient acceptability alongside at least one of: validated health economic modelling or demonstrated integration into a reimbursed clinical workflow.

The dashed segment between Tiers 2 and 3 denotes the Ecological Gap, the critical translational hurdle at which most digital biomarkers stall, where laboratory validity does not predict real-world longitudinal performance. A study is assigned to the highest tier for which all core criteria are satisfied; tier assignment is not cumulative. TRACE adapts and extends the Digital Medicine Society (DiMe) V3 framework: Tier 1 corresponds to DiMe V3 Verification; Tiers 2 and 3 together deconstruct DiMe V3 Clinical Validation into snapshot and longitudinal components; Tiers 4 and 5 extend beyond DiMe V3 to encompass the regulatory and economic evidence required for clinical adoption. Tier 4 is operationalised in this review as clinical trial readiness rather than formal regulatory qualification, reflecting the current maturity of the field. Full tier definitions, core criteria, and decision rules are provided in Supplementary File 1.

Across 154 Tier 2 studies, digital tremor metrics correlate with established clinical rating scales and, in many cases, discriminate ET from healthy controls and other movement disorders under supervised conditions. Several platforms demonstrate sufficient precision to differentiate severity levels within ET cohorts, though this is not uniformly reported.

What Tier 2 cannot establish is whether these properties hold in the environment where clinical trials would deploy them. Tremor in ET is characteristically provoked by action and modulated by emotional state, fatigue, caffeine, and medication timing; its day-to-day variability in natural conditions is substantially greater than the within-session variance characterised in clinic studies. A device that correlates well with TETRAS during a standardised postural holding task in a neurology clinic may behave very differently when patients attempt the same task independently at home, without instruction, at varying times of day, and across weeks or months. The step from Tier 2 to Tier 3 is not an incremental refinement but a qualitative transition in what is being tested.

This Ecological Gap, the failure of clinic-validated metrics to transfer reliably to ambulatory real-world settings, is not unique to ET. The Mobilise-D consortium developed a validation roadmap for digital mobility outcomes across multiple chronic conditions that formally separates technical, real-world technical, and clinical validation as distinct sequential stages, motivated by the recognition that supervised laboratory performance cannot be assumed to predict real-world measurement behaviour [24]. The DiMe V3 framework is based on the same principle: analytical validation is necessary but insufficient; clinical validity must be demonstrated in the intended context of use [23].

For ET, this lesson carries a specific complication. In PD, the mobility impairments targeted by digital monitoring, gait slowing, bradykinesia, reduced stride variability, manifest continuously during ordinary daily movement, meaning that background recording over hours or days will naturally and repeatedly sample the clinically relevant signal. ET tremor is different in kind: postural and kinetic tremor requires the limb to be held against gravity or engaged in a voluntary task, and is largely absent during rest, quiet sitting, or low-demand movement. A wrist sensor worn throughout the day will therefore spend most of its recording time in a signal state that does not represent the tremor that constitutes the clinical target. Closing the ecological gap for ET digital biomarkers consequently requires either structured task protocols delivered at home, asking patients to perform a postural holding or drawing task at a specified time, or the development of algorithms capable of identifying and extracting task-relevant epochs from free-living signal streams. The structured task approach has been validated at scale only within the Cala Health TAPS platform, where Isaacson and colleagues and Dai and colleagues deployed repeated home-based postural hold assessments across two registered multi-site trials enrolling over 500 patients, with pre-specified digital endpoints and quantified adherence [21, 37]; however, both studies are embedded within a therapeutic intervention programme rather than an independent biomarker validation context. Algorithm-based extraction of task-relevant epochs from free-living signal streams has not been validated at scale in ET, though Pulliam and colleagues demonstrated its feasibility alongside structured assessments in 20 patients across two calendar-separated days, finding that continuous waveform metrics correlated sufficiently with structured task scores to suggest epoch extraction as an alternative to prompted assessments [28].

No formally anchored minimum important difference exists for any ET digital biomarker. The three studies reporting MID-type values, Elble and Ellenbogen for digitised spirals, López-Blanco and colleagues for smartwatch gyroscopy, and Rajan and colleagues for tablet-derived mean deviation, all quantify measurement precision rather than patient-perceived meaningful change [38, 41, 43]. Elble and Ellenbogen’s MDC of 51% of baseline amplitude, for example, means tremor must roughly halve before a change exceeds measurement error [41]. Anchor-based MID derivation, requiring longitudinal data from patients crossing a clinically meaningful threshold alongside concurrent digital measurements, is methodologically straightforward but depends on the multi-visit, home-capable, PRO-integrated study design that remains largely absent from the current literature.

This matters directly for the regulatory pathway toward qualified drug development endpoints. Both the FDA’s Drug Development Tool (DDT) qualification programme and the EMA’s Qualification of Novel Methodologies process provide voluntary mechanisms through which a digital metric can be qualified for a specific context of use, but both require evidentiary packages extending well beyond analytical validation: clinical meaningfulness, minimum important difference estimation anchored to patient experience, and sensitivity to change in the intended context of use [44]. In the UK, software-based tremor analysis meeting the definition of a medical device falls under MHRA regulation as Software as a Medical Device. Across all three regulatory environments, the pathway from a metric that correlates with TETRAS in a clinical setting to one qualified as a drug development tool requires ecological validation and clinically anchored evidence of meaningful change, neither of which the current ET literature can provide for any modality. To date, no ET digital biomarker has achieved formal DDT qualification under either the FDA or EMA programmes [44]. The TAPS platform, deployed in the only two studies assigned Tier 4, demonstrates that clinical trial readiness is achievable for an ET digital endpoint; across two registered trials enrolling over 500 patients, tremor power was pre-specified as a digital outcome measure with multi-site recruitment, quantified adherence, and longitudinal tracking [21, 37]. However, both studies are embedded within a therapeutic intervention programme rather than an independent biomarker validation context, reflecting the needs of the TAPS trial ecosystem rather than the broader qualification work, including ecological validation and patient-anchored MID estimation, required for formal regulatory qualification as a reusable drug development tool. The measurement protocol, repeated home-based accelerometry with structured postural holds and cloud-based data capture, is separable from the therapeutic stimulation component and could be deployed independently as a digital endpoint in trials of other ET interventions.

The longer-term question, whether validated digital tremor metrics could one day guide treatment decisions in routine practice, as MRI lesion burden has in multiple sclerosis, is the question Tier 5 is designed to address [45]. No study in this review approached that threshold; economic and implementation readiness remains contingent on first bridging the ecological gap that separates most modalities from Tier 3.

The concentration of digital biomarker evidence in upper-limb postural and kinetic tremor reflects the primary diagnostic criterion for classical ET and the most tractable target for existing sensor technologies. However, ET is increasingly understood as a heterogeneous condition: head tremor is a recognised feature of the full syndrome, voice tremor affects a substantial minority of patients [46], and gait and balance impairment, while not diagnostic, are associated features that worsen with disease duration and may be exacerbated by stereotactic thalamic interventions [47]. The ET-plus classification formalised in the 2018 MDS consensus criteria encompasses patients with additional neurological signs including mild ataxia, dystonia, and rest tremor, and these patients may present to clinical trials in increasing numbers as eligibility criteria broaden [3].

If digital biomarker endpoints are to serve trials targeting the breadth of the ET phenotype, including ablative procedures and neuromodulation, validated metrics for non-upper-limb domains will be required. Six voice studies now demonstrate that vocal tremor is measurable and discriminable using accessible acoustic analysis equipment, with Suppa and colleagues providing the broadest validation profile including treatment responsiveness and PRO correlation (Tier 2+); however, no voice study has deployed outside the clinic, and home recording data represent a tractable next step [42]. The four gait studies are limited to single-session designs, and none characterises ET gait in its own right (Table 1). Dedicated ET gait and balance monitoring, drawing on the Mobilise-D method of seven-day home deployment, would address both the phenotypic coverage gap and the ecological validity deficit simultaneously [24, 48]. The diagnostic criteria heterogeneity observed across studies, where enrolment criteria are frequently underspecified, is a related concern: pooling data across studies that may have included ET, ET-plus, and dystonic tremor without differentiation limits both the interpretation of existing findings and the generalisability of any future MID derivation.

Translating the current evidence base into trial-ready endpoints requires a sequenced programme of work. For IMUs, the priority is extending ambulatory Tier 3 evidence beyond the TAPS ecosystem, which accounts for five of eight Tier 3 studies. Feasibility studies using commercially available, clinically validated platforms such as Kinesia (Great Lakes NeuroTechnologies) and the Clario Opal, both with established clinical trial infrastructure, deploying structured home task protocols with concurrent clinical rating scales and PRO instruments across a minimum of two calendar-separated sessions, would establish whether the ambulatory findings from the TAPS platform generalise to independent devices. For digitised handwriting, no study has deployed in the home setting; the next step is a home deployment feasibility study incorporating a familiarisation session, as recommended by the Elble practice effect data, followed by multi-visit MDC reassessment and an anchor-based MID derivation sub-study [26]. For computer vision, independent replication of the Baek postal drawing approach in a prospective ET cohort with a standardised analysis pipeline would establish whether this method is generalisable [36]; the absence of a commercially validated contactless platform represents an additional barrier to near-term trial deployment not faced by the IMU or handwriting modalities. For sEMG, all ten studies are lab-based and single-session; feasibility of home-based EMG recording for ET has not been explored.

Across all modalities, three cross-cutting requirements apply: standardised diagnostic criteria reporting, specifying whether MDS 2018 consensus criteria were applied and whether ET-plus patients were included [3]; incorporation of PRO instruments (QUEST, TETRAS-ADL, Voice Handicap Index) as standard in all Tier 2+ studies to anchor MID derivation and ensure patient-centred interpretability; and adoption of a consensus minimum assessment dataset, as proposed by Varghese and colleagues [49]. Prospective application of the TRACE framework within longitudinal observational cohorts, using tier criteria as pre-specified design requirements rather than post-hoc labels, would provide a structured pathway toward the ecological and interpretability evidence currently absent from the literature.

Several limitations of this review should be noted. As a scoping review, formal critical appraisal was not performed; TRACE tier assignment captures validation maturity, not methodological rigour, and a higher tier should not be interpreted as indicating superior study quality. The modality imbalance, 114 of 165 studies using wearable IMUs, means field interpretation is disproportionately informed by a single sensor class. Extraction from mixed-population studies may have introduced imprecision in ET-specific estimates, and unpublished evidence could alter some tier assignments. Finally, publication bias towards positive construct validity findings is likely, and the absence of null validation studies may overstate the validity of Tier 2 evidence for individual platforms.

One hundred and sixty-five studies across six sensing modalities confirm that digital tremor measurement in ET is technically feasible and clinically anchored in supervised settings, but the ecological, patient-centred, and regulatory-ready evidence required for trial deployment is absent outside the Cala Health TAPS platform [21, 37]. What the field requires is not further construct validity accumulation in single-session clinic studies, but deliberate investment in home deployment, MID derivation, and PRO integration across independent platforms. The TRACE framework, described in full in Supplementary File 1, provides a potential transferable tool for other chronic neurological conditions where these validation gaps are likely to recur.

Artificial Intelligence Disclosure

Artificial intelligence tools (Claude, Anthropic) were used to assist with manuscript preparation and editing. Additionally, Gemini 3.1 Flash Image (Nano Banana 2) was used to co-generate the visual components of Figure 4. No other figures were produced or modified using AI tools. All content was reviewed, verified, and approved by all authors. AI tools were not used to generate or alter primary data.

Additional Files

The additional files for this article can be found as follows:

Supplementary File 1

The TRACE Maturity Framework for Digital Biomarkers in Neurological Disease. DOI: https://doi.org/10.5334/tohm.1205.s1

Supplementary File 2

Electronic Search Strategies. DOI: https://doi.org/10.5334/tohm.1205.s2

Supplementary File 3

TRACE-ET Data Charting Instrument. DOI: https://doi.org/10.5334/tohm.1205.s3

Ethics and consent

This scoping review did not involve primary data collection from human participants or animals. No ethics committee approval or informed consent was required.

Acknowledgements

The authors wish to thank the institutions of Newcastle University, Newcastle upon Tyne Hospitals NHS Foundation Trust, the Walton Centre NHS Foundation Trust, and the University of Liverpool for supporting this work.

DOI: https://doi.org/10.5334/tohm.1205 | Journal eISSN: 2160-8288
Language: English
Page range: 39 - 39
Submitted on: Mar 18, 2026
Accepted on: May 31, 2026
Published on: Jun 8, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 David Ledingham, Antonella Macerello, Nicola Pavese, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.