Postamputation phantom pain is notoriously persistent with few validated treatments. Cryoneurolysis involves the application of low temperatures to reversibly ablate peripheral nerves. The authors tested the hypothesis that a single cryoneurolysis treatment would decrease phantom pain 4 months later.
The authors enrolled patients with a lower-limb amputation and established phantom pain. Each received a single-injection femoral and sciatic nerve block with lidocaine and was subsequently randomized to receive either ultrasound-guided percutaneous cryoneurolysis or sham treatment at these same locations. The primary outcome was the change in average phantom pain intensity between baseline and 4 months as measured with a numeric rating scale (0 to 10), after which an optional crossover treatment was offered. Investigators, participants, and clinical staff were masked to treatment group assignment with the exception of the treating physician performing the cryoneurolysis, who had no subsequent participant interaction.
Pretreatment phantom pain scores were similar in both groups, with a median [quartiles] of 5.0 [4.0, 6.0] for active treatment and 5.0 [4.0, 7.0] for sham. After 4 months, pain intensity decreased by 0.5 [–0.5, 3.0] in patients given cryoneurolysis (n = 71) versus 0 [0, 3] in patients given sham (n = 73), with an estimated difference (95% CI) of –0.1 (–1.0 to 0.7), P = 0.759. Following their statistical gatekeeping protocol, the authors did not make inferences or draw conclusions on secondary endpoints. One serious adverse event occurred after a protocol deviation in which a femoral nerve cryolesion was induced just below the inguinal ligament—instead of the sensory-only saphenous nerve—which resulted in quadriceps weakness, and possibly a fall and clavicle fracture.
Percutaneous cryoneurolysis did not decrease chronic lower extremity phantom limb pain 4 months after treatment. However, these results were based upon the authors’ specific study protocol, and since the optimal cryoneurolysis treatment parameters such as freeze duration and anatomic treatment location remain unknown, further research is warranted.
Chronic postamputation pain is relatively common and difficult to treat once established
Continuous neural blockade with local anesthetic may reduce symptoms of persistent postamputation pain, even beyond the offset of local anesthetic effect
Cryoneurolysis can reduce nociceptive transmission for a longer period of time by ablating peripheral nerves
In patients with established chronic postamputation pain, cryoneurolysis of distal sciatic and femoral nerves did not reduce pain scores at 4 months of follow-up compared to sham
Exploratory analysis suggested cryoneurolysis was associated with more pain among patients with transfemoral and ankle/foot amputations, but less pain among patients with transtibial amputation
Tens of millions of people are living with a lower limb amputation,1 with up to 50 to 85% developing chronic, intractable pain perceived as originating from the missing limb, often described as “phantom limb pain.”2 Phantom pain is notoriously persistent,3 with few adequately powered randomized controlled trials to guide treatment.4 The precise etiology of phantom pain is unclear. However, neural restructuring frequently occurs after limb amputation, and the degree of cortical reorganization is associated with phantom pain intensity.5
One study suggested that a single-injection local anesthetic peripheral nerve block can resolve both phantom pain and cortical abnormalities, although the improvements disappeared after block resolution.6 Nevertheless, this demonstrated that at least in some cases, persistent cortical abnormalities and phantom pain may be maintained from abnormal input from the peripheral nervous system.7 A recent randomized controlled trial reported that prolonging a peripheral nerve block using a 6-day continuous perineural local anesthetic infusion extended limb analgesia for at least 1 month.8 These findings suggest that a peripheral nerve block of extended duration—lasting weeks or months rather than days—may allow prolonged cortical reorganization and provide lasting relief from phantom pain.
A prolonged neural block is provided with cryoneurolysis, which entails the application of very low temperatures (approximately –70°C using nitrous oxide) to reversibly ablate peripheral nerves.9 Guided using real-time imaging, a percutaneously inserted probe has gas circulated throughout its length, inducing cold at the distal end and freezing the target nerve.10 There is no implanted device, and there is no external equipment to prepare, manage, or malfunction—a single administration results in effects measured in weeks to months without any subsequent patient or clinician interventions. While multiple uncontrolled case series suggest a possible analgesic benefit in treating phantom and residual limb pain with percutaneous cryoneurolysis,11–13 the technique has not been validated for postamputation pain in a randomized controlled study.
We therefore designed this multicenter, randomized, observer- and participant-masked, sham-controlled, parallel-arm, partial crossover clinical trial to determine if a single treatment of ultrasound-guided percutaneous cryoneurolysis would provide effective and lasting analgesia for established lower extremity phantom limb pain. Specifically, we tested the primary hypothesis that the change in average phantom limb pain intensity between baseline and 4 months would be greater after cryoneurolysis versus sham treatment (as measured with the Numeric Rating Scale of the Brief Pain Inventory).
Materials and Methods
This study was conducted within the ethical guidelines outlined in the Declaration of Helsinki and followed Good Clinical Practice. The trial was prospectively registered at clinicaltrials.gov (NCT03449667; Principal Investigator: Brian M. Ilfeld, M.D., M.S.; initial posting: February 28, 2018). The protocol was approved by the Institutional Review Board at each of the 6 enrolling centers as well as the United States Army Medical Research and Development Command Human Research Protection Office (Fort Detrick, Maryland). Responsible for the oversight and conduct of the investigation was an independent Data Safety Monitoring Board (appendix). Written, informed consent was obtained from all participants.
Participants
Six medical centers enrolled patients, including public and private civilian, Veterans Affairs, and military treatment facilities. Potential participants were presented with the study in chronic pain clinics, and advertisements were posted in amputee-centered national print and web-based publications. Enrollment was offered to adult patients of at least 18 yr of age with a lower limb traumatic or surgical amputation at least 12 weeks before enrollment. The amputation had to be distal to the hip (femoral head remaining), and patients had to experience at least moderate phantom limb pain defined as a 3 or higher on the Numeric Rating Scale (0 to 10; 0 = no pain; 10 = worst imaginable pain) at least daily for the previous 2 months. Patients had to agree to avoid both changes to their analgesic regimen as well as elective surgical procedures from 1 month before, and at least 4 months after, the initial study intervention. Patients were excluded for an amide local anesthetic allergy, pregnancy, incarceration, inability to communicate with the investigators, morbid obesity (body mass index greater than 40 kg/m2), and possessing any contraindication specific to cryoneurolysis such as a localized infection at the treatment site, cryoglobulinemia, cold urticaria, and Raynaud’s syndrome.
Intervention
Participants were asked to not eat or drink after midnight before the procedure. For women of childbearing age with the possibility of pregnancy, a sample of urine was collected before any study interventions to rule out pregnancy. All participants had a peripheral intravenous catheter inserted, standard noninvasive monitors applied (blood pressure cuff, pulse oximeter, five-lead electrocardiogram), and oxygen administered via a facemask or nasal cannula. Oral and/or intravenous sedatives and analgesics such as midazolam, diazepam, and fentanyl were titrated for patient comfort, if necessary, while ensuring that patients remained responsive to verbal cues.
The specific nerves targeted were the sciatic and femoral (or their distal branches), with the most distal location clearly visualized with ultrasound treated (but before the sciatic bifurcation and at the level of the medial epicondyle for the saphenous nerve). The potential cryoneurolysis entry sites were prepared with chlorhexidine gluconate and isopropyl alcohol and a sterile, fenestrated drape. Using the appropriate ultrasound transducer for the specific anatomic location and subject anatomy (linear vs. curvilinear array), the target nerves were identified in a transverse cross-sectional (short axis) view. A local anesthetic skin wheal was raised adjacent to the ultrasound transducer, and a Tuohy-tip needle was inserted through the skin wheal in plane beneath the ultrasound transducer and directed until the needle tip was immediately adjacent to the target nerve. Local anesthetic (1 to 3 ml, lidocaine 2%) was injected in divided doses with frequent aspiration. This was repeated for the additional target nerve(s). Within 20 min of the last injection, the subject’s limb pain level was evaluated on the 0 to 10 Numeric Rating Scale, and if higher than at baseline before injection, the subject did not continue with treatment, and their participation in the study ended.
Treatment Group Assignment (Randomization)
Remaining participants were allocated to one of two possible treatments: active cryoneurolysis or sham (placebo). Randomization was stratified by institution in randomly chosen block sizes using computer-generated lists by the informatics group of the Department of Outcomes Research at the Cleveland Clinic (Cleveland, Ohio). Treatment group assignment was conveyed to the enrolling sites via the same secure web-based system used to collect and collate all postintervention outcomes (Research Electronic Data Capture, Cleveland Clinic).14
A cryoneurolysis console device was used for all participants (PainBlocker, Epimed International, USA). Cryoneurolysis probes are available that either (1) pass nitrous oxide to the tip inducing freezing temperatures (approximately –70°C); or (2) vent the nitrous oxide at the base of the probe so that no gas reaches the probe tip, resulting in no temperature change. Importantly, these 16-gauge, trocar-tipped probes are indistinguishable in appearance and audible cues, and therefore, investigators, participants, and all clinical staff were masked to treatment group assignment (with the exception of the treating physician performing the cryoneurolysis). After repeated sterile preparation and draping, an angiocatheter-like introducer was inserted beneath the ultrasound transducer and directed until immediately adjacent to the target nerve. The appropriate probe (active vs. sham) was inserted through the introducer, and the cryoneurolysis device was triggered using three cycles of 2-min gas activation separated by 1-min defrost periods.15 The process was repeated for each treated nerve using the same probe for all applications (e.g., all nerves received either active cryoneurolysis or sham/placebo, and not a mix of the two possible treatments).
Of note, the treating physician was not masked to treatment group assignment during the cryoneurolysis procedure. This was because the ice ball forming at the distal end of the probe—with active treatment—is clearly visible by ultrasound, and the lack of an ice ball for placebo participants is equally clear.16 We believe it is essential to continuously visualize the probe and target nerve throughout the freeze and thaw cycles to ensure (1) the entire nerve diameter is fully encompassed by the sphere of ice and (2) the ice ball remains relatively motionless to prevent it from tearing surrounding tissue. This cannot be achieved if the ultrasound is turned off during nitrous oxide administration to mask the provider, and we prioritized patient safety over provider masking. Treating physicians did not have subsequent contact with study participants, or data collection, management, and analysis.
Before discharge, participants and their caretakers were provided with verbal and written instructions as well as the contact information for an investigator. Patients were informed that any sensory deficits from the short-acting lidocaine bolus that they may be experiencing would regress, and that they should not be alarmed by any subsequent increase in pain. Participants were provided with crutches if they so desired, although previous experience suggested that nearly all patients treated with cryoneurolysis continue to ambulate using their prosthesis without difficulty.
Optional Crossover Treatment
Up to 2 months after the primary outcome measurement at month 4, participants could return for an optional repeated intervention procedure (“crossover”) with the alternative treatment (either active cryoneurolysis or sham/placebo), using the same protocol as described for the initial intervention. The crossover treatment was not required for study participation, as the primary analysis included a parallel study design for the initial intervention evaluated before any crossover treatment. However, the optional crossover treatment was offered for two reasons: (1) to ensure that all participants had access to the proposed treatment, regardless of the treatment to which they were initially assigned; and (2) to permit intrasubject differences between treatments to be analyzed (e.g., assessing treatment-effect heterogeneity, or the variability of the causal effect across individuals, which will would not be available from the parallel-group portion of the study alone). These intrasubject differences were secondary analyses, as there would be patient selection bias regarding which participants decided to have the crossover treatment.
The main results of the study were provided to all participants after final analysis.
Outcome Measurements
We selected outcome measures that have established reliability and validity, with minimal interrater discordance, and are recommended for chronic pain clinical trials by the Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials consensus statement.17 Outcomes were evaluated at baseline (before intervention), on days 1 and 7, as well as months 1, 2, 3, and 4, relative to the initial and optional crossover treatment(s). In addition, outcomes were evaluated 12 months after the initial intervention. Outcome measures were collected in person for the baseline measurements immediately before the initial intervention as well as the crossover treatments. All other outcomes were collected by investigators at the University of California-San Diego (San Diego, California) by telephone regardless of enrolling center.
The questionnaires differentiated multiple dimensions of limb sensations or pain. Residual limb (“stump”) pain indicated painful sensations localized to the portion of limb still physically present.18 Phantom limb pain indicated painful sensations referred to the lost body part.18 Phantom limb sensations indicated non-painful sensations referred to the lost body part.18
Each type of pain or sensation was defined for patients immediately before questionnaire application at each time point, and patients were instructed to address phantom limb pain when responding to questions unless otherwise specified. Each time the questionnaire was applied, participants were instructed to respond for the previous 3 days.19 Exceptions included day 1 for both the initial and crossover treatments, because at these time points, the interest was in participants’ experiences subsequent to the intervention. At these time points, participants were instructed to respond for the period since the intervention the previous day.
The primary instrument was the Brief Pain Inventory (short form), which assesses pain and its interference with physical and emotional functioning.20 The form includes three domains: (1) pain, with four questions using an Numeric Rating Scale to evaluate 4 pain levels: “current,” “least,” “worst,” and “average”; (2) percentage of relief provided by pain treatments with one question; and (3) interference with physical and emotional functioning using a 0 to 10 scale (0 = no interference; 10 = complete interference). The seven interference questions involve general activity, mood, walking ability, normal work activities (both inside and outside of the home), relationships, sleep, and enjoyment of life.20 The seven functioning questions can be combined to produce an interference subscale (0 to 70). The use of both single items (e.g., mood) and the composite scores are supported by the Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials recommendations for assessing pain in clinical trials.17,21 Because phantom limb and residual limb (“stump”) pain have been correlated, the latter was assessed with the same four pain intensity questions.
To provide a global measure of worsening or improvement, the Patient Global Impression of Change was administered, allowing patient evaluation of integrated treatment effects.17 This measure is a seven-point ordinal scale requiring the patient to rate the current intensity of phantom limb pain compared to their pretreatment baseline: 1 for “very much worse” to 7 for “very much improved” (4 is “no change”). Additional psychosocial factors were evaluated using the Beck Depression Inventory, a 21-item instrument measuring characteristic symptoms and signs of depression.22 Each of the 21 factors is rated on a 0 to 3 scale, and then summed to produce the total score of 0 to 63. Mild, moderate, and severe depression are defined with scores of 10 to 18, 19 to 29, and 30 to 63, respectively.23 Last, the frequency and average duration of nonpainful phantom sensations as well as phantom and residual limb pain were assessed.
Statistical Analysis
Treatment group assignment was unmasked only after completion of the statistical analysis. We used descriptive statistics to compare the treatment groups for baseline variables. Groups were considered well balanced on a particular baseline variable if the absolute standardized difference (difference in means, mean ranks or proportions divided by the pooled SD) was less than = 0.46, where n1 and n2 are the per-group sample sizes.24 All analyses were modified intention-to-treat, in which all randomized participants who received any of the study treatment were included and retained in their respective treatment groups.25 CIs were adjusted for the group sequential design with overall alpha of 0.05, such that 95.6% CIs are reported throughout (referred to as “95% CI”). The study was designed with 90% power to detect a mean change of 1.7 or more on the Numeric Rating Scale for “average phantom pain” while adjusting for three interim analyses. Missing data were imputed using last observation carried forward for the primary outcome and using multiple imputation for secondary outcomes and sensitivity analysis on the primary outcome.
Aim 1: Primary Outcome
We assessed the average causal effect of cryoneurolysis (active) versus sham or placebo on phantom limb pain intensity (average pain during previous 72 h) at 4 months after the initial treatment using analysis of covariance to adjust for clinical site, baseline average pain intensity, clinical site, and any imbalanced baseline variable. We also assessed the treatment effect on the change from baseline average pain intensity (instead of adjusting for baseline pain score) in an analogous linear regression model. As a sensitivity analysis, we assessed the median difference (95% CI) of active versus placebo using the Hodges–Lehmann estimator of location shift and compared groups with a Wilcoxon rank sum test stratified by study site. We also assessed the treatment–by–clinical site interaction in the linear regression models.
Assessing Treatment Effect Heterogeneity
We assessed whether the treatment effect on the primary outcome (phantom limb pain intensity during the past 72 h) varied across levels of specific baseline variables (besides clinical site) using linear regression as in the primary analysis and testing the treatment-by-covariate interaction. We assessed treatment effect heterogeneity across level of sex, body mass index, amputation level, phantom pain 20 min after the second lidocaine injection, and baseline average phantom and residual limb pain, with a predetermined significance criterion of P < 0.10 for the interaction, without correction made for these multiple covariate analyses. For the last three variables, which are continuous/ordinal pain scores, the interaction was assessed on a continuous scale, although the results are shown in the forest plot dichotomizing pain into mild (Numeric Rating Scale 3 or less) versus moderate to severe (Numeric Rating Scale greater than 3).
Secondary Outcomes (at 4 Months)
Randomized groups were compared at 4 months on the global measure of improvement (Patient Global Impression of Change scale; Aim 2a) using the Wilcoxon rank-sum test and Hodges–Lehman estimation of location shift, stratified by study site. We used a mixed-effects regression model with a fixed effect for treatment and an unstructured correlation matrix adjusted for study site and baseline pain interference components to assess the treatment effect across the seven components of the Brief Pain Inventory pain interference (Aim 2b). Randomized groups were compared on the Beck Depression Inventory using the Wilcoxon rank-sum test and estimating the treatment effect using the Hodges–Lehmann estimator of location shift, stratified by study site (Aim 2c).
The crossover treatment 4 to 6 months after the initial intervention allowed all participants the opportunity to receive the study treatment, but because it was optional also introduced selection bias from this time point forward. For crossover patients, we assessed the treatment effect using a linear mixed effects regression model with a fixed effect for treatment and random effect for patient, adjusted for treatment sequence and period. We tested for evidence of differential carryover effect with the treatment-by-period interaction. We also descriptively assessed (no treatment effects were estimated) the change from the initial baseline to 12 months for the initial active and sham participants for those who both did and did not receive the crossover treatment.
Interim Analyses
We conducted interim analyses to assess efficacy (rejecting null) and futility (rejecting alternative) at each 25% of the maximum enrollment using a group sequential procedure. Specifically, a gamma spending function was used with parameters –4 and –2 for efficacy and futility, respectively.26 Thus, boundaries at the first through fourth analyses for efficacy (futility in parentheses) were P ≤ 0.0016 (P > 0.9572), P ≤ 0.0048 (P > 0.7186), P ≤ 0.0147 (P > 0.2389), and P ≤ 0.0440 (P > 0.0440) (Supplemental Table A, https://links.lww.com/ALN/C958, and Supplemental Figure A, https://links.lww.com/ALN/C957).
Type I Error and Gatekeeping
We designed the study to use a parallel gatekeeping procedure to control the study-wide type I error at 0.05.27 For this procedure, we therefore a priori prioritized the study outcomes into ordered sets, as Aim 1, Aim 2a, Aim 2b, and then Aim 2c. Analysis would proceeded in that order, and testing would proceed through each “gate” to the next set if and only if at least one outcome in the current set reached significance. The significance level for each set would be 0.05 times a cumulative penalty for nonsignificant results in previous sets (i.e., a “rejection gain factor” equal to the cumulative product of the proportion of significant tests across the preceding sets). Within a set, a multiple comparison procedure (Bonferroni correction) was planned to control the type I error at the appropriate level.
Sample Size Considerations
Our sample size estimate was based on the primary specific aim of whether the addition of cryoneurolysis decreases phantom limb pain intensity resulting from an amputation compared with current standard-of-care treatment at 4 months after cryoneurolysis. Receiver operating characteristic curve analyses demonstrated that changes from baseline of at least 1.7 along a 10-point Numeric Rating Scale accurately identified patients who rated improvements as “much improved” or more, compared with those who perceived no change or worsening after analgesic interventions.28 Multiple additional studies confirm this degree of reduction as clinically meaningful to individual patients with chronic pain.29–31 Although meaningful group differences in the mean change would be somewhat smaller than important changes for individuals, we took a very conservative approach and powered our study to be able to detect group differences in mean change from baseline of 1.7 points or more on the Numeric Rating Scale.
Based on a conservative SD estimate for each group of 3.0 at 4 months, a correlation of 0.50 between baseline and follow-up Numeric Rating Scale, a two-sided test at the 0.05 significance level, power of 0.90, and four equally spaced analyses (three interim and one final, as needed), a maximum of 72 participants in each group (n = 144 total) was required. The expected sample size for this group sequential design (i.e., average sample size over thousands of such trials, stopping when a boundary is crossed) was a total of 100 under the alternative and 102 under the null hypotheses. East 5.3 software (Cytel Inc., USA) was used for sample size calculations and all analyses.
Results
Between March 2018 and March 2021, a total of 144 patients were enrolled at six medical centers (fig. 1). Phantom limb pain fell from a median [quartiles] of 4.0 [2.0, 6.0] immediately before the initial single-injection lidocaine bolus to 0 [0, 3.0] for the active group and 0 [0, 2.0] for the placebo group 20 min after the bolus. No participant experienced an increase in limb pain in the 20 min after the lidocaine injections. Therefore, all participants were randomized to either active treatment with cryoneurolysis (n = 71) or sham/placebo (n = 73). Regarding baseline characteristics, all the variables were balanced between the two randomized groups with absolute standardized difference 0.33 or less (table 1).
Primary Outcome
Pretreatment phantom pain scores were a median [quartiles] of 5.0 [4.0, 6.0] for active treatment (cryoneurolysis) and 5.0 [4.0, 7.0] for sham or placebo. At 4 months, average phantom limb pain scores were 4.3 [1.5, 6] for active and 4.5 [2, 6] for placebo, with estimated difference in means (95% CI) of –0.1 (–1.0 to 0.7), P = 0.759, adjusting for baseline pain score and clinical site while using last-observation-carried-forward (for n = 1 cryoneurolysis and n = 7 placebo patients); the futility boundary was crossed (Supplemental Figure A, https://links.lww.com/ALN/C957). We also assessed change from baseline: pain intensity decreased by 0.5 [-0.5, 3.0] in patients given cryoneurolysis (n = 71) versus 0 [0, 3] in patients given sham (n = 73): estimated difference (95% CI) –0.1 (–1.0 to 0.7), P = 0.759. Finally, the nonparametric Hodges–Lehman estimator comparing active and placebo on 4-month average phantom limb pain scores gave a similar result, with median difference (95% CI) of –0.25 (–1, 0.5), P = 0.565.
Treatment Effect Heterogeneity
There was little notable evidence of treatment effect heterogeneity across levels of most of the selected baseline (prerandomization) variables, except for amputation level (interaction P = 0.003, fig. 2). Active cryoneurolysis was better for a transtibial amputation level, but worse for transfemoral and ankle/foot amputations (table 2).
Assessing Interactions between Treatment and Specified Baseline Factors on Primary Outcome of Month 4 Average Phantom Limb Pain

Forest plot assessing interactions between prespecified baseline factors and the effect of ultrasound-guided percutaneous cryoneurolysis on phantom limb pain at 4 months. *P value was estimated from continuous pain score by multivariable linear regression adjusted for study site and day 0 average phantom limb pain.
Forest plot assessing interactions between prespecified baseline factors and the effect of ultrasound-guided percutaneous cryoneurolysis on phantom limb pain at 4 months. *P value was estimated from continuous pain score by multivariable linear regression adjusted for study site and day 0 average phantom limb pain.
Gatekeeping Rules
Since the primary outcome was not significant, based on our a priori statistical plan to use a parallel gatekeeping approach to control study-wide type I error at 5%, we cannot make an inference on any of the secondary endpoints. Therefore, secondary outcome results are given in the form of estimated difference and CI (not P value), but we do not make any formal inference or conclusions on them—only on the primary outcome.
Secondary Endpoints
Using the 1 to 7 Global Impression of Change Scale at month 4, participants who received active treatment rated their phantom pain as a median of 4 (“no change”) [4, 7] versus 4 (“no change”) [4, 6] for placebo participants with an estimated median difference (95% CI) of 0 (0 to 0) at 4 months (Aim 2A). Using the Brief Pain Inventory interference subscale to measure pain’s interference with physical and emotional functioning at month 4, patients who received active cryoneurolysis scored 23 [0, 39] versus 22 [3, 34] for sham, with a median difference (95% CI) of 0 (–5 to 6; Aim 2B, table 3, fig. 3). The mixed effects model suggested no treatment-by-component interaction, and the estimated difference in means (95% CI) [scale 0 to 10] was 0.2 (–0.5 to 0.9). Using the Beck Depression Inventory (Aim 2C), participants receiving active treatment reported a median change from baseline of –2 [–7, 0] versus –2 [–5, 0] for sham, with a difference (95% CI) of 1 (–1 to 3). Descriptively, cryoneurolysis did not demonstrably improve phantom and residual limb pain outcomes at any time point compared with the sham treatment (table 3, figs. 4 and 5).
Effects of ultrasound-guided percutaneous cryoneurolysis (denoted in green) on the Brief Pain Inventory interference domain over time. Data expressed as pain’s interference on each of seven components (higher scores indicate more interference) demarked as median (dark horizontal bars) with 25th to 75th percentiles (box), 10th to 90th percentiles (whiskers), mean (diamonds), and outliers (circles). Following our statistical gatekeeping protocol, we do not make inferences or draw conclusions on the secondary endpoints, since no difference was found on the primary endpoint.
Effects of ultrasound-guided percutaneous cryoneurolysis (denoted in green) on the Brief Pain Inventory interference domain over time. Data expressed as pain’s interference on each of seven components (higher scores indicate more interference) demarked as median (dark horizontal bars) with 25th to 75th percentiles (box), 10th to 90th percentiles (whiskers), mean (diamonds), and outliers (circles). Following our statistical gatekeeping protocol, we do not make inferences or draw conclusions on the secondary endpoints, since no difference was found on the primary endpoint.
Effects of ultrasound-guided percutaneous cryoneurolysis (denoted in green) on worst, average, least, and current phantom limb pain over time (primary outcome: average phantom limb pain at 4 months). Pain intensity indicated using a numeric rating scale of 0 to 10, with 0 equal to no pain and 10 being the worst imaginable pain. Data expressed as median (dark horizontal bars) with 25th to 75th percentiles (box), 10th to 90th percentiles (whiskers), mean (diamonds), and outliers (circles). Following our statistical gatekeeping protocol, we do not make inference or conclusions on the secondary endpoints, since no difference was found on the primary endpoint.
Effects of ultrasound-guided percutaneous cryoneurolysis (denoted in green) on worst, average, least, and current phantom limb pain over time (primary outcome: average phantom limb pain at 4 months). Pain intensity indicated using a numeric rating scale of 0 to 10, with 0 equal to no pain and 10 being the worst imaginable pain. Data expressed as median (dark horizontal bars) with 25th to 75th percentiles (box), 10th to 90th percentiles (whiskers), mean (diamonds), and outliers (circles). Following our statistical gatekeeping protocol, we do not make inference or conclusions on the secondary endpoints, since no difference was found on the primary endpoint.
Effects of ultrasound-guided percutaneous cryoneurolysis (denoted in green) on worst, average, least, and current residual limb pain over time. Pain intensity indicated using a numeric rating scale of 0 to 10, with 0 equal to no pain and 10 being the worst imaginable pain. Data expressed as median (dark horizontal bars) with 25th to 75th percentiles (box), 10th to 90th percentiles (whiskers), mean (diamonds), and outliers (circles). Following our statistical gatekeeping protocol, we do not make inference or conclusions on the secondary endpoints, since no difference was found on the primary endpoint.
Effects of ultrasound-guided percutaneous cryoneurolysis (denoted in green) on worst, average, least, and current residual limb pain over time. Pain intensity indicated using a numeric rating scale of 0 to 10, with 0 equal to no pain and 10 being the worst imaginable pain. Data expressed as median (dark horizontal bars) with 25th to 75th percentiles (box), 10th to 90th percentiles (whiskers), mean (diamonds), and outliers (circles). Following our statistical gatekeeping protocol, we do not make inference or conclusions on the secondary endpoints, since no difference was found on the primary endpoint.
Crossover Treatments
The crossover treatment administered 0 to 2 months after the measurement of the primary outcome was optional, resulting in selection bias on patients who did not cross over and, on those who did cross over, potential interference with the longer-term effects of the initial treatment. Therefore, outcomes after the 4-month time point are reported descriptively only. Ninety-one patients participated in the crossover phase, receiving either an active (n = 49) or sham (n = 42) treatment (Supplemental Table B, https://links.lww.com/ALN/C959). Active treatment appeared to be similar to sham on 4 months average phantom limb pain intensity, pain’s interference on physical and psychologic functioning, and Patient Global Impression of Change (Supplemental Table C, https://links.lww.com/ALN/C960). The period–by–treatment interaction P value of 0.04 suggested that there was some evidence of differential carryover effect between the first and second periods. These results would be generalizable to patients like those who chose to receive the crossover, which may differ from the main trial population. As well, active treatment had a larger reduction from baseline in average phantom limb pain intensity compared to placebo, with a mean difference (95% CI) of –1 (–2 to –0.5). The variability in the individual causal effects of active versus placebo as measured by the SD of the individual treatment effects was 1.3. Outcomes at 12 months after randomization did not appear to differ between treatment groups (table 4).
Serious Adverse Events and Major Protocol Deviations
There were two deaths within the year after treatment, neither determined to be related to study participation: one myocardial infarction and one related to COVID-19 infection with severe acute respiratory syndrome. One participant developed dementia of unknown etiology within the 6 months after his initial treatment per an adult child’s report. The only adverse event deemed related to study participation was a woman with a transtibial amputation who first received a sham treatment and subsequently crossed over with an active treatment that was performed just distal to the inguinal ligament for the femoral nerve. This protocol deviation resulted in profound quadriceps femoris weakness and some insensate areas of skin on the medial thigh. These deficits resolved slowly until complete resolution after 12 to 15 months. However, 3 months after the active crossover treatment, she fell while climbing stairs and fractured a clavicle, which required three subsequent surgical fixation procedures.
Discussion
This multicenter, randomized, sham-controlled trial failed to identify a benefit in treating established postamputation phantom limb pain with ultrasound-guided percutaneous cryoneurolysis. This is a somewhat surprising and disappointing finding considering that cryoneurolysis has been used to treat postamputation pain for decades with favorable outcomes reported in uncontrolled case series.11–13 We can only speculate on the reasons for these contrasting findings.
The most obvious potential explanation is that cryoneurolysis does not, in fact, result in lasting, measurable analgesic benefits, and previous reports of posttreatment improvement in uncontrolled series are due to a placebo effect, selective reporting, and/or natural resolution of pain over time.11–13 As possible evidence of a placebo effect, 29% of the sham group experienced a decrease in pain score of at least 1.7, the threshold we prospectively defined as the smallest clinically relevant improvement for individuals based on previously published data (similar to the 36% who had received active cryoneurolysis).28
It is also possible that the improvement in about one third of all patients was not a placebo response to the study intervention, but rather a consequence of the single injection of local anesthetic administered to all participants immediately before the study intervention.6 This would help to explain why participants who chose to cross over did not experience the analgesic benefits of those who responded to the initial treatment and who therefore, presumably, chose not to undergo the crossover treatment. While a possible explanation for our results during the first few months, it is doubtful that a single injection of lidocaine is responsible for the finding that most of these “responders” reported continued improvement after 12 months.6,8
Alternatively, our negative results may be explained by the locations where we applied cryoneurolysis. In the subset of patients with a transtibial amputation—a majority of participants (n = 92)—cryoneurolysis was associated with an improved outcome compared with sham at 4 months (P = 0.003 overall; pairwise comparisons: transtibial vs. ankle, P = 0.017; transtibial vs. transfemoral, P = 0.006). Conversely, patients with a transfemoral or foot/ankle amputation who received active treatment fared worse than their sham counterparts. These could be spurious findings (type I error), but it is worth exploring given that medical progress is usually iterative, and three major differences between transtibial and transfemoral amputations may help inform future research: (1) duration of effect, (2) impact on abnormal input from the peripheral to central nervous system, and (3) target nerve cross-sectional area.
Regarding the first—a reduced duration of a treatment effect for transfemoral amputations—it is important to note that the optimal point for cryoneurolysis along a target nerve remains unknown. We chose to treat both the sciatic and femoral nerves at the most distal locations clearly visualized with ultrasound as low as the bifurcation of the sciatic nerve and medial femoral epicondyle for the saphenous nerve. Our reasoning was that a more proximal lesion could increase sensory, motor, and proprioception deficits in the residual limb, increasing the risk of falls when using a prosthesis for the entire treatment effect duration, usually measured in months. Of 144 participants, the only serious adverse event deemed related to study participation may be seen as supporting this decision: an investigator chose to treat the femoral nerve at the inguinal ligament for a patient with a transtibial amputation instead of more distally at the medial femoral epicondyle, resulting in profound quadriceps femoris muscle weakness lasting more than 1 yr and possibly contributing to a fall 3 months after treatment.
However, using a distal cryoneurolysis location for the remaining participants likely decreased the duration of effect for transfemoral amputations. The primary determinant of cryoneurolysis duration is a function of the distance between the cryolesion and nerve endings, with nerves regrowing at approximately 1 to 2 mm/day.32 Therefore, if our theory that block duration and analgesic benefits are correlated, the short cryolesion–nerve ending distance for transfemoral amputations would greatly decrease both the duration of cryoneurolysis effects and the impact on phantom limb pain. In contrast, because we did not apply cryoneurolysis distal to the medial femoral epicondyle, for transtibial amputations there was a greater length of remaining nerve distal to the cryolesion, resulting in an increased treatment effect duration and therefore possibly analgesic effects.12
The second difference between the two amputation locations—reduced impact on abnormal input from the peripheral to central nervous systems—is also based on our chosen protocol. We did not treat the obturator or posterior femoral cutaneous nerves, which contribute to the innervation at the level of transfemoral amputations, so the coverage provided by the cryoneurolysis intervention was inherently incomplete. In contrast, afferent sensory input for transtibial amputations is carried by the two nerves we did treat—the sciatic nerve and saphenous branch of the femoral nerve. Effective treatment for transfemoral amputations may require the administration of cryoneurolysis to all nerves innervating the lower extremity.
The third difference between the two amputation locations is the target nerve cross-sectional area, which is larger the more proximal within the lower extremity. The premise of our study hypothesis is that phantom limb pain is at least partially sustained by abnormal input from the peripheral to the central nervous systems. Therefore, interrupting the abnormal input with cryoneurolysis requires a thorough neural lesion with a prolonged duration. Reducing the temperature of a nerve below –20°C (but not colder than –100°C) results in a Sunderland second-degree nerve injury characterized by a reversible degeneration of axons known as Wallerian degeneration.33 In contrast, temperatures warmer than –20°C result in a first-degree nerve injury,34 which induces a shorter, unpredictable neuropraxia that can itself result in dysesthesias.35 In other words, an inadequate freeze can actually induce pain. In our study, while treating physicians visualized the ice ball with ultrasound to ensure it encompassed the entire nerve, there is no guarantee that the entire sphere of ice cooled below –20°C, possibly inducing a variable duration neuropraxia that could result in analgesia initially but increase pain subsequently.36 In addition, without Wallerian degeneration, the entire nerve could have remained functional since myelinated fibers can conduct “over” small lesions: while a lesion length of 3 to 6 mm is adequate to severe conduction in laboratory animals, the length in humans remains unknown.37 Since the sciatic nerve is the largest in the human body, cryoneurolysis of more proximal application for the transfemoral amputations may have resulted in incomplete cryolesions. Supporting this theory is our seemingly counterintuitive finding that patients with a transfemoral amputation who received active treatment fared worse than their sham counterparts.
Evidence contradicting these last two interpretations is that amputations at or below the ankle had the same cryoneurolysis administration levels as transtibial yet were not associated with improved outcomes. Possibly explaining this apparent contradiction is that there were only nine ankle cases, and therefore this finding may itself be spurious, with confidence in the result far lower than for the transfemoral (n = 43) and transtibial (n = 92) subgroups.
Limitations
The major limitation of our trial is the reality that the optimal cryoneurolysis treatment parameters such as duration of freeze, duration of thaw, number of freeze and thaw cycles, freeze temperature, probe design, and anatomic treatment location all remain unknown.10 The specific technique used in our trial was based on published (successful) pilot studies and decades of previous clinical experience,9,11,12 but whether other techniques might have different effects is unknown.
A second limitation is the local anesthetic administered before the study intervention for both active cryoneurolysis and sham treatment groups. As such, even participants who underwent the sham procedure had a single-injection peripheral nerve block, which may decrease postamputation pain for up to a few weeks.38 We provided local anesthetic to participants who would receive the active treatment because (1) it negates the discomfort experienced by many patients undergoing cryoneurolysis; and (2) we wanted to confirm that a peripheral nerve block would not induce paradoxical pain: a rare response, but one that could result in months of increased pain after a cryoneurolysis procedure (which we did not observe).39 We provided the local anesthetic block to patients who would receive sham to retain masking of treatment group assignment: we presumed that patients who experienced absolutely no sensory changes during the (sham) study intervention without a peripheral nerve block would assume they had received the sham. While this protocol does not decrease confidence in our primary outcome—we can conclude that the addition of cryoneurolysis failed to improve pain outcomes 4 months after treatment—it does make interpretation of our negative results and designing subsequent research more challenging.
In summary, ultrasound-guided percutaneous cryoneurolysis did not reduce phantom limb pain 4 months after treatment. Although we do not make inferences or draw conclusions on the secondary endpoints due to our gatekeeping procedure and negative primary outcome, assessment of treatment effect heterogeneity remains important. Exploratory post hoc analysis revealed that treatment effect after 4 months was associated with the level of amputation, with transtibial amputation responsive to cryoneurolysis as opposed to ankle/foot and transfemoral, which fared worse than sham. The reasons for this difference remain unclear, and future research is warranted.
Acknowledgments
The authors appreciate the invaluable assistance of the members of the uncompensated Data Safety Monitoring Board, without whom this study would not have been possible: Salim M. Hayek, M.D., Ph.D. (Department of Anesthesiology, Case Western Reserve University, Cleveland, Ohio); Anupama N. Wadhwa, M.B.B.S., M.Sc., F.A.S.A. (Department of Anesthesiology, University of Texas Southwestern, Dallas, Texas); and Gerald Beck, Ph.D. (Department of Quantitative Health Sciences, Cleveland Clinic, Cleveland, Ohio).
Research Support
The U.S. Army Medical Research Acquisition Activity (Fort Detrick, Maryland) is the awarding and administering acquisition office. This work was supported by the Assistant Secretary of Defense for Health Affairs endorsed by the Department of Defense (Washington, D.C.), through the Congressionally Directed Medical Research Program under award No. W81XWH-17-2-0051 (PR160263). The project was also partially supported by the National Institutes of Health (Bethesda, Maryland), grant UL1TR001442. Opinions, interpretations, conclusions, and recommendations are those of the authors and are not necessarily endorsed by the funding entities. None of the authors has a personal financial interest in this research.
Competing Interests
Drs. Ilfeld and Gabriel report that the University of California San Diego (San Diego‚ California) has received funding and/or equipment for other research projects from Myoscience (Fremont, California), Epimed International (Farmers Branch, Texas), Infutronics (Natick, Massachusetts), Avanos (Irvine, California), and SPR Therapeutics (Cleveland, Ohio). Dr. Trescot has served on an advisory board for Atricure (Mason, Ohio) and is the Chief Medical Officer for Stimwave Technologies (Pompano, Florida). Dr. Cohen receives research funding (paid to the institution) from Avanos, and research funding (paid to the institution) from Scilex (Palo Alto‚ California) for a randomized controlled trial evaluating a local anesthetic patch for neck pain, and as a consultant for a randomized trial evaluating a steroid for epidural injection. He serves as a consultant for SPR (Cleveland‚ Ohio)‚ and is involved in research projects with their peripheral nerve stimulation system at Walter Reed National Military Medical Center (Bethesda, Maryland). Dr. Sessler is the Chair, Data and Safety Monitoring Board, Neuros Medical Quest (Land O' Lakes‚ Florida) trial. He is a consultant for Pacira Pharmaceuticals (Parsippany-Troy Hills, New Jersey), and this company funds trials in the Department of Outcomes Research‚ The Cleveland Clinic (Cleveland‚ Ohio). Dr. Prologo is a consultant and research grant recipient for Boston Scientific (Natick, Massachusetts), a consultant for Varian Medical Systems (Crawley, United Kingdom), and cofounder, Focused Cryo, Inc. (Smyrna, Georgia). The other authors declare no competing interests.
Reproducible Science
Deidentified patient-level data will be shared for collaborative analyses on request to Dr. Ilfeld (bilfeld@health.ucsd.edu) shortly after publication. The data dictionary and statistical tables and code will be provided as appropriate; a data-sharing contract will be required. The protocol is available by request.
Supplemental Digital Content
Supplemental Figure A, https://links.lww.com/ALN/C957
Supplemental Table A, https://links.lww.com/ALN/C958
Supplemental Table B, https://links.lww.com/ALN/C959
Supplemental Table C, https://links.lww.com/ALN/C960
References
Appendix. Data Safety Monitoring Board and Recruiting Site Investigators
Data Safety Monitoring Board
Salim M. Hayek, M.D., Ph.D. (Chair, Medical Monitor and Patient Advocate)
Case Western Reserve
Cleveland, Ohio
Anupama N. Wadhwa, M.D.
University of Texas, Southwestern
Dallas, Texas
Gerald J. Beck, Ph.D. (statistician)
Cleveland Clinic
Cleveland, Ohio
Recruiting Site Investigators (Perineural Anesthesia Infusion Network for Research and Education Investigators)
No conflicts to report unless otherwise noted.
Baharin Abdullah, M.D. (National Program Manager)
University of California-San Diego
San Diego, California
Conflics of interest: The University of California has received funding and/or equipment for other research projects from Myoscience (Fremont, California), Epimed (Farmers Branch, Texas), Infutronics (Natick, Massachusetts), Avanos (Irvine, California), and SPR Therapeutics (Cleveland, Ohio).
Cleveland Clinic
Cleveland, Ohio
Wael Ali Sakr Esa, M.D., Ph.D., M.B.A.
Ece Yamak Altinpulluk
Omer Bakal
Mauro Bravo
Qiliang Jiang
Zafeer Khan
Kamal Maheshwari, M.D., M.P.H.
Loran Mounir-Soliman, M.D.
Ahmed Salih
Naval Medical Center San Diego
San Diego, California
Robert J. Hackworth, M.D.
Carter Sigmon, M.D.
Stanford University
Stanford, California
Jennifer Padwal, M.D., M.S.
University of Florida
Gainesville, Florida
Amy M. Gunnett, R.N., C.C.R.C.
Amanda Slater
Walter Reed National Military Medical Center
Bethesda, Maryland
Akshay Thaper, B.S.
Morgan Hunt, B.S.