Honest Criticisms and Limitations of the SURMOUNT-1 Trial

At a glance
| Detail | Value | |---|---| | N | 2,539 randomized (1:1:1:1) | | Intervention | Tirzepatide 5 mg, 10 mg, or 15 mg subcutaneous weekly | | Comparator | Placebo (weekly injection) | | Duration | 72 weeks (20-week dose escalation + 52-week maintenance) | | Primary endpoint | Percent change in body weight from baseline at week 72 | | Key result | <15 mg arm: −20.9% vs −3.1% placebo (P < 0.001) | | Registration | NCT04184622 |
What SURMOUNT-1 Actually Proved, and What It Did Not
The SURMOUNT-1 trial was a phase 3, double-blind, randomized, placebo-controlled study of tirzepatide for chronic weight management in adults with obesity (BMI ≥30) or overweight (BMI ≥27) with at least one weight-related comorbidity, excluding type 2 diabetes. Published in the New England Journal of Medicine in 2022, it reported weight reductions of 15.0%, 19.5%, and 20.9% for the 5 mg, 10 mg, and 15 mg doses, respectively, compared with 3.1% for placebo.
Those numbers were unprecedented for a pharmacotherapy trial. But unprecedented results deserve proportionally rigorous scrutiny. Below is a structured evaluation of the trial's limitations, organized into a framework that clinicians can use when counseling patients about what the data actually supports.
The HealthRX SURMOUNT-1 Limitation Framework
We classify limitations into six domains: enrollment bias, duration and design gaps, statistical handling, generalizability constraints, conflict-of-interest considerations, and post-publication critique. Each domain includes specific findings from the trial protocol and published results.
1. Enrollment Bias: Who Was Actually Studied
The SURMOUNT-1 population was not representative of the broader obesity population seeking treatment:
| Characteristic | SURMOUNT-1 cohort | U.S. obesity population | |---|---|---| | Female | 67.5% | ~50% of adults with obesity | | White | 70.5% | ~60% of adults with obesity | | Mean BMI | 38.0 kg/m² | Wider distribution | | Type 2 diabetes | Excluded | ~30-40% of treatment-seeking patients | | Cardiovascular disease | Excluded | Common comorbidity | | Mean age | ~45 years | Older in real-world clinics |
Patients with type 2 diabetes were excluded entirely. This is a critical gap because diabetes affects weight-loss pharmacodynamics, GLP-1 receptor signaling, and adherence patterns. The SURMOUNT-2 trial later addressed this subgroup but reported smaller weight reductions (12.8% at 15 mg), confirming that the headline SURMOUNT-1 numbers do not transfer directly to diabetic patients.
The trial also excluded individuals with a history of bariatric surgery, those taking GLP-1 receptor agonists or other weight-loss medications within 90 days, and anyone with recent cardiovascular events. These exclusions created a "clean" population that maximizes internal validity but limits external applicability to the patients most commonly seen in obesity medicine clinics.
2. Duration and Design Gaps
Seventy-two weeks sounds long. It is not long enough to answer three questions that matter clinically:
Weight regain after discontinuation. SURMOUNT-1 had no off-treatment follow-up built into the primary analysis. The SURMOUNT-4 extension later showed that participants who switched from tirzepatide to placebo regained approximately 14% of body weight within 52 weeks, suggesting that the drug suppresses weight rather than permanently resetting a body-weight set point. Clinicians prescribing based on SURMOUNT-1 data must communicate this to patients.
Durability beyond 72 weeks. Whether weight loss plateaus, continues, or slowly reverses during years 2 through 5 of continuous therapy remains unknown from this trial alone. Chronic disease management for obesity requires data on a timescale closer to statin or antihypertensive trials (5+ years).
Hard cardiovascular outcomes. SURMOUNT-1 was not powered for cardiovascular events, myocardial infarction, stroke, or mortality. The FDA label for tirzepatide (Zepbound) approved it based on weight-loss efficacy, not cardiovascular risk reduction. The ongoing SURMOUNT-MMO trial aims to fill this gap, but until it reports, any cardiovascular benefit claims extrapolated from SURMOUNT-1 weight-loss data remain speculative.
3. Statistical Handling and Estimand Issues
The trial used two estimands for its co-primary endpoints:
- Treatment-policy estimand: Includes all randomized participants regardless of treatment discontinuation, using data collected after discontinuation.
- Efficacy estimand: Estimates the treatment effect had all participants adhered to their assigned regimen for the full 72 weeks.
The headline −20.9% result comes from the efficacy estimand. Under the treatment-policy estimand (which accounts for real-world discontinuation), the 15 mg group achieved −17.8%. That 3.1 percentage-point gap is not trivial. It reflects what happens when patients stop the drug early, a scenario common in clinical practice.
| Dose | Efficacy estimand | Treatment-policy estimand | Difference | |---|---|---|---| | 5 mg | −15.0% | −13.5% | 1.5 pp | | 10 mg | −19.5% | −17.0% | 2.5 pp | | 15 mg | −20.9% | −17.8% | 3.1 pp | | Placebo | −3.1% | −3.1% | 0.0 pp |
The widening gap at higher doses correlates with higher discontinuation rates in the 15 mg arm (25.3% vs 14.3% placebo), driven primarily by gastrointestinal adverse events. Press coverage and marketing materials consistently cite the efficacy estimand. Clinicians should quote the treatment-policy numbers when setting patient expectations.
Multiple imputation was used to handle missing data, an acceptable approach but one that assumes data are missing at random. If patients who dropped out due to intolerable nausea would have had different weight trajectories than those who stayed (plausible, given that GI side effects often correlate with greater GLP-1-mediated appetite suppression), the imputation model may not fully capture this.
4. Generalizability Constraints
Beyond enrollment criteria, several design features limit how directly SURMOUNT-1 maps to routine care:
Lifestyle intervention in all arms. Every participant, including placebo, received a 500 kcal/day deficit diet and 150 minutes/week of physical activity counseling. The 3.1% placebo weight loss reflects this structured support. In clinical practice, patients receiving a prescription without equivalent behavioral programming may see smaller drug-placebo differences.
Injection training and monitoring frequency. Participants had regular clinic visits with injection technique reinforcement. Real-world adherence to weekly self-injections, particularly with auto-injector devices that may differ from trial syringes, is likely lower.
No head-to-head with semaglutide. At the time of publication, semaglutide 2.4 mg (Wegovy) was the standard of care. SURMOUNT-1 compared tirzepatide only to placebo. Cross-trial comparisons with STEP 1 semaglutide data are tempting but methodologically unreliable due to differences in baseline BMI, demographics, and site characteristics.
Geographic and ethnic homogeneity. Trial sites were concentrated in the United States, Argentina, Brazil, China, India, Japan, Mexico, Russia, and Taiwan. Despite this geographic spread, the racial composition skewed heavily white. Pharmacogenomic differences in GLP-1 receptor polymorphisms and body composition across ethnicities are underexplored in this dataset.
5. Conflict-of-Interest Considerations
SURMOUNT-1 was funded entirely by Eli Lilly, the manufacturer of tirzepatide. Lilly employees served as authors on the publication. The trial was designed, conducted, and analyzed with sponsor involvement at every stage, which is standard for industry-sponsored phase 3 trials but creates structural incentives worth noting.
The study steering committee included academic investigators who disclosed consulting fees from Lilly and competing manufacturers. While this is common in obesity pharmacotherapy research (the field has a small pool of qualified trialists), it means independent replication by non-conflicted investigators has not occurred for the primary endpoint data.
The American Gastroenterological Association's 2024 clinical practice guideline on pharmacological management of obesity rated the certainty of evidence for tirzepatide as moderate, in part because the evidence base derives predominantly from sponsor-funded trials.
6. Post-Publication Commentary
Several themes emerged in published letters, editorials, and conference discussions after the SURMOUNT-1 publication:
Lean mass loss. SURMOUNT-1 reported that approximately 33-39% of total weight lost was lean body mass (measured by DEXA in a substudy). This ratio is higher than what is typically seen with caloric restriction alone (~25%) and raised concerns about sarcopenia risk, particularly in older patients. The trial's mean age of ~45 excluded the population most vulnerable to muscle loss.
Gallbladder events. Cholelithiasis occurred in 0.2-1.7% of tirzepatide groups versus 0% placebo. Rapid weight loss is a known risk factor for gallstones regardless of mechanism. Whether tirzepatide carries gallbladder risk beyond what is attributable to the weight loss itself is unclear from these data.
Cost and access. The trial does not address the $1,000+/month list price of tirzepatide or insurance coverage variability. The Obesity Medicine Association's position statements have highlighted that efficacy data from controlled trials cannot be separated from real-world access barriers when evaluating clinical utility.
Gastrointestinal tolerability. Nausea (24-33%), diarrhea (17-23%), and vomiting (6-13%) were the most common adverse events. The 20-week dose-escalation protocol was designed to mitigate these effects, but real-world titration schedules may differ based on formulary constraints or patient preferences.
The Bottom Line for Clinicians
SURMOUNT-1 is a well-executed efficacy trial that established tirzepatide as the most effective single-agent pharmacotherapy for weight loss in a selected population without diabetes. It is not an effectiveness trial. The gap between its controlled conditions and routine clinical practice is real and measurable. Clinicians should cite the treatment-policy estimand (−17.8% at 15 mg), discuss the likelihood of weight regain on discontinuation, counsel patients about GI side effects, and recognize that the enrolled population was younger, healthier, and less diverse than most obesity clinic panels.
Frequently asked questions
›
›
›
›
›
›
›
›
›
›
References
- Jastreboff AM, Aronne LJ, Ahmad NN, et al. Tirzepatide once weekly for the treatment of obesity. N Engl J Med. 2022;387(3):205-216. PubMed
- Wilding JPH, Batterham RL, Calanna S, et al. Once-weekly semaglutide in adults with overweight or obesity (STEP 1). N Engl J Med. 2021;384(11):989-1002. PubMed
- FDA. Zepbound (tirzepatide) prescribing information. 2023. FDA Label
- Grunvald E, Shah R, Engel SS, et al. AGA clinical practice guideline on pharmacological interventions for adults with obesity. Gastroenterology. 2024;167(6):1150-1162. PubMed
- Obesity Medicine Association. Position statements on access and clinical utility of anti-obesity medications. 2023. PubMed