A 72-year-old woman with coronary disease, an eGFR of 52, and a hemoglobin A1c of 7.6 sits in an endocrinologist’s office. She is on metformin, a GLP-1 agonist, and basal insulin. She feels fine. Her last hypoglycemic episode was three weeks ago, in the parking lot of a Trader Joe’s, and her daughter had to give her juice.
The endocrinologist looks at the A1c and frowns. Seven-point-six. The guideline says less than seven. She adjusts the insulin up.
This scene plays out tens of thousands of times a day in the United States. It is treated as good medicine. It is, in fact, the precise clinical strategy that a 10,251-patient randomized trial showed kills people.
The trial was called ACCORD. It was published in the New England Journal of Medicine in June 2008. It stopped early. It stopped because the patients getting the aggressive treatment were dying.
We have known this for eighteen years.
We have not updated.
Let me give you the trial in numbers, because the numbers matter and they are not difficult.
ACCORD enrolled patients with type 2 diabetes at high cardiovascular risk. Mean age 62. Mean diabetes duration 10 years. Mean baseline A1c 8.1%. Half the patients had established cardiovascular disease.
Half were assigned to intensive glycemic control, targeting an A1c below 6.0%. The other half to standard control, targeting 7.0 to 7.9%. Everything else was identical between arms: blood pressure management, lipid management, antiplatelet therapy. The only thing being tested was the A1c target.
The intensive arm achieved a median A1c of 6.4%. The standard arm achieved 7.5%. By any pharmacologic standard, the intervention worked. The biomarker moved.
After 3.5 years, the Data and Safety Monitoring Board stopped the glycemia arm. The reason was simple. In the intensive arm: 257 deaths. In the standard arm: 203 deaths. A hazard ratio of 1.22, with a confidence interval that excluded 1.0. Cardiovascular death was worse: hazard ratio 1.35.
The primary composite endpoint (nonfatal MI, nonfatal stroke, or CV death) was numerically lower in the intensive arm but not statistically significant. Intensive control reduced nonfatal heart attacks. It increased fatal ones. On the composite, it was a wash. On mortality, it was harm.
This was not a small trial. This was not a fluke. ADVANCE and VADT, the two other intensive-control trials of the era, did not replicate the mortality signal but also did not show benefit. The collective signal across three large trials of intensive A1c lowering in established type 2 diabetes was: no macrovascular benefit, possible harm, definite hypoglycemia, definite weight gain, definite cost, definite patient burden.
The theory that lowering A1c reduces cardiovascular events in established type 2 diabetes was the operating assumption of the field for thirty years. The theory was tested. The theory failed.
And yet.
And yet the FDA still approves diabetes drugs on the basis of A1c reduction. Open the label of any drug approved since 2008. The efficacy claims are A1c. Not mortality. Not MACE. A1c.
There is a regulatory architecture here that survived ACCORD without flinching. The 2008 FDA Guidance for Industry on diabetes drugs, issued in the immediate aftermath of ACCORD and the rosiglitazone meta-analysis, added a requirement for cardiovascular safety trials. It did not change the efficacy bar. A new diabetes drug, in 2026, still gets approved because it lowers a number that we have known for eighteen years does not reliably predict the outcomes patients care about.
I want to be precise about what I am claiming. A1c is a useful biomarker. It predicts microvascular complications quite well: retinopathy, nephropathy, neuropathy. The DCCT and UKPDS established this. I am not saying A1c is meaningless.
I am saying that the field’s response to ACCORD should have been to reconsider whether A1c was an adequate surrogate for the macrovascular outcomes that drive mortality and morbidity in older patients with established type 2 diabetes. Instead, the field’s response was to keep approving drugs on A1c and add a parallel requirement for CV safety trials.
This is what surrogate endpoint capture looks like in the wild. The biomarker is institutionalized. The trial that should have unseated it is reframed as a hypoglycemia story, a rosiglitazone story, an “individualized targets” story: anything except what it is, which is a story about a biomarker that doesn’t deliver on its promise.
The hypoglycemia explanation deserves its own discussion because it is repeated everywhere, even in my fellowship training, and it does not survive contact with the data.
The standard reading of ACCORD is: intensive control caused more severe hypoglycemia, hypoglycemia caused arrhythmias, arrhythmias caused death. It is a clean mechanistic story. It allows the field to say “ACCORD was about hypoglycemia, not about A1c targets per se.” It is also, on the available evidence, probably wrong.
In 2010, the ACCORD investigators published a post-hoc analysis in BMJ. They looked at the relationship between severe hypoglycemia and mortality within each arm. Among patients who experienced severe hypoglycemia, mortality was actually higher in the standard arm than the intensive arm. The straightforward interpretation: severe hypoglycemia marks frailty and high risk. It does not, in any clean way, cause the excess mortality in the intensive arm.
This is the methodological move that should be taught in every fellowship: a post-hoc subgroup analysis that contradicts the convenient mechanistic story. The convenient story is repeated anyway, because it lets everyone keep doing what they were doing. The intensivists can say, “We just need to avoid hypoglycemia.” The guidelines can say, “Individualize, but the target remains seven.”
What actually killed people in ACCORD? We don’t know. Hypoglycemia probably contributes. Rapid glucose lowering may matter. Weight gain (the intensive arm gained 3.5 kg) may matter. Rosiglitazone, used heavily in the intensive arm and independently associated with cardiovascular harm in a contemporaneous meta-analysis, may matter. Polypharmacy and drug interactions in patients on four or five glucose-lowering agents may matter.
We do not know. Eighteen years on. The field has not run the trial that would tell us.
Absence of Evidence is not Evidence of Absence.
Here is the part where I name names, because that is what honest commentary requires.
The American Diabetes Association’s Standards of Medical Care in Diabetes is the most influential single document in American diabetes practice. It is updated annually. It has, since 2009, used some version of the phrase “individualize glycemic targets.”
This sounds reasonable. It is reasonable, on its face. Of course targets should be individualized. Of course a 45-year-old with new-onset diabetes is different from a 78-year-old with three prior MIs.
But “individualize” in practice has meant: keep the seven percent target as the default, and carve out exceptions for the frail. The default did not move. The trial that should have moved the default did not move it.
Compare this to oncology, where a single negative phase 3 trial routinely changes the standard of care within twelve months. Compare it to cardiology, where the COURAGE trial of stenting in stable CAD changed practice: slowly, incompletely, but it changed it. The diabetes field absorbed ACCORD and kept going.
Why? Yes I will ineed speculate... The ADA Standards of Care writing committee includes members with extensive industry funding, disclosed on the Diabetes Journals website. The diabetes drug industry depends on A1c as the efficacy endpoint. A guideline that softened A1c targets would soften the commercial case for new agents. A guideline that aggressively reframed A1c as a microvascular surrogate without macrovascular validity would be commercially catastrophic for an industry that markets primarily to primary care physicians who have been trained for thirty years to treat the number.
I am not alleging conscious bias. I am alleging structural incentive. The structure produces the result whether or not anyone in the room intends it.
There is an irony in the ACCORD story that needs to be told, because it is the one redeeming feature of the regulatory response.
The 2008 FDA guidance required cardiovascular outcome trials for new diabetes drugs. This was a burden. The industry hated it. Trials cost hundreds of millions of dollars. Many feared the requirement would kill diabetes drug development.
It did not kill it. It revealed it.
The CV outcome trials run between 2008 and 2020 included EMPA-REG OUTCOME, LEADER, SUSTAIN-6, CANVAS, DECLARE, REWIND, and others. They demonstrated that SGLT2 inhibitors and GLP-1 receptor agonists, drugs initially approved on A1c, actually reduced cardiovascular events, mortality, and in some cases renal endpoints. The CV benefits of these drugs were not predicted by their A1c effects. They were discovered because regulators forced trials that the industry would never have run on its own.
This is the deep lesson of ACCORD that almost no one tells. The trial’s harm signal forced a regulatory requirement that produced the evidence base for the current treatment paradigm. SGLT2 inhibitors are now first-line for type 2 diabetes with CV or renal disease: not because they lower A1c, but because they reduce MACE and heart failure hospitalizations and slow eGFR decline. We learned this because ACCORD scared the FDA into requiring outcome trials.
And then, in 2020, the FDA softened the CVOT requirement. The justification was that the SGLT2 and GLP-1 trials had established class effects and further trials were unnecessary. Maybe. Or maybe regulatory memory is short and industry pressure is constant, and the requirement that produced the most important advances in diabetes care of the last twenty years was sunset because it was inconvenient.
The next class of diabetes drugs will not have CVOTs. We will not know if they help or harm cardiovascularly. We will approve them on A1c. We will repeat the cycle.
What should the field have done?
It should have stopped approving diabetes drugs on A1c alone in established high-risk patients. It should have required mortality and MACE endpoints. It should have rewritten the ADA Standards of Care to lead with cardiovascular and renal outcome data, with A1c as a secondary efficacy measure relevant primarily to microvascular complications. It should have stopped treating “treat to target” as a clinical philosophy and started treating it as a hypothesis that failed to be proven by ACCORD.
It did none of these things.
It added a parallel safety requirement, hedged the guidelines, blamed hypoglycemia, and kept going.
The 72-year-old woman in the endocrinologist's office is still getting her insulin titrated up. The endocrinologist is following the guideline. The guideline is loosely tethered to a literature that includes ACCORD as one entry among many, methodologically equivalent to a single observational study in the synthesis. The trial that should have ended a theory has been absorbed into a footnote.
This is how medicine learns when it does not want to.
I want to close with a discussion that generalizes beyond diabetes.
ACCORD did not fully reorganize the field around the possibility that biomarker/surrogate improvement and clinical benefit can meaningfully diverge.
That matters because much of modern medicine in today's fast world increasingly depends on surrogate endpoints, composite outcomes, accelerated approvals, and post-marketing inference. The question is not whether these tools are ever useful. Many clearly are. The question is how reliably the system recalibrates when a major trial challenges the assumptions underneath them.
My own view is that medicine updates unevenly. It incorporates evidence more readily when the evidence extends an existing framework than when it destabilizes one. That is not primarily a story about bad actors. Most people inside the system are trying, in good faith, to improve patient outcomes. But institutional structures, professional commitments, and clinical habits all create momentum that is difficult to reverse.
I do not think this is a counsel of despair. I think it is a reminder that negative trials deserve the same intellectual seriousness as positive ones.
The trials running today will test a new generation of assumptions. Some of those will survive. Some will not. The harder question is whether we will recognize the difference quickly enough to adapt our regulatory science and medicine when the evidence arrives.