U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Pillay J, Chordiya P, Dhakal S, et al. Behavioral Programs for Diabetes Mellitus. Rockville (MD): Agency for Healthcare Research and Quality (US); 2015 Sep. (Evidence Reports/Technology Assessments, No. 221.)

  • This publication is provided for historical reference only and the information may be out of date.

This publication is provided for historical reference only and the information may be out of date.

Cover of Behavioral Programs for Diabetes Mellitus

Behavioral Programs for Diabetes Mellitus.

Show details

Results

This chapter begins with a summary of our literature search. We then present the findings separately for type 1 diabetes mellitus (T1DM) and type 2 diabetes mellitus (T2DM). Within each section we present a general description of the included studies followed by our findings by Key Question (KQ). Specific details for the organization of the sections for T1DM and T2DM are included below.

Literature Search and Screening

Our database and gray literature searches identified 47,141 citations, and 11 additional records were identified from reference lists of systematic reviews and included studies. For T1DM, we included 34 studies described in 44 publications. For T2DM, we included 132 studies described in 161 publications. Figure 3 describes the flow of literature through the screening process. Appendix D provides a complete list of articles excluded at the full-text screening stage, with reasons for exclusion.

Figure 3 is a flow chart that summarizes the search and selection of articles. It begins with the total number of citations retrieved from the literature searches and ends with the number of studies that satisfied the inclusion criteria of the report. The search identified 63,739 citations from electronic databases and 2,513 from gray literature searching. After removal of duplicates, 47,141 records were screened based on titles and abstracts and we identified 698 potentially relevant studies. We identified 8 additional studies by hand searching the reference lists from included studies and relevant systematic reviews. The full texts of 706 potentially relevant reports were evaluated for inclusion. Using a standardized inclusion–exclusion form, 205 studies were included and 504 were excluded. There are 34 primary reports with 10 associated publications for type 1 diabetes and 132 primary reports with 29 associated publications for type 2 diabetes. One study was included for both type 1 and type 2 diabetes mellitus. The reasons for exclusion were: ineligible intervention (277), ineligible outcomes (51), ineligible study design (58), ineligible publication type (35), ineligible population (30), ineligible setting/country (36), ineligible duration (15) and ineligible language (2). A complete list of excluded studies and reasons for exclusion can be found in Appendix D of the full report.

Figure 3

Flow diagram of study retrieval and selection. T1DM = type 1 diabetes mellitus; T2DM = type 2 diabetes mellitus a One study was included for both T1DM and T2DM.

Type 1 Diabetes Mellitus

This section begins with the results of our literature search, a general description of all included studies, separate summaries of studies that focused on youth followed by those that focused on adults, and a summary of the risk of bias (ROB) assessment. We then present results by KQ. We begin with results of behavioral programs compared with usual care, followed by studies comparing behavioral programs with an active control, and then by those comparing two or more behavioral programs (i.e., comparative effectiveness). The results are grouped first by outcome (e.g., HbA1c) and then by follow-up timepoint. For each outcome results are presented by age groups (youth and adults), where appropriate. We present results as mean differences (MD), standardized mean differences (SMD), or risk ratios (RR), with 95 percent confidence intervals (95% CI) in figures with meta-analyses or in summary tables. Where statistical heterogeneity was considered substantial (>50 percent) we report the I2 Statistic (I2%).

For each KQ, we give the key points and then present a detailed synthesis of the evidence. Appendix E (Table E2) includes the ROB assessments for each trial. Summary tables describing studies are found in Appendix F (Tables F1 and F2); they are organized alphabetically by author. For observational studies, we present a narrative summary of the results for HbA1c. Other outcomes from the observational studies are documented in Appendix G. For KQs 1 and 2, we summarize the strength of evidence (SOE) assessments, which are provided in detail in Appendix H.

Literature Search and Screening

For T1DM, we included 34 studies described in 44 publications (Figure 3). Primary reports were identified for 30 randomized controlled trials (RCT),82-111 1 non-RCT,112 and 3 controlled before-after studies.113-115 Ten additional publications contributed information related to the study methodology, outcomes, or descriptions of the interventions.116-125 One of the studies included both T1DM (49 percent) and T2DM (51 percent) patients; results were reported for each patient group and the study is included in both T1DM and T2DM of this review.107

Characteristics of Included Studies

The majority of studies (30 trials, 2 observational studies) examined diabetes self-management education (DSME); two studies (1 RCT,105 1 observational study114) focused on lifestyle programs (see Appendix A for operational definitions). For DSME, most trials (n=23) were two-arm trials comparing DSME to usual care. Three two-arm RCTs compared DSME to an active control.87,91,92 The active controls included telephone support87 and basic education.91,92 Three RCTs were three-arm trials with one having two active control arms107 and the other two each had a usual care and an active control arm.83,108 For one, the authors combined the usual care and active control arms.83 For the others, we analyzed the usual care and active control arms separately,108 or combined the two active control arms.107,108 One RCT evaluated the comparative effectiveness of the same DSME program delivered in person compared with delivery by internet-based videoconferencing (Skype™).90 Two observational studies compared DSME with usual care.113,115

Both studies focusing on lifestyle programs compared them with usual care. One was a two-arm RCT105 and the other was an observational study.114

Youth

Clinical Trials

Twenty-three RCTs83-90,92,93,96-104,106,108,110,111 and six associated publications117-119,121,122,125 examined the effectiveness of behavioral programs among youth; only one study examined children, hence our use of the term youth to categorize these studies. Most RCTs were two-arm trials and focused on DSME compared with usual care. One RCT compared a DSME program delivered in person compared with delivery using Skype90 and another compared delivery of DSME in person compared with a telephone support active control.87 Two three-arm trials compared a DSME program with usual care and an active control (basic education program),83,108 although the authors of one combined the two control groups for their analyses.83 Sixteen trials were conducted in the United States;83,84,86-88,90,92,96-100,103,104,108,110 six were conducted in Europe,85,89,93,101,102,106 and one was conducted in Australia.111

The mean age of the youth participants ranged from 9.7–15.4 years (median=13.4). One study did not report age.97 The percentage of males ranged from 0–63 percent (median=47). The proportion of nonwhite participants was between 2–82 percent (median=23.5); nine trials did not present information on race or ethnicity.90,93,98-101,106,110,111 For most trials, the mean HbA1c was >7 percent and ranged from 7.4–15.7 percent (median=9.6 percent). One trial did not report absolute baseline HbA1c.103

All trials in youth recruited patients/families from outpatient clinical settings providing usual care throughout the study period. Clinical settings mostly consisted of diabetes/endocrinology clinics located at university-affiliated hospitals, and care was commonly described to include quarterly clinic visits with a multidisciplinary team of providers offering education and additional consults as needed. One study's usual care included eight visits over a one-year period.93 Some studies reported additional components including: regular adherence assessments,83,98 in-clinic goal setting and a daily phone hour with education provided between visits,87,88 access to an emergency hotline,89 and basic care coordination with clinic reminders and assistance with scheduling appointments.96,104 Three trials reported that usual care included more advanced education,87,88,108 and one multicenter trial's exclusion criteria for study centers included the availability of a group education program.85

A basic description of the behavioral programs delivered to youth is provided in Appendix F (Table F1). Although all studies included in the review evaluated programs which, as reported, met our operational definition of a behavioral program, there was considerable diversity in terms of the program content and delivery. Some programs were designed to coincide with office/clinic visits; however, there was variability in the degree of integration with medical care and in program intensity. Some programs were fully integrated into the clinic visit and were delivered by the clinic's health care personnel.93,101,102 Other programs were delivered by non-clinic staff (e.g., trained research assistant, internists) either prior to or after the patient was seen by the health care team.83,92,96,98,104 One study combined in-clinic goal setting with automated weekly delivery of tailored education and support messages.89 Two office-based programs had relatively high intensity with more than 10 contacts.93,96 The majority of office-based programs were delivered to the family, with a focus on family teamwork, conflict, and coping.83,92,96,98,101,102,104 Programs that did not coincide with clinic/office visits largely consisted of weekly or monthly sessions incorporating various behavioral approaches such as problem-solving, coping, and empowerment training.84,86,97,99,100,103,106,110,111 Some also offered a more therapeutic approach together with some degree of self-management training (i.e., behavioral family systems therapy,90,99,108 motivational enhancement therapy combined with solution-focused therapy,85 and multisystemic therapy87,88). Many programs were targeted at adolescents,83,84,87-90,92,93,97,99,102,103,106,108,110,111while others were tailored to children,100 or offered to mixed age groups.85,96,98,101,104 Below, we present a summary of the program delivery factors.

The total duration of the behavioral programs ranged from 1.2–25 months (median=5.6). The number of contact hours ranged from 1–48 hours (median=9.5). Four trials did not report enough information to calculate the number of contact hours.84,89,98,103

Five trials delivered the programs to youth only;84,86,89,103,106 16 delivered the programs to both youth and their parents or family members.83,85,87,88,90,92,93,96-102,104,108 Four trials delivered the program in person to groups of youth only,84,86,106,111 and two trials delivered the program to youth using a mix of in-person sessions supplemented by telephone calls103 or text messaging.89 Eight trials delivered the program in person to individual pairs of youth and family members.83,87,88,90,93,98,108,110 Six trials delivered the program in person to groups of youth and family members.85,97,100-102,106 Three trials delivered the program to individual pairs of youth and family members using a mix of in-person sessions supplemented by telephone calls.92,96,104 Two trials delivered the program to individuals using telehealth99 and Skype.90

For eight trials, the program was delivered by a single health care professional (e.g., nurse, psychologist, registered dietitian).84,86-88,90,106,110,111 Six trials engaged two or more health professionals,85,89,93,101,102,108 seven trials used non-health professionals (e.g., research assistants, health-related students or trainees),83,92,96,98,99,103,104 and one trial used a combination of a health professional and a trainee.97 One trial did not report this information.100

All of the behavioral programs had some degree of tailoring in terms of their content (e.g., individualized goal setting, topics based on age group) and/or delivery (e.g., coinciding with office visits, number of visits determined based on needs assessment). Several had a moderate–to–high level of tailoring in both content and delivery.87,88,90,92,93,96,97,99,103,104,108,110 Four interventions included some degree of community engagement, including involvement of peers and/or school personnel.85,87,88,90

Observational Studies

Two controlled before-after studies explored the effectiveness of behavioral programs delivered to youth and their parents or families. One study compared a DSME intervention with usual care;115 the other compared a lifestyle intervention with usual care.114

The study by Viner et al.115 was conducted in the United Kingdom. The target population was youth with poor glycemic control (HbA1c >8.5 percent). The mean ages were 13.0 and 13.1 years for the intervention and control groups, respectively; mean HbA1c was 10.2 and 10.0 percent for the intervention and control groups. The 1.5-month program was delivered in person to groups of youth (6 meetings) and, separately, to groups of parents (1 meeting). The program was based on motivational and solution-focused techniques, with elements of cognitive behavioral therapy. The content of the program was tailored to youth with adherence issues and also targeted changes at self-identified behaviors. No information was reported for community engagement.

The study by Thomas-Dobersen et al.114 examined a lifestyle program that targeted overweight adolescents; body mass index ranged from 22–36 kg/m2. The study was conducted in the United States. The mean ages were 13.9 and 15.2 years and mean HbA1c was 12.2 and 13.1 percent for the intervention and control groups, respectively. The 3-month program was delivered by a multidisciplinary team in person to groups of adolescents and, in separate group sessions, to their parents. Program content was tailored to adolescents with diabetes although there was minimal tailoring in the delivery of the structured group sessions. No information was reported for community engagement.

Adults

Clinical Trials

Seven RCTs82,91,94,95,105,107,109 with four associated publications,116,120,123,124 and one non-RCT112 examined the effectiveness of behavioral programs among adults. Two RCTs included participants with T2DM. One RCT presented results for HbA1c separately for T1DM and T2DM and is included in both sections of this report.107 The other study did not report results separately for T1DM or T2DM; however, the majority (>75 percent) of participants had T1DM so we have included it in this section of the report.95 Six trials focused on DSME compared with usual care,82,94,95,105,109,112 two examined DSME compared with one91 or two107 active controls, and one compared a lifestyle intervention with usual care.105 Six of the trials were conducted in European countries,82,91,94,95,109,112 one was conducted in the United States,107 and one was conducted in New Zealand.105

The mean age of participants ranged from 30–49 years. The percentage of males ranged from 35–62 percent. The proportion of nonwhite participants was between 4.5–25 percent in two trials;94,107 the other trials did not present information on race or ethnicity. For all trials, the mean HbA1c was >7 percent and ranged from 7.7–9.6 percent. The mean BMI ranged from 24.8–27.6 kg/m2; three trials did not report BMI.95,109,112

Similar to the trials in youth, usual care was usually provided by out-patient diabetes clinics/centers from which the participants were recruited. Usual care was not described by Karlsen et al.95 who took a different approach by recruiting survey respondents, and may have been diverse in the trial of Perry et al.105 which supplemented clinic recruitment with that from radio and newspaper advertisements. Visit frequency was described less often, but for half of the studies was biannually to quarterly.94,105,107,112 The usual care in one trial included provision of and training in a continuous glucose monitoring system.82

A basic description of the behavioral programs delivered to adults is provided in Appendix F (Table F2). Several of the programs incorporated elements of cognitive behavioral therapy,82,94,95,107 with one combining cognitive behavioral therapy with motivational enhancement therapy.94 In one study authors described their program as taking an empowerment approach,91 another incorporated guided self-determination group training,109 and one offered self-management training using an ongoing self-help group style.112 The program presented by Amsberg et al.82 included a 9-month maintenance period during which telephone support calls were provided; this study also incorporated training using a continuous glucose monitoring system. Below, we present a summary of implementation factors.

The total duration of the behavioral programs ranged from 1.5–12 months (median=6 months). The number of contact hours ranged from 9–52 hours (median=16). One trial included an intense phase (2 months) followed by a 9-month support period.82 Five trials delivered the program in person to groups of participants,91,95,107,109,112 two delivered the program in person to individuals,94,105 and one trial used a mix of individual and small group sessions that were delivered in person and by telephone.82 For three of the trials, the program was delivered by a single health care professional (i.e., nurse, registered dietitian, physician).91,94,112 Four trials engaged two or more health professionals,82,105,107,109 and one trial used a health care professional and a peer (with diabetes and trained in program delivery) who served as coleader. All reports described the programs to have a moderate-to-high degree of tailoring of content to the participants' individual needs; fewer had mechanisms (e.g., telephone followup, collaborative delivery by professional and participants) to tailor the delivery of the program.82,95,109,112 One trial incorporated community engagement through the use of a peer coleader;95 the remaining trials either involved no community engagement or did not report this information.

Observational Studies

One controlled before-after study explored the effectiveness of a DSME program among adults (≤65 years) who were receiving intensive insulin therapy.113 The study was conducted in Italy. Baseline HbA1c was ≥7.5 percent in 59 and 63 percent of the intervention and control groups, respectively. The 4-month intervention was an education program including empowerment group teaching and situation simulation, and comprised eight 2-hour group sessions led by a physician or dietitian. There was some tailoring of the content towards patients receiving intensive therapy; no information was reported for community engagement.

Risk of Bias of Individual Studies

A summary of the ROB assessments for the 31 trials is presented in Figure 4; the consensus assessments in all domains for each study are presented in Appendix E. All trials were assessed as having a medium (unclear) or high overall ROB. For objective outcomes (e.g., HbA1c, weight), 58 percent of trials had a medium ROB and 42 percent had a high risk. The assessment of high risk was largely driven by incomplete outcome data (i.e., loss to followup). For trials (n=22) reporting subjective outcomes of interest to this review (e.g., health-related quality of life [HRQL], patient-reported self-management behaviors), all but one trial had a high risk of bias (95 percent). This was primarily due to lack of blinding of participants, study personnel, and outcome assessors.

Figure 4 displays a Risk of Bias summary chart for studies of type 1 diabetes. This figure is described further in the section “Risk of Bias of Individual Studies” as follows: “A summary of the ROB assessments for the 31 trials is presented in Figure 4; the consensus assessments in all domains for each study are presented in Appendix E. All trials were assessed as having a medium (unclear) or high overall ROB. For objective outcomes (e.g., HbA1c, weight), 58 percent of trials had a medium ROB and 42 percent had a high risk. The assessment of high risk was largely driven by incomplete outcome data (i.e., loss to followup). For trials (n=22) reporting subjective outcomes of interest to this review (e.g., health-related quality of life [HRQL], patient-reported self-management behaviors), all but one trial had a high risk of bias (95 percent). This was primarily due to lack of blinding of participants, study personnel, and outcome assessors.”

Figure 4

Risk of bias summary for trials of behavioral programs for type 1 diabetes.

The risk of bias for the three observational studies was assessed using the Newcastle Ottawa Scale. The study by Viner et al.115 was assessed as having medium ROB (seven stars out of a possible nine); the study by Forlani et al.113 was assessed as medium ROB (five stars); and the study by Thomas-Dobersen et al.114 was assessed as low ROB (eight stars). For all studies there was concern about the control of potential confounding variables including baseline HbA1c and socioeconomic status. For Forlani et al. and Viner et al. there were concerns about the representativeness of the exposed cohort.

Five studies (15 percent) received funding from industry; 26 (76 percent) received funding from non-industry sources (e.g., government or foundations). Funding was not reported by three (9 percent) studies.

KQ 1. Behavioral Programs for T1DM and Behavioral, Clinical, and Health Outcomes; Diabetes-Related Health Care Utilization; and Program Acceptability

Key Points: HbA1c

  • There was no significant difference (low SOE) in changes in HbA1c at the end of intervention between behavioral programs and usual care.
  • Behavioral programs compared with usual care reduced HbA1c (moderate SOE) at 6-month postintervention followup; the change was statistically significant but not clinically important.
  • There was no significant difference in reduction of HbA1c between behavioral programs and usual care at followup timepoints longer than 6 months. The SOE for these findings was low because of risk of bias and imprecise effect estimates; further, because the 95% CIs included our threshold for clinical importance (favoring behavioral programs) we cannot rule out benefit for behavioral programs.
  • Behavioral programs compared with an active control reduced HbA1c to a statistically significant and clinically important (moderate SOE) degree at 6-month followup.
  • Compared with active controls, the estimates of effect for behavioral programs showed no significant difference in HbA1c at end of intervention and at 12-month followup. The SOE was low for both; risk of bias as well as imprecise effect estimates and inclusion of a clinically important benefit reduces confidence in their accuracy.

Key Points: Other Clinical and Behavioral Outcomes

  • Participants receiving behavioral programs compared with usual care did not differ in terms of adherence to diabetes self-management at the end of intervention or 6-month followup (low SOE for both); there was insufficient SOE for longer followup and for all comparisons with active controls.
  • Few trials reported on change in body composition, physical activity or fitness, or change in dietary or nutrient intake.
  • Few trials reported on symptoms of depression, or on episodes of severe hypo- or hyperglycemia.
  • The SOE was insufficient to determine whether behavioral programs increased or decreased changes in body composition, physical activity or fitness, or dietary or nutrient intake.

Key Points: Health Outcomes

  • For participants receiving behavioral programs compared with usual care, there was no difference in generic HRQL at the end of intervention (moderate SOE). Few trials reported on generic HRQL at longer followup timepoints.
  • In comparisons with usual care, there was insufficient SOE to assess whether there was any effect on diabetes-specific HRQL at any timepoint, and low SOE of no difference for diabetes distress at end of intervention and 6-month followup. The 95% CIs for diabetes distress included our threshold for clinical importance such that we cannot rule out a favorable effect for behavioral programs.
  • There were no data on HRQL for comparisons of behavioral programs with active controls.
  • No trials reported on micro- and macrovascular complications or on all-cause mortality.

Key Points: Diabetes-Related Health Care Utilization

  • Few trials reported number of diabetes-related hospital admissions, emergency department admissions, or other measures of health care utilization.

Key Points: Program Acceptability

  • There was a 21 percent increased risk of attrition for individuals receiving usual care compared with those receiving a behavioral program.

Detailed Synthesis

HbA1c: Behavioral Programs Compared With Usual Care

Figures 5-7 present our meta-analyses and forest plots of trials reporting HbA1c stratified by age (youth and adults). A negative MD represents a greater reduction in percent HbA1c for the behavioral program compared with usual care. We present separate forest plots for different timepoints—end of intervention, 6-month postintervention followup, and 12-month postintervention followup. We provide a narrative summary of the four RCTs that reported outcomes for longer followup timepoints.

Figure 5 displays a forest plot of studies reporting the mean difference of HbA1C in behavioral program versus usual care at end of intervention. This figure is described further the section “Detailed Synthesis; HbA1C: Behavioral Programs Compared With Usual Care” as follows: At the end of 6-month postintervention followup for youth and adults combined, our meta-analysis (12 trials, 1,463 subjects) showed that HbA1c improved for persons who received a behavioral program compared with those receiving usual care (MD, -0.31 percent; 95% CI, -0.47 to -0.15). The reduction in HbA1c was not clinically important. For youth (10 trials, 1,213 subjects), the difference between groups was statistically significant, but it was not clinically important (MD, -0.28 percent; 95% CI, -0.51 to -0.05). For adults (2 trials, 250 subjects), there was no difference between groups.”

Figure 5

Behavioral programs for type 1 diabetes compared with usual care: HbA1c at the end of intervention. CI = confidence interval; HbA1c = hemoglobin A1c; n = number of participants; SD = standard deviation

Figure 6 displays a forest plot of studies reporting the mean difference of HbA1C in behavioral program versus usual care at end of 6-month postintervention followup. This figure is described further the section “Detailed Synthesis; HbA1C: Behavioral Programs Compared With Usual Care” as follows: “At the end of 6-month postintervention followup for youth and adults combined, our meta-analysis (11 trials, 1,316 subjects) showed that HbA1c improved for persons who received a behavioral program compared with those receiving usual care (MD, -0.33 percent; 95% CI, -0.51 to -0.15). For youth (9 trials, 1,066 subjects), the difference between groups was statistically significant (MD, -0.30 percent; 95% CI, -0.57 to -0.03). For adults (2 trials, 250 subjects), there was no difference between groups (MD, -0.38 percent; 95% CI, -0.82 to 0.06).”

Figure 6

Behavioral programs for type 1 diabetes compared with usual care: HbA1c at 6-month postintervention. CI = confidence interval; HbA1c = hemoglobin A1c; n = number of participants; SD = standard deviation

Figure 7 displays a forest plot of studies reporting the mean difference of HbA1C in behavioral program versus usual care at end of 12-month postintervention followup. This figure is described further the section “Detailed Synthesis; HbA1C: Behavioral Programs Compared With Usual Care” as follows: “At the end of 12-month postintervention followup for youth, our meta-analysis (7 trials, 1,333 youth) found no difference in HbA1c between individuals receiving a behavioral program and those receiving usual care (MD, -0.22 percent; 95% CI, -0.49 to 0.05).”

Figure 7

Behavioral programs for type 1 diabetes compared with usual care: HbA1c at 12-month postintervention (youth only). CI = confidence interval; HbA1c = hemoglobin A1c; n = number of participants; SD = standard deviation

At the end of intervention for youth and adults combined, our meta-analysis (16 trials, 1,155 subjects) found no difference in percent HbA1c between individuals receiving a behavioral program and those receiving usual care (MD, -0.11; 95% CI, -0.33 to 0.11).82-84,89,93-96,98,99,101,105,106,108,110,112 There was no difference between groups for youth (11 trials, 653 subjects)83,84,89,93,96,98,99,101,106,108,110 or for adults (5 trials, 502 subjects)82,94,95,105,112—MD = 0.00 (95% CI, -0.33 to 0.33) and MD = -0.28 (95% CI, -0.57 to 0.01), respectively.

At the end of 6-month postintervention followup for youth and adults combined, our meta-analysis (12 trials, 1,463 subjects) showed that HbA1c improved for persons who received a behavioral program compared with those receiving usual care (MD, -0.31 percent; 95% CI, -0.47 to -0.15).84,86,88,93,94,100,102-104,108,109,111 The reduction in HbA1c was not clinically important. For youth (10 trials, 1,213 subjects),84,86,88,93,100,102-104,108,111 the difference between groups was statistically significant, but it was not clinically important (MD, -0.28 percent; 95% CI, -0.51 to -0.05). For adults (2 trials, 250 subjects), there was no difference between groups.94,109

At the end of 12-month postintervention followup for youth, our meta-analysis (7 trials, 1,333 youth) found no difference in HbA1c between individuals receiving a behavioral program and those receiving usual care (MD, -0.22 percent; 95% CI, -0.49 to 0.05).83,85,102-104,108,111

Four studies provided data at longer followup timepoints (data not shown). Three RCTs (2 youth,103,104 1 adult;94 671 subjects) reported data at more than 1 year, but less than 2 years; there was no difference in HbA1c between groups (MD, -0.40; 95% CI, -0.92 to 0.12). Two trials (1 youth,85 1 adult;94 467 subjects) reported outcomes at 24 months and found no difference in HbA1c (MD, -0.08; 95% CI, -1.96 to 1.8).

One trial in adolescents did not report sufficient data to be included in our meta-analysis; the authors found no statistically significant difference between groups at 6-month followup.97

Three observational studies (2 youth,114,115 1 adult;113 148 subjects) provided data on HbA1c at 12-month followup. One youth study (41 subjects) reported a statistically significant and clinically important improvement in HbA1c for the group receiving the behavioral program (MD, -1.2; 95% CI, -2.24 to -0.16).115 The other youth study (17 subjects) found no difference between groups (MD, 0.67; 95% CI, -1.47 to 2.81).114 The study that was conducted in adults (90 subjects) reported a statistically significant and clinically important improvement in HbA1c for the group receiving the behavioral program (MD, -0.70; 95% CI, -1.31 to -0.09).113 These results should be interpreted with caution because of concerns with bias and confounding in observational studies; the only study assessed as having low risk of bias found no difference.114

HbA1c: Behavioral Programs Compared With Active Control

Figures 8-10 present our meta-analyses of trials reporting HbA1c for youth and adults in comparisons with active controls. We present the results by followup timepoint (end of intervention, 6-month followup, 12-month followup) and age group. One trial in adults was a three-arm trial comparing a behavioral program to two different active controls (didactic education to either groups or individuals); these arms were combined for the meta-analysis.107

Figure 8 displays a forest plot of studies reporting the mean difference of HbA1C in behavioral program versus active control at end of intervention. This figure is described further the section “Detailed Synthesis; HbA1C: Behavioral Programs Compared With Active Control” as follows: “At the end of intervention, our meta-analysis for youth and adults (4 trials, 419 youth and 110 adults) found no difference between behavioral programs and active controls for HbA1c (MD, -0.32; 95% CI, -0.97 to 0.33). When examining the results by age subgroups, similar results were found for youth (MD, -0.33; 95% CI, -1.65 to 0.99; I2=69%).87, 92, 108 and adults (MD, -0.35; 95% CI, -0.81 to 0.11).”

Figure 8

Behavioral programs for type 1 diabetes compared with active control: HbA1c at end of intervention. CI = confidence interval; HbA1c = hemoglobin A1c; n = number of participants; SD = standard deviation

Figure 9 displays a forest plot of studies reporting the mean difference of HbA1C in behavioral program versus active control at end of 6-month postintervention followup. This figure is described further the section “Detailed Synthesis; HbA1C: Behavioral Programs Compared With Active Control” as follows: “At the end of 6 months postintervention, our meta-analysis for youth and adults combined (4 trials [259 adults,91, 107 208 youth92, 108]) showed that HbA1c improved for those receiving a behavioral program compared with those receiving an active control (MD, -0.44; 95% CI, -0.69 to -0.19); this reduction in HbA1c is clinically important. For youth, the difference was not statistically significant (MD, -0.60; 95% CI, -2.56 to 1.36);92, 108 for adults, the difference was not statistically significant and the effect size was not clinically important (MD, -0.38; 95% CI, -0.93 to -0.17).”

Figure 9

Behavioral programs for type 1 diabetes compared with active control: HbA1c at 6-month postintervention. CI = confidence interval; HbA1c = hemoglobin A1c; n = number of participants; SD = standard deviation

Figure 10 displays a forest plot of studies reporting the mean difference of HbA1C in behavioral program versus active control at end of 12-month postintervention followup. This figure is described further the section “Detailed Synthesis; HbA1C: Behavioral Programs Compared With Active Control” as follows: “At the end of 12-month followup, our meta-analysis for youth and adults combined (3 trials [110 adults,107 195 youth92, 108) found no difference in HbA1c (MD, -0.44; 95% CI, -1.04 to 0.16). For youth, the difference was statistically significant and clinically important (MD, -0.52; 95% CI, -1.04 to 0.00); the behavioral program studied by Weinger et al.,107 failed to demonstrate any difference (MD, -0.14; 95% CI, -0.61 to 0.33).”

Figure 10

Behavioral programs for type 1 diabetes compared with active control: HbA1c at 12-month postintervention. CI = confidence interval; HbA1c = hemoglobin A1c; n = number of participants; SD = standard deviation

At the end of intervention, our meta-analysis for youth and adults (4 trials, 419 youth87,92,108 and 110 adults107) found no difference between behavioral programs and active controls for HbA1c (MD, -0.32; 95% CI, -0.97 to 0.33). When examining the results by age subgroups, similar results were found for youth (MD, -0.33; 95% CI, -1.65 to 0.99; I2=69%).87,92,108 and adults (MD, -0.35; 95% CI, -0.81 to 0.11).107

At the end of 6 months postintervention, our meta-analysis for youth and adults combined (4 trials [259 adults,91,107 208 youth92,108]) showed that HbA1c improved for those receiving a behavioral program compared with those receiving an active control (MD, -0.44; 95% CI, -0.69 to -0.19); this reduction in HbA1c is clinically important. For youth, the difference was not statistically significant (MD, -0.60; 95% CI, -2.56 to 1.36);92,108 for adults, the difference was not statistically significant and the effect size was not clinically important (MD, -0.38; 95% CI, -0.93 to -0.17).91,107

At the end of 12-month followup, our meta-analysis for youth and adults combined (3 trials [110 adults,107 195 youth92,108) found no difference in HbA1c (MD, -0.44; 95% CI, -1.04 to 0.16). For youth, the difference was statistically significant and clinically important (MD, -0.52; 95% CI, -1.04 to 0.00); the behavioral program studied by Weinger et al.,107 failed to demonstrate any difference (MD, -0.14; 95% CI, -0.61 to 0.33).

HbA1c: Comparative Effectiveness of Two Behavioral Programs

One RCT (72 youth) examined the same DSME program delivered in person compared with delivery by Skype.90 There was no difference in HbA1c between groups at the end of intervention (MD, -0.04; 95% CI, -0.87 to 0.79) or at 6-month followup (MD, -0.24; 95% CI, -1.10 to 0.62).

Adherence to Diabetes Self-Management: Behavioral Programs Compared With Usual Care

This section presents the results from trials that reported on adherence to diabetes self-management. This outcome was measured in a number of ways and we report them separately. The most common measure was self-monitoring of blood glucose (SMBG) and was most commonly reported as the frequency of blood glucose testing over 1 day.84,86,88,96,104 Two studies reported the frequency of testing over the past week;93,109 we converted this to the number of tests per day. We present separate forest plots for different timepoints (end of intervention, 6 month followup). We provide a narrative summary of the one RCT that reported outcomes for longer followup.

At the end of intervention (Figure 11), our meta-analysis (4 trials, 282 youth) found no difference in frequency of SMBG between youth receiving a behavioral program and those receiving usual care (MD, 0.15; 95% CI, -0.54 to 0.84).84,88,93,96

Figure 11 displays a forest plot of studies reporting the mean difference of self-monitoring of blood glucose in behavioral program versus usual care at end of intervention. This figure is described further the section “Detailed Synthesis; Adherence to Diabetes Self-Management: Behavioral Programs Compared With Usual Care” as follows: “At the end of intervention, our meta-analysis (4 trials, 282 youth) found no difference in frequency of SMBG between youth receiving a behavioral program and those receiving usual care (MD, 0.15; 95% CI, -0.54 to 0.84).”

Figure 11

Behavioral programs for type 1 diabetes compared with usual care: self-monitoring of blood glucose (tests per day) at end of intervention. CI = confidence interval; n = number of participants; SD = standard deviation

At the end of 6-month postintervention for youth and adults combined (Figure 12), our meta-analysis (5 trials [4 youth,84,86,88,93 1 adult109], 252 subjects) found no difference in SMBG between individuals receiving a behavioral program and those receiving usual care (MD, 0.40; 95% CI, -0.36 to 1.16). Adults receiving the behavioral program in the trial of Zoffmann et al.109 increased their frequency of SMBG (MD, 1.42; 95% CI, 0.11 to 2.75).

Figure 12 displays a forest plot of studies reporting the mean difference of self-monitoring of blood glucose in behavioral program versus usual care at 6-month postintervention. This figure is described further the section “Detailed Synthesis; Adherence to Diabetes Self-Management: Behavioral Programs Compared With Usual Care” as follows: “At the end of 6-month postintervention for youth and adults combined, our meta-analysis (5 trials [4 youth, 1 adult], 252 subjects) found no difference in SMBG between individuals receiving a behavioral program and those receiving usual care (MD, 0.40; 95% CI, -0.36 to 1.16). Adults receiving the behavioral program in the trial of Zoffmann et al. increased their frequency of SMBG (MD, 1.42; 95% CI, 0.11 to 2.75).”

Figure 12

Behavioral programs for type 1 diabetes compared with usual care: self-monitoring of blood glucose (tests per day) at 6-month postintervention. CI = confidence interval; n = number of participants; SD = standard deviation

One trial (390 youth) reported SMBG at 24-months postintervention.104 The results showed individuals receiving the behavioral program performed more poorly than those receiving usual care (MD, -0.36; 95% CI, -0.69 to -0.03).

Two trials in adults measured adherence of blood glucose testing using an item from the Summary of Diabetes Self-Care Activities (SDSCA) questionnaire.126 This self-report measure assesses the number of days in the previous week that SMBG was practiced. At the end of intervention one trial (74 adults) found that those in the behavioral program reported performing SMBG 1.4 days (95% CI, 0.35 to 2.43) more than those receiving usual care.82 At 6-month postintervention, one trial (244 adults) found no difference between groups (MD, -0.06; 95% CI, -0.60 to 0.48).94

Four trials in youth used the Diabetes Self-Management Profile (DSMP)127 to assess adherence to the diabetes regimen at different timepoints. At the end of intervention, Wysocki et al.108 (54 youth) reported a clinically important improvement in the overall DSMP score for those who received the behavioral program compared with those receiving usual care (MD, 5.00; 95% CI, 0.60 to 9.40). This difference had disappeared by 12-month postintervention. Two studies assessed adherence at 6-month postintervention followup; we did not pool the results as the studies reported different summary measures. In 2012, Nansel et al.104 (390 youth) found no difference between groups (MD, 1.31; 95% CI, -1.12 to 3.74). In an earlier study, Nansel et al.103 (81 youth) reported the proportion of adherence to an optimal diabetes regimen using the modified DSMP. They found no difference between groups (MD, -0.03; 95% CI, -0.06 to -0.01). The fourth study reported that there was no difference between groups on the DSMP at end of intervention; however, the authors did not provide any data.99

Two trials reported on adherence to medication. One trial (190 youth) used a questionnaire item to assess the number of times youth skipped an insulin dose in the past month.85 The authors reported that the odds of skipping one or more doses compared with no doses of insulin at 12-month followup was 0.82 (95% CI, 0.48 to 1.38) and at 24-month followup was 1.30 (95% CI, 0.78 to 2.17) for the group receiving the behavioral program. One trial in adults (74 adults) used the medication item of the Diabetes Self-Care Inventory128 and found no difference at the end of intervention between those receiving the behavioral program and those receiving usual care (MD, 0.22; 95% CI, -0.60 to 1.04).82

Adherence to Diabetes Self-Management: Behavioral Programs Compared With Active Control

One trial (149 adults) found no difference in frequency of SMBG between groups at 6-months postintervention (MD, -0.20; 95% CI, -0.76 to 0.36).91 The same trial measured adherence to several diabetes self-care activities using the SDSCA and found no difference between groups at 6-month postintervention (MD, 0.00; 95% CI, -0.35 to 0.35).91

One trial (54 youth) used the DSMP to assess adherence to the diabetes regimen.108 At the end of intervention and 12-month followup, Wysocki et al.108 found no difference between the group that received the behavioral program compared with those receiving an active control—MD = 2.40 (95% CI, -2.46 to 7.26) and MD = 2.00 (95% CI, -3.78 to 7.78), respectively).

One trial (149 youth)92 used the Diabetes Behavior Rating Scale, which reflects the frequency of routine diabetes care behaviors over the previous week.129 No data were provided; however, the authors reported that at end of intervention, and 6- and 12-month followup, those receiving the behavioral program performed more poorly that than those in the active control group.

Adherence to Diabetes Self-Management: Comparative Effectiveness of Two Behavioral Programs

One RCT (71 youth) studied the same DSME program delivered in person compared with delivery by Skype.90 The authors used the DSMP to assess adherence and found no difference between the groups at the end of intervention or at 6-month followup (MD, 0.85; 95% CI, -4.56 to 6.26 and MD, 0.74; 95% CI, -4.97 to 6.45, respectively).

Other Clinical and Behavioral Outcomes

Table 4 summarizes the results for other clinical and behavioral outcomes. For most outcomes results were reported in single trials.

Table 4. Other clinical and behavioral outcomes for type 1 diabetes.

Table 4

Other clinical and behavioral outcomes for type 1 diabetes.

Health-Related Quality of Life: Behavioral Programs Compared With Usual Care

Studies reporting on HRQL assessed this using generic and diabetes-specific quality of life measures. Generic HRQL was measured by a number of tools (e.g., World Health Organization Well-Being Index,130 Pediatric Quality of Life [PedsQL],131 Wellbeing Questionnaire132), as was diabetes-specific HRQL (PedsQL diabetes module,131 Pediatric Diabetes Quality of Life, Well-being Enquiry for Diabetes133). A group of studies reported on diabetes distress/stress (tools included Problem Areas in Diabetes134 and Diabetes Stress Questionnaire84), for which we analyzed separately from diabetes-specific HRQL. For all analyses we present the results as SMD. Figure 13 presents our meta-analyses of trials, stratified by age (youth and adults), that reported generic HRQL at end of intervention. Longer-term followup results were reported for generic HRQL and are summarized in Table 5. The meta-analysis results in Figure 14 for diabetes-specific HRQL at end of intervention were not stratified by age. Figures 15 and 16 present the meta-analyses for diabetes distress at end of intervention (stratified by age) and 6-month followup, respectively.

Figure 13 displays a forest plot of studies reporting the standardized mean difference of generic health-related quality of life in behavioral program versus usual care at end of intervention. This figure is described further the section “Detailed Synthesis; Health-Related Quality of Life: Behavioral Programs Compared With Usual Care” as follows: “At the end of intervention for youth and adults combined (Figure 13), our meta-analysis (7 trials [5 youth,93, 96-98, 110 2 adult82, 95], 474 subjects) found no difference in generic HRQL between individuals receiving a behavioral program and those receiving usual care (SMD, 0.10; 95% CI, -0.18 to 0.38). The lack of difference remained for the subgroups of adults (2 trials, 137 subjects; MD, 0.35; 95% CI -1.93 to 2.63)82, 95 and youth (5 trials, 337 subjects; MD, 0.01; 95% CI -0.33 to 0.35).”

Figure 13

Behavioral programs for type 1 diabetes compared with usual care: generic health-related quality of life at end of intervention. CI = confidence interval; n = number of participants; SD = standard deviation

Table 5. Behavioral programs for type 1 diabetes compared with usual care: generic health-related quality of life at 6-, 12-, and 24-month postintervention.

Table 5

Behavioral programs for type 1 diabetes compared with usual care: generic health-related quality of life at 6-, 12-, and 24-month postintervention.

Figure 14 displays a forest plot of studies reporting the standardized mean difference of diabetes-specific health-related quality of life in behavioral program versus usual care at end of intervention. This figure is described further the section “Detailed Synthesis; Health-Related Quality of Life: Behavioral Programs Compared With Usual Care” as follows: “Diabetes-specific HRQL was reported by three trials at the end of intervention. Our meta-analysis of these trials (2 youth, 1 adult, 212 subjects) found no difference between behavioral programs and usual care (SMD, 0.08; 95% CI, -1.44 to 1.60; I2=73%). One observational study in adults (90 subjects) found no difference between groups at 12-months postintervention (SMD, 0.03; 95% CI, -0.39 to 0.45).”

Figure 14

Behavioral programs for type 1 diabetes compared with usual care: diabetes-specific health-related quality of life at end of intervention. CI = confidence interval; n = number of participants; SD = standard deviation; Std = standardized

Figure 15 displays a forest plot of studies reporting the standardized mean difference of diabetes distress/stress in behavioral program versus usual care at end of intervention. This figure is described further the section “Detailed Synthesis; Health-Related Quality of Life: Behavioral Programs Compared With Usual Care” as follows: “Distress/stress was reported for six trials; negative scores represent reduced distress. At end of intervention, our meta-analysis for youth and adults combined (4 trials [2 youth, 2 adults], 209 subjects) found no statistically significant difference in diabetes distress for behavioral programs compared with usual care (SMD, -0.31; 95% CI, -0.83 to 0.21). Stratified by age, there was no difference for the studies of youth (SMD, -0.21; 95% CI, -2.84 to 2.60) or adults (SMD, -0.41; 95% CI, -3.78 to 2.96; I2= 57%).”

Figure 15

Behavioral programs for type 1 diabetes compared with usual care: diabetes distress/stress at end of intervention. CI = confidence interval; n = number of participants; SD = standard deviation; Std = standardized

Figure 16 displays a forest plot of studies reporting the standardized mean difference of diabetes distress/stress in behavioral program versus usual care at 6-month postintervention followup. This figure is described further the section “Detailed Synthesis; Health-Related Quality of Life: Behavioral Programs Compared With Usual Care” as follows: “At 6-month followup for youth and adults combined (4 trials [3 youth, 1 adult], 236 subjects), changes to diabetes distress did not differ for behavioral programs compared with usual care (SMD, -0.28; 95% CI, -0.94 to 0.38).

Figure 16

Behavioral programs for type 1 diabetes compared with usual care: diabetes distress at 6-month postintervention followup. CI = confidence interval; n = number of participants; SD = standard deviation; Std = standardized

At the end of intervention for youth and adults combined (Figure 13), our meta-analysis (7 trials [5 youth,93,96-98,110 2 adult82,95], 474 subjects) found no difference in generic HRQL between individuals receiving a behavioral program and those receiving usual care (SMD, 0.10; 95% CI, -0.18 to 0.38). The lack of difference remained for the subgroups of adults (2 trials, 137 subjects; MD, 0.35; 95% CI -1.93 to 2.63)82,95 and youth (5 trials, 337 subjects; MD, 0.01; 95% CI -0.33 to 0.35).93,96-98,110

Three RCTs in youth reported on generic HRQL for longer followup timepoints (Table 5).85,93,98 There was no difference in HRQL between groups at any of the timepoints.

Diabetes-specific HRQL was reported by three trials at the end of intervention (Figure 14). Our meta-analysis of these trials (2 youth,97,110 1 adult,112 212 subjects) found no difference between behavioral programs and usual care (SMD, 0.08; 95% CI, -1.44 to 1.60; I2=73%). One observational study in adults (90 subjects) found no difference between groups at 12-months postintervention (SMD, 0.03; 95% CI, -0.39 to 0.45).113

Distress/stress was reported for six trials; negative scores represent reduced distress. At end of intervention (Figure 15), our meta-analysis for youth and adults combined (4 trials [2 youth,84,93 2 adults82,95], 209 subjects) found no statistically significant difference in diabetes distress for behavioral programs compared with usual care (SMD, -0.31; 95% CI, -0.83 to 0.21). Stratified by age, there was no difference for the studies of youth (SMD, -0.21; 95% CI, -2.84 to 2.60) or adults (SMD, -0.41; 95% CI, -3.78 to 2.96; I2= 57%). At 6-month followup for youth and adults combined (4 trials [3 youth,84,93,111 1 adult109], 236 subjects), changes to diabetes distress did not differ for behavioral programs compared with usual care (SMD, -0.28; 95% CI, -0.94 to 0.38) (Figure 16).

Health-Related Quality of Life: Behavioral Programs Compared With Active Control

One trial in youth failed to demonstrate a difference in diabetes-related quality of life between a behavioral program and an active control at 12-month followup (130 subjects; insufficient data reported to calculate SMD).92

Diabetes-Related Health Care Utilization: Behavioral Programs Compared With Usual Care

Diabetes-related health care utilization was reported infrequently and only for trials comparing behavioral programs to usual care. We summarize the results in Table 6. One RCT in youth found a reduced risk of diabetes-related hospital admissions at end of intervention and at 6-month followup for those receiving behavioral programs compared with usual care.88 The same trial also reported fewer admissions to the emergency department at the end of intervention. Another RCT in youth85 and one in adults94 found no difference in hospital admission at any timepoint. One trial reported that there was no difference in the number of diabetes-related hospital and emergency department admissions at the 6-month followup; however, the authors did not provide any data.97

Table 6. Behavioral programs for type 1 diabetes compared with usual care: diabetes-related health care utilization at end of intervention, 6-, 12-, and 24-month postintervention followup.

Table 6

Behavioral programs for type 1 diabetes compared with usual care: diabetes-related health care utilization at end of intervention, 6-, 12-, and 24-month postintervention followup.

Program Acceptability: Behavioral Programs Compared With Usual Care

Figure 17 presents our meta-analysis stratified by age (youth and adults) of trials that reported participant attrition at their longest followup timepoint. Our meta-analysis (21 trials, 2,503 subjects) found a 21 percent increased risk of attrition for individuals receiving usual care compared with those receiving the behavioral program (RR, 1.21; 95% CI, 1.05 to 1.39).82-86,88,89,93-95,99,100,102-106,108-111

Figure 17 displays a forest plot of studies reporting the risk ratio of participant attrition in behavioral program versus usual care. This figure is described further the section “Detailed Synthesis; Program Acceptability: Behavioral Programs Compared With Usual Care” as follows: “Figure 17 presents our meta-analysis stratified by age (youth and adults) of trials that reported participant attrition at their longest followup timepoint. Our meta-analysis (21 trials, 2,503 subjects) found a 21 percent increased risk of attrition for individuals receiving usual care compared with those receiving the behavioral program (RR, 1.21; 95% CI, 1.05 to 1.39).”

Figure 17

Behavioral programs for type 1 diabetes compared with usual care: participant attrition. CI = confidence interval; n = number of participants

Program Acceptability: Behavioral Programs Compared With Active Control

Three RCTs (218 youth87,108 and 160 adults91) compared behavioral programs with active comparators. The pooled analysis (data not shown) found no difference between the groups for participant attrition (RR, 1.05; 95% CI, 0.46 to 2.4).

Program Acceptability: Comparative Effectiveness of Two Behavioral Programs

One RCT (72 youth) compared the same DSME program delivered in person compared with delivery by Skype.90 There was no difference between the groups in participant attrition (RR, 0.55; 95% CI, 0.28 to 1.11).

Summary of Key Findings and Strength of Evidence for KQ 1

There was moderate SOE showing differences in HbA1c at 6-month postintervention followup with greater reduction in HbA1c for individuals who were enrolled in behavioral programs compared with those receiving usual care (Table 7). For other timepoints, there was low SOE for no significant difference in HbA1c. At followup greater than 6 months, the estimated effects were imprecise and because the 95% CIs included our threshold for clinical importance we cannot rule out benefit for behavioral programs. There was low SOE showing no difference in adherence to diabetes self-management at end of intervention and 6-month followup. There was moderate SOE of no difference at the end of intervention for generic HRQL, and low SOE of no difference for diabetes distress at end of intervention and at 6-month followup. The 95% CIs for diabetes distress included our threshold for clinical importance such that we cannot rule out a favorable effect for behavioral programs. There was insufficient SOE for diabetes-related HRQL, and for outcomes related to changes in body composition, physical fitness, and dietary intake.

Table 7. Type 1 diabetes: summary of key findings and strength of evidence for behavioral programs compared with usual care.

Table 7

Type 1 diabetes: summary of key findings and strength of evidence for behavioral programs compared with usual care.

There was moderate SOE showing differences in HbA1c at 6-month postintervention followup with a clinically important reduction in HbA1c for individuals who were enrolled in behavioral programs compared with those receiving an active control (Table 8). At end of intervention and 12-month followup, there was low SOE showing no difference in HbA1c; because the 95% CIs included our threshold for a clinically important effect, we cannot rule out a benefit for behavioral programs. There was insufficient evidence for adherence to diabetes self-management at any followup timepoint.

Table 8. Type 1 diabetes: summary of key findings and strength of evidence for behavioral programs compared with an active control.

Table 8

Type 1 diabetes: summary of key findings and strength of evidence for behavioral programs compared with an active control.

KQ 2. Subgroups for Effectiveness in T1DM

This KQ evaluated whether behavioral programs differed in effectiveness for subgroups of patients with T1DM. For this question, we searched for subgroup analyses reported by individual trials that focused on whether a particular program was more or less effective in reducing HbA1c (the outcome reported by the most studies) based on age (children and adolescents [≤18 years], young adults [19-30 years], adults [31-64 years], older adults ≥65 years]), race or ethnicity, socioeconomic status, time since diagnosis (≤1 year vs. >1 year), and level of glycemic control (HbA1c <7 vs. ≥7 percent). We also looked at subgroups at the study level, for example when the mean age of participants fell within one of the age categories, or the majority (≥75 percent) of the participants was stated as racial/ethnic minorities. We evaluated the SOE for the subgroups based on age (Figures 5-10); insufficient data were reported or available for other subgroups.

Key Points

  • Based on between-study results for comparisons with usual care, results were consistent with the general trend when looking at all studies. At 6 months, behavioral programs reduced HbA1c in studies of youth by a statistically significant 0.28 percent and in studies of adults by a non-statistically significant 0.38 percent. At end of intervention, the point estimates indicated greater benefit for adults (0.28) than youth (0.00), although neither of these values reached statistical significance. None of the point estimates exceeded the a priori established clinically important difference of 0.4 percent HbA1c.
  • The effectiveness of behavioral programs compared with active controls appeared higher for youth than for adults at12-month followup; the effectiveness for youth was clinically important. The small number of studies in most subgroups provided insufficient SOE.
  • One trial reported results separately for youth with baseline HbA1c≥8 percent and found favorable results for this subgroup.
  • No trials reported on HbA1c by race or ethnicity, socioeconomic status, or time since diagnosis.

Detailed Synthesis

Age

In KQ 1, we presented our results by age groups (youth and adults). Behavioral programs appeared to be more effective in reducing HbA1c for adults than for youth at end of intervention when compared to usual care (Figure 5); the effect size in the meta-analysis for adults82,94,95,105,112 was greater in absolute terms than for the youth83,84,89,93,96,98,99,101,106,108,110 (MD = -0.28 vs. 0.00 respectively); the results for adults approached statistical significance and the 95% CI contained our threshold for clinical importance. At 6-month followup, the effect sizes for youth84,86,88,93,100,102-104,108,111 and adults94,109 appeared similar (MD = -0.28 vs. MD = -0.38, respectively); only the results for youth reached statistical significance, although the 95% CIs in both groups included a clinically important effect size favoring behavioral programs. No study in adults reported at 12-month followup; the youth results showed no difference (MD, -0.22; 95% CI, -0.49 to 0.05) although the 95% CI included a clinically important effect for behavioral programs.

When compared with active controls at end of intervention, the effect sizes for youth (MD, -0.33; 95% CI -1.65 to 0.99) and adults (MD, -0.35; 95% CI -0.81 to 0.11) were both similar to the overall effect size and nonsignificant with imprecise 95% CIs. At 6-month followup, the effect size was larger for the youth92,108 than for the adults91,107 (MD -0.60 vs. -0.38) but both results failed to reach statistical significance. At 12-month followup, results for youth were statistically significant and clinically important (MD, -0.52; 95% CI, -1.04 to 0.00);92,108 for adults there was no difference at 12-month followup (MD, -0.14; 95% CI, -1.28 to 1.00).107.

In the studies that included adults only, the mean age across the studies ranged from 30.3–49.2 years. None of the studies reported results separately for young adults or older adults.

Level of Glycemic Control

One RCT (101 youth) conducted a subgroup analysis of 54 youth with suboptimal baseline glycemic control (HbA1c ≥8 percent).96 At the end of intervention, Katz et al.96 found that those receiving the behavioral program had greater odds of maintaining or improving their HbA1c compared with those receiving usual care (odds ratio, 3.4; 95% CI, 1.0 to 11.9). This compares favorably to the overall study results which found no difference in change in glycemic control for the group receiving the behavioral program (MD, 0.30; 95% CI, -0.22 to 0.82). No data were reported for the subgroup of youth with optimal baseline HbA1c. Subgroup analysis at the study level was not conducted because the mean baseline HbA1c was >7 percent for all studies.

Other Subgroups

No data were reported for any of our other pre-specified subgroups: race or ethnicity, socioeconomic status, or time since diagnosis.

Summary of Key Findings and Strength of Evidence for KQ 2

At end of intervention, there was low SOE of no significant difference for both youth and adults, but the effect size appeared greater for adults, approached statistical significance, and its 95% CI included a clinically important value favoring behavioral programs (Table 9). The pooled effect estimate for youth was precise, but there was inconsistency in the individual study results with clinically important effects both for and against behavioral programs. Similar to the SOE when combining studies of youth and adults at 6-month followup (KQ 1), there was moderate SOE showing greater reduction in HbA1c for youth attending behavioral programs compared with usual care. The SOE for adults was low for no difference due to high risk of bias and imprecision (related to low sample size); nevertheless, the 95% CI included a large effect size suggesting there may be some benefit. There were no changes to the SOE at 12-month followup because of the lack of adult studies reporting this data.

Table 9. Type 1 diabetes: summary of key findings and strength of evidence for subgroups (by age) receiving behavioral programs compared with usual care.

Table 9

Type 1 diabetes: summary of key findings and strength of evidence for subgroups (by age) receiving behavioral programs compared with usual care.

For subgroups based on age in comparisons with active controls, the small number of studies (and sample sizes) led to wide pooled 95% CIs which in some cases included values of clinical importance both for and against behavioral programs; because of these factors, the SOE was graded as insufficient in all but two cases (Table 10). In studies of youth with followup to 12 months, there was low SOE of a clinically important benefit for behavioral programs; in studies of adults with 6-month followup, there was low SOE for no difference in HbA1c.

Table 10. Type 1 diabetes: summary of key findings and strength of evidence for subgroups (by age) receiving behavioral programs compared with active controls.

Table 10

Type 1 diabetes: summary of key findings and strength of evidence for subgroups (by age) receiving behavioral programs compared with active controls.

KQ 3. Potential Moderation of Effectiveness for T1DM: Components, Intensity, Delivery Personnel, Method of Communication, Degree of Tailoring, and Level of Community Engagement

To assess whether the effectiveness of behavioral programs differed based on various program factors (i.e., intensity, delivery personnel, method of communication, degree of tailoring, and level of community engagement), we performed univariate meta-regressions for comparisons between behavioral programs and usual care at longest followup (Table 11). See Table 3 in Methods for our classification scheme. See the Characteristics of Included Studies section for a summary, and the description of interventions for each study in the summary tables in Appendix F.

Table 11. Results from univariate meta-regressions analyzing the association between different program factors and the effectiveness of behavioral programs in improving HbA1c for T1DM.

Table 11

Results from univariate meta-regressions analyzing the association between different program factors and the effectiveness of behavioral programs in improving HbA1c for T1DM.

We did not have enough studies to conduct a multiple variable meta-regression analysis, nor were there sufficient studies for analysis of those comparing behavioral programs with active controls or other behavioral programs. We conducted the analysis for HbA1c; other outcomes did not have sufficient studies (≥10 studies) associated with them to support meaningful analyses. All but one study105 fell under the category of DSME, therefore we did not conduct a regression analysis on program components.

Key Points

  • Program intensity, including duration, contact hours, and frequency of contacts, appeared not to influence program effectiveness; the results were not statistically significant but were very precise (i.e., narrow 95% CIs) for no incremental effect when increasing intensity.
  • Although not reaching statistical significance, delivery of programs to individuals appeared beneficial compared with delivery to groups.

Detailed Synthesis

Table 11 summarizes the results of the univariate meta-regressions conduced with 25 studies.82-86,88,89,93-96,98-106,108-112 Duration of intervention (months), intensity (contact hours) and frequency of contacts were analyzed as continuous variables. Frequency of contacts is a composite variable combining duration and contact hours (contact hours per month). The delivery personnel variable had three categories. The remaining variables were dichotomized as shown in Table 11. The analysis for support persons assessed the impact of programs targeted at youth alone compared with those targeted at both youth and their parents or families; adult studies82,94,95,105,109,112 were not included in this analysis. The results indicated that the variables of duration, contact hours, and contact frequency appear not to influence program effectiveness; the coefficients are essentially zero (e.g., an additional month of program duration would not reduce HbA1c to any greater extent) and the 95% CIs are very precise without any indication of potentially producing a clinically important effect considering our threshold of 0.4. Delivery to individuals appears to be beneficial compared with delivery to groups (i.e., positive coefficient indicating switching to group delivery increased HbA1c); the result approached statistical significance and the 95% CI included a value meeting our threshold for clinical importance. Evidence was insufficient for other program factors; the lack of reporting for community engagement precluded any interpretation of the results.

KQ 4. Harms for T1DM

No studies reported on the associated harms (i.e. activity-related injury) of behavioral programs.

Type 2 Diabetes Mellitus

This section begins with a description of the results of our literature search and screening, a general description of the included RCTs and the behavioral programs investigated, and a summary of our ROB assessment. We follow this by presenting an overview on the effectiveness of behavioral programs for key outcomes, and then presenting the results for KQs 5 and 6. The results on effectiveness are grouped by outcome category (i.e., clinical, behavioral, and health) and then by comparison group (i.e., usual care, active control, and other interventions [comparative effectiveness]), and postintervention followup timepoint. For this section, results are presented as MD, SMD, or RR, with associated 95% CIs. Where statistical heterogeneity was considered substantial (>50 percent) we report the I2 Statistic (I2%). For results on KQs 5 and 6 for which we performed network meta-analysis, we describe the creation of groups (nodes) of interventions, and present the results including the MD and associated 95 percent credibility intervals, the rank order of each node, and a percentage referring to the node's “probability of being best” (PB). The analysis for KQ 6 also included a set of univariate meta-regressions; we present these results in a summary table.

For each KQ, we provide key points and then present a detailed synthesis of the evidence. Table E2 in Appendix E includes the ROB assessments for each RCT. A summary table describing the studies and interventions is included in Appendix F (Table F3). Appendix I contains summary tables of the effectiveness for all outcomes of behavioral programs compared with usual care (Table I1), active controls (Table I2), and other behavioral programs (Table I3). The results for the network meta-analyses for HbA1c in the subgroup analyses for KQ 6 are found in Appendix J. The Supplementary File includes figures (forest plots) of pairwise meta-analyses between behavioral programs and usual care and active control groups, for all outcomes across all timepoints where more than one study reported findings.

Literature Search and Screening

For T2DM, we included 132 primary reports of RCTs,107,135-265 and 29 associated publications266-294 (including one abstract)293 providing information related to the study methodology, outcomes, or description of the interventions (Figure 3). One of the studies was also included in the section on T1DM because it provided data on HbA1c outcomes separately for T1DM and T2DM.107

Characteristics of Included Studies

The majority of RCTs were two-arm trials with the following comparisons: 1) DSME with usual care (55 trials)135,136,138-143,147,149,153,155,158,162,163,171,173,176-179,183,187,193-197,203,206,211,213,215,218-220,223-226,228,229,231,233,235,238,242,245-247,253,257-260 or an active control (7 trials),146,154,181,182,198,201,202 2) DSME and support with usual care (8 trials)151,189,207,208,210,216,217,222 or with an active control (1 trial),164 3) lifestyle programs with usual care (18 trials)137,143,145,157,160,167,190,205,236,239,240,249,251, 254,255,261-263 or an active control (7 trials),156,161,165,166,169,186,252 and, 4) between two behavioral programs (21 trials).144,150,152,159,170,172,180,185,188,199,204,209,212,221,232,237,243,244,248,250,256,264 Thirteen three-arm RCTs were included, with eight comparing behavioral programs with usual care,188,200,214,234,241 or active control,168,192,230 and five having one intervention arm compared with two controls.107,174,175,184,265 Three four-arm trials148,191,227 examined (1) two lifestyle programs compared with two dietary interventions,148 (2) one lifestyle program compared with two active controls (dietary and physical activity interventions) and a usual care arm,191 and (3) the comparative effectiveness between DSME and three DSME and support programs delivered by different personnel.227 Trials were conducted in 16 countries but the majority (63 percent) were undertaken in the United States. The primary reports of nine RCTs (7.3 percent) were published prior to the year 2000,137,140,159,165,204,213,232,245,251 and 57 (46 percent) were published since 2010.107,135,139,146,148,152-155,162,164,167,168,170-173,175,179,181,182,191,193,194,198,199,201,202,206-211,214,216-219,221,224,227,228,233,234,236,237,240-242,244,247-249,252,253,256

The mean age of the participants was between 45 and 72 years (median=58). Six studies did not report age.139,160,193,224,242,245 The percentage of males ranged from 0–100 percent (median=40 percent). The proportion of nonwhite participants was between 0 and 100 percent; the majority (≥75 percent) of participants in 32 trials reported nonwhite race/ethnicity,137,141,143,151,153,162,171,179, 188,189,195,197,205-208,210,215-219,222,228,229,231,233,240,246,247,257,262 and 9 trials included few (<10 percent) people of nonwhite race /ethnicity.149,183-185,212,239,249,251,256 Baseline HbA1c was between 6.3 and 12.3 percent (median=8 percent); five trials did not report this information.138,238,242,245,251 Median duration of diabetes was 8.1 years (range 1-18 years). The median percentage of participants prescribed treatment with insulin was 19.5 percent; one study assessed the effectiveness of a lifestyle program in a sample of patients who were all initiated on insulin therapy,145 and another studied a DSME program in patients receiving ongoing intensive insulin treatment.181 Body mass index ranged from 23.8–39.1 kg/m2 (median=33.0 kg/m2).

Table F3 in Appendix F includes details on each behavioral program studied. Several trials evaluated more than one behavioral program; there were 166 intervention arms in total. Overall, median program duration was 6 months (range 1–96) and median number of contact hours was 12 (range 1–208). Technology was the primary method of communication for 17 programs studied in 16 trials,138,139,147,167,171,179,185,187,194,241,247,253,258,259,264,265 and was used alone or in combination with in-person communication in 42 programs; based on our inclusion criteria, all programs were delivered with some form of communication with delivery personnel. Sixty-four programs were delivered to individuals only, 56 to groups only, and 44 had some mixture of individual and group delivery (see Table F3 for details). Half (83 of 166; 50 percent) of programs were delivered by one health care professional, with (n=16) or without (n=67) the assistance of a non-health care professional; other programs were delivered by a multidisciplinary team (48 arms; 29 percent) or solely by non-health care professionals (31 arms; 19 percent) (see Table F3). Data on the delivery personnel could not be determined for two studies.187,236

Risk of Bias of Individual Studies

A summary of the ROB assessments for the 132 trials is presented in Figure 18; the consensus assessments for all domains in each study are presented in Appendix E. All trials were assessed as having a medium (unclear) or high overall ROB. For objective outcomes (e.g., HbA1c, weight, blood pressure), 42 percent of trials had a medium ROB and 58 percent had a high risk. The assessment of high ROB was largely driven by incomplete outcome data (i.e., loss to followup). For trials (n=92) reporting on subjective outcomes of interest for this review (e.g., HRQL, depression), 13 percent had a medium ROB; the remainder (87 percent) had a high ROB. This was primarily due to lack of blinding of participants, study personnel, and outcome assessors (see Methods section and the Supplementary File for a description of decision rules for these assessments).

Figure 18 displays a Risk of Bias summary chart for trials of behavioral programs for type 2 diabetes. This figure is described further in the section “Risk of Bias of Individual Studies” as follows: “A summary of the ROB assessments for the 132 trials is presented in Figure 14; the consensus assessments for all domains in each study are presented in Appendix E. All trials were assessed as having a medium (unclear) or high overall ROB. For objective outcomes (e.g., HbA1c, weight, blood pressure), 42 percent of trials had a medium ROB and 58 percent had a high risk. The assessment of high ROB was largely driven by incomplete outcome data (i.e., loss to followup). For trials (n=92) reporting on subjective outcomes of interest for this review (e.g., HRQL, depression), 13 percent had a medium ROB; the remainder (87 percent) had a high ROB. This was primarily due to lack of blinding of participants, study personnel, and outcome assessors.”

Figure 18

Risk of bias summary for trials of behavioral programs for type 2 diabetes.

Twenty-four trials (18 percent) received funding from industry. One-hundred-six (80 percent) received funding from non-industry sources (e.g., government or foundations); of these, 15 (11 percent) received funding from both industry and non-industry sources. Funding was not reported for seven (0.5 percent) studies.

Effectiveness of Behavioral Programs Across Outcomes

We report on the overall effectiveness of behavioral programs before describing our results for KQs 5 and 6. This serves to summarize the findings on outcomes that did not contribute to the analyses for KQ 5 or 6, and to provide information for interpreting the results for KQs 5 and 6. We provide a summary of the results for our key outcomes, based on outcome category, comparison group, and timepoint. Because several trials studied more than one behavioral program, results are usually characterized by the number of comparisons rather than trials. The results for all outcomes are presented in summary tables in Appendix I; Table I1 contains results for behavioral programs compared with usual care and Table I2 contains those for comparisons with active controls. Most of these results are based on meta-analyses for two or more comparisons, and we indicate when no outcome data were available. Behavioral programs are not analyzed based on their components for these analyses; KQs 5 and 6 focused on potential moderation in effect by program components and other factors. Table I3 contains the results for key outcomes at longest followup (i.e., up to 12 months) from studies reporting on comparative effectiveness between different behavioral programs. This table is organized by outcome category and is grouped by comparisons in the manner the behavioral programs differed (e.g., comparing delivery personnel or intensity).

Key Clinical Outcomes: HbA1c and Change in Body Composition

HbA1c

Individuals receiving behavioral programs compared with usual control improved their glycemic control (i.e., reduced percent HbA1c) at end of intervention (66 comparisons; 8,715 subjects; MD, -0.35; 95% CI, -0.56 to -0.14; I2=74%),135,137,139,141,142,145,147,151,153,155,160,162,171,173,175-177,179,184,188-191,197,203,205-208,210,213-220,222-226,228,229,231,233,236,239-241,247,249,253-255,257-262,265 but not at 6-month (23 comparisons; 4,138 subjects; MD, -0.16; 95% CI, -0.36 to 0.04; I2=61%)136,140,143,146,163,173,178,183,193-196,211,215,229,234,235,241,246, 249,259 or 12-month followup (9 comparisons; 1,494 subjects; MD, -0.14; 95% CI, -0.4 to 0.12; I2=59%).146,158,163,173,178,193,223,234 The results were of a smaller magnitude when behavioral programs were compared with active control groups at end of intervention (25 comparisons; 7,518 subjects; MD, -0.24; 95% CI, -0.41 to -0.07; I2=70%).107,154,161,164-166,168,169,174,175,184,186,191,192,198,202,230,252,265 For 6-month followup, the effect size was similar but the results reached statistical significance (6 comparisons; 595 subjects; MD, -0.19; 95% CI, -0.37 to -0.01).107,156,181,182,201 The estimate was nonsignificant and imprecise at 12-month followup (6 comparisons; 486 subjects; MD, -1.10; 95% CI, -2.56 to 0.36).107,164,192,201 No result was clinically important based on our prespecified threshold of 0.4 unit change in percent HbA1c. The meta-analyses for HbA1c indicated high heterogeneity in effect between studies across timepoints (I2 ranged from 61–98 percent). As described in the Methods, we performed sensitivity analyses to explore this issue; however, none of the prespecified variables reduced the heterogeneity to below 50 percent so we present the original results.

In three trials (701 subjects) providing comparative effectiveness between DSME delivered to groups compared with delivery to individuals or via a mixture of individual and group delivery, there was a beneficial effect for those individuals receiving DSME in groups at up to 12-months followup (MD, -0.36; 95% CI, -0.63 to -0.08).192,212,234 In contrast, there was a benefit at end of intervention shown in a trial comparing individual DSME and motivational interviewing with group-based empowerment DSME and supervised group exercise (143 subjects; MD, -0.30; 95% CI, -0.58 to -0.02).244 Several comparative effectiveness studies found no difference in HbA1c changes between groups. Some examples include the addition of an additional treatment (e.g., problem solving therapy,168 music therapy199) or a support aspect to a DSME or lifestyle program;172,227,230 others include comparisons between peer and health professional delivery of a program component (see Appendix I).144,172,227,256

Six trials reported on HbA1c but did not provide data suitable for inclusion in the meta-analysis. Five trials comparing a behavioral program with usual care did not find a significant difference between groups.149,157,167,187,200 One trial comparing two behavioral interventions with different delivery methods also found no difference between groups.159

Visualization of funnel plots did not suggest publication bias, and using the Egger test78 for this outcome resulted in no significant indication of bias for comparisons with usual care (p=0.25) or active controls (p=0.21) at end of intervention.

Change in Body Composition

Compared with usual care, behavioral programs assisted participants in reducing their BMI (kg·m-2) at all three timepoints—end of intervention (36 comparisons; 4,280 subjects; MD, -0.51; 95% CI, -0.66 to -0.36),135,137,139,145,151,153,155,162,171,175,179,184,189,190,206,208,210,214,215,224,226,233,239-242,246,249,251,255,257,259-61 6-month followup (14 comparisons, 1,840 subjects; MD, -0.21; 95% CI, -0.32 to -0.1),136,143,146,163,183,193,211,215,241,246,249,251,259 and 12-month followup (5 comparisons; 867 subjects; MD, -0.92; 95% CI, -1.44 to -0.4).146,157,163,193,238 When compared to active controls, behavioral programs did not reduce BMI at any followup timepoint. Body weight (kg) was reduced at end of intervention in those receiving behavioral programs compared with those receiving usual care (37 comparisons; 4,070 subjects; MD, -1.68; 95% CI, -2.06 to -1.30),137,141,145,147,153,160,167,176,178,184,188,190,191,200,203,205,213,214,217,222,224-226,239,246,249,254,258-263,265 or active control (15 comparisons; 6,212 subjects; MD, -1.30; 95% CI, -2.48 to -0.12; I2=78%).148,154,165,166,169,174,184,186,191,198,202,252,265 There was no reduction in weight at other timepoints; one trial showed an increase in weight at 12-month followup for the behavioral program compared with active control arm (95 subjects; MD, 3.70; 95% CI, 1.67 to 5.73).201 Waist circumference (cm) was reduced at end of intervention (17 comparisons, 1,521 subjects),145,153,162,167,190,203,214,215,224,226,241,254,255,259,261 in those comparisons with usual care—MD = -3.17 (95% CI, -4.36 to -1.98; I2=64%). One study found significant reduction in waist circumference at 6-month followup for those receiving a behavioral program compared to an active control (38 subjects; MD, -5.70; 95% CI, -6.54 to -4.86).156 There was no difference found in two studies comparing behavioral programs to usual care at 12-month followup;157,163 no data were available at 12-month followup for studies comparing behavioral programs to active control.

One comparative effectiveness trial (99 subjects) found that BMI was reduced (MD, -1.80; 95% CI, -2.51 to -1.09) at end of intervention for individuals receiving a cognitive-behavioral-therapy based lifestyle program including a portion-controlled diet compared with DSME including a meal plan.170 Participants in this study who received the lifestyle program also reduced their weight and waist circumference more than those receiving the DSME program—MD = -5.10kg (95% CI, -7.22 to -2.98) and MD = -3.60cm (95% CI, -5.33 to -1.87), respectively.

Behavioral Outcomes: Change in Dietary Intake and Physical Activity; Medication Adherence

Participants receiving behavioral programs compared with usual care reduced their energy intake (daily intake of kilocalories) to a small extent at end of intervention (11 comparisons; 1,164 subjects; MD, -149.62; 95% CI, -243.01 to -56.23; I2=68%)135,137,155,167,188,191,215,216,245,261 and 6-month followup (3 comparisons; 469 subjects; MD, -64.05; 95% CI, -96.44 to -31.66).163,167,215 There was no significant change at any timepoint in energy intake for comparisons with active controls, and no effect reached statistical significance for percent kilocalories from saturated fat.

Changes in intensity/duration of physical activity were measured by subjective (e.g., days per week in most cases) and objective (via accelerometers) means. Fifty percent of the studies reporting days per week of physical activity used the Summary of Diabetes Self-care Activities (SDSCA) questionnaire. Two trials (382 subjects) found that participants of behavioral programs increased the number of days per week of physical activity to a greater extent than those in usual care arms at 12-month followup (MD, 0.90; 95% CI, 0.90 to 0.90).163,238 These and several other trials138,163,184,219,226,236,238-240,253 did not find any difference at end of intervention or 6-month followup. One trial with 40 participants showed a negative affect for a behavioral program compared with an active control at end of intervention (MD, -1.06; 95% CI, -1.82 to -0.31).184 There was no difference reported for objective measurements of exercise duration/intensity (7 comparisons), or for measures of fitness (5 comparisons) in trials comparing behavioral programs to usual care or active controls.

Two comparative effectiveness trials found significant benefit for changes in physical activity. Based on self-report of days per week of engaging in moderate-to-intense physical activity, Vadstrup et al.244 found improvement (121 subjects; MD, 1.30; 95% CI, 0.80 to 1.80) for the group provided individual DSME and motivational interviewing compared with group-based empowerment DSME and supervised group exercise. Using the Modified Canadian Aerobic Fitness Test which estimates relative maximal oxygen consumption, Plotnikoff et al.,209 found improved fitness levels from supplementing DSME and support with a physical activity intervention (88 subjects; SMD, 0.62; 95% CI, 0.19 to 1.05).

Measurement of medication adherence was undertaken using various tools including the SDSCA,138,171 the Hill-Bone Compliance Scale,168,175 and the Morisky Adherence Scale.253 A significant effect for medication adherence—in favor of the usual care group—was maintained from end of intervention to 12-month followup in one trial (191 subjects; SMD, -0.50; 95% CI -0.79 to -0.21);238 other studies comparing behavioral programs to usual care found no difference at end of intervention or 6-month followup. Comparisons with active controls also found no difference at any followup timepoint.

Health Outcomes: Quality of Life, Micro- and Macrovascular Complications, All-Cause Mortality

Quality of Life

Outcomes for quality of life were categorized into five subcategories based on their focus (i.e., generic vs. diabetes-specific) and the similarity between studies in measurement scales. Groups of studies reported outcome data based on the SF-36 Health Survey (physical and mental component scores), and the Problem Areas in Diabetes (PAID) scale (0–100; lower score favorable) measuring diabetes distress. Accordingly, three of our subcategories represent these tools (i.e., Quality of Life–SF36 Physical, Quality of Life–SF36 Mental, and Diabetes Distress), for which we present results as MD. Other subcategories were created to combine other generic (Quality of Life–Other; e.g., WHO Quality of Life Brief, W-BQ12, EuroQol 5D) and diabetes-specific (Diabetes-specific Quality of Life; e.g., Diabetes Quality of Life, Diabetes Distress Scale, Appraisal of Diabetes, Diabetes Symptom Checklist) quality of life questionnaires; these results are presented as SMDs.

There was no difference in Quality of Life-SF36 (Physical) or Quality of Life-SF36 (Mental) when measured at end of intervention for comparisons with usual care,155,214,222,239 or up to 6-months followup for comparisons with active controls.169,181,252 There was no difference found for Quality of Life–Other in comparisons (n=7) with usual care up to 6-month followup,195,196,206,226,249,253 or in comparisons (n=4) with active controls up to 12-months followup.154,192 Results favored behavioral programs compared with usual care for Diabetes Distress (8 comparisons, 1,384 subjects) at end of intervention (MD, -1.82; 95% CI, -3.43 to -0.21),142,147,211,218,225,226,228,233 but not at longer followup.146,211,234 The result at end of intervention is not clinically important based on our prespecified threshold of a 0.5 SD using the mean SD of the included studies. One study (167 subjects) evaluating this outcome in a comparison to active controls found no difference at 6-month followup.181 There was no difference in Diabetes-specific Quality of Life at any followup timepoint to 12-month followup when comparing behavioral programs to usual care,146,163,175,177,189,215,253 or at end of intervention for programs compared with active controls.154,168,175

One trial assessed the effects on quality of life when the support phase of a DSME and support program was delivered by peers, clinical practice staff, or health care professionals (diabetes educators). Siminerio et al.227 found that Diabetes Distress worsened for the group receiving support from peers when compared to the group receiving support from the educators (74 subjects; MD, 24.70; 95% CI, 15.02 to 34.38). This effect is considered clinically important. There was no difference in Diabetes Distress when delivery of nonprofessional clinic staff was compared to that by health care professionals.

Micro- and Macrovascular Complications

Authors of the LookAHEAD trial (5,145 subjects) studied outcomes of myocardial infarctions, stroke, heart failure, diabetic nephropathy, diabetic retinopathy, and diabetic neuropathy. Diabetic retinopathy was reduced by 14% (hazard ratio, 0.86; 95% CI, 0.75 to 0.98) in participants receiving their intensive lifestyle program compared with an active control (didactic education and support) over a median of 8 years.278 A secondary analysis of nephropathy using a post hoc outcome of very-high-risk chronic kidney disease—a combination of the a priori outcomes albuminuria and estimated glomerular filtration rate, found a lower incidence of nephropathy for the intensive lifestyle program at the 8 year end-of-intervention timepoint (risk difference 0.27 cases per 100 person-years; hazard ratio, 0.69; 95% CI, 0.55 to 0.87).293 Results for the other outcomes in this trial did not reach statistical significance—myocardial infarction (RR, 0.86; 95% CI, 0.70 to 1.05), stroke (RR, 1.06; 95% CI, 0.79 to 1.44), heart failure (RR, 0.83; 95% CI, 0.64 to 1.08), and diabetic neuropathy (RR, 1.13; 95% CI, 0.92 to 1.38).

All-Cause Mortality

One study examined all-cause mortality as an pre-specified outcome;252 there were enough data in 27 reports to calculate a difference in all-cause mortality for the associated comparisons. There was no difference in all-cause mortality between participants receiving behavioral programs and usual care (25 comparisons; 4,659 subjects; RR, 1.28; 95% CI, 0.84 to 1.94); mortality between behavioral programs and active control groups (5 comparisons, 6,050 subjects) was 14 percent lower for those receiving behavioral programs (RR, 0.86; 95% CI, 0.77 to 0.96).

KQ 5. Potential Moderation of Effectiveness for T2DM: Components, Intensity, Delivery Personnel, Method of Communication, Degree of Tailoring, and Level of Community Engagement

Key Points: HbA1c

  • In a network meta-analysis with usual care serving as the reference, behavioral programs showing effect sizes above our threshold for clinical importance represented all three major program component categories of DSME, DSME and support, and lifestyle.
  • The effect sizes of all minimally intensive DSME programs (≤10 contact hours) were lower than our threshold for clinical importance, but were all higher than that for educational interventions not meeting our criteria for a behavioral program (e.g., didactic education programs).
  • Programs having the higher effect sizes and probabilities of being best (≥5 percent) were more often delivered in person rather than including technology.

Key Points: Body Mass Index

  • Lifestyle programs resulted in the highest effect sizes for BMI.
  • Program intensity appeared less important than method of delivery; providing some individual (rather than solely group-based) delivery appears beneficial.

Detailed Synthesis

We conducted network meta-analyses for the outcomes of HbA1c and BMI. These outcomes represent two of our key outcomes that were reported by the most studies. Tables 12 (HbA1c) and 13 (BMI) provide descriptions of the nodes (no two containing the same combination of variables), and include the results including the rank order of each node, the MD relative to usual care, the associated 95 percent credibility interval, and a percentage referring to the node's “probability of being best” (PB). These tables also indicate which studies contributed to each node, with the sample size of the applicable study arms, although it should be noted that the network approach accounts for direct and indirect comparisons such that other information contributes to the results. We summarize our approach and the results for each outcome below. Figures 19 and 20 contain the plots showing the relative ranking of the different nodes; the studies within each node are cited in the accompanying tables. A consistency analysis was performed for the HbA1c analysis and it was found that only two quadratic loops (of a total of 43 total quadratic and triangular loops) showed statistically significant inconsistency.

Table 12. Network meta-analysis for effect moderation on HbA1c results in T2DM: description of nodes and results.

Table 12

Network meta-analysis for effect moderation on HbA1c results in T2DM: description of nodes and results.

Table 13. Network meta-analysis for effect moderation on body mass index results for T2DM: description of nodes and results.

Table 13

Network meta-analysis for effect moderation on body mass index results for T2DM: description of nodes and results.

Figure 19 displays a plot of network meta-analysis for HbA1C. This figure is described further in the section “Detailed Synthesis; HbA1c” as follows: “The results of the network meta-analysis indicated that, in comparison to the reference of usual care, 14 nodes produced MDs which fell at or above our clinically important threshold (0.4) for change in percent HbA1c. Four of these nodes represent DSME, five represent DSME and support, and five represent lifestyle programs. Six nodes represent medium-intensity programs (11–26 contact hours), six represent high-intensity programs (≥26 contact hours), and two (one DSME and support, and one lifestyle) represent low-intensity programs (≤10 contact hours). The mean contact hours for the programs represented by these effective nodes was 26.4 (range 7-40.5 hours); the mean total program duration was 8 months (range 2-12). None of the nodes representing low-intensity DSME programs showed clinically important effects; all had greater impact on HbA1c than basic educational controls, but lower impact than a stand-alone dietary or physical activity intervention. Three of four nodes representing DSME programs with MDs showing clinically important effect were delivered by health care professionals. Eleven of the 14 nodes representing clinically important effects were delivered in person rather than incorporating some form of technology. Behavioral programs in the nodes with the highest PB (36 and 10.7 percent, respectively) were delivered in person rather than by incorporating technology. Similar observations were noted for the other four nodes having PB ≥5 percent, of which three were delivered in person and one was delivered using some form of technology; the latter group of studies provided supportive telephone calls between in-person sessions during lifestyle interventions tailored to minorities. All effective nodes representing some use of technology were of moderate or high intensity. An outlier having an MD of 2.80 (95% CI, 1.14 to 4.48) represented a study by Brown et al. which found greater HbA1c reduction at end of intervention in a group receiving DSME compared with one receiving DSME with the addition of a care manager.”

Figure 19

Plot of network meta-analysis results for effect moderation on HbA1c in T2DM. This plot depicts the results from our network meta-analysis for the outcome of HbA1c (negative values favorable) when comparing groups (“nodes”) of interventions, (more...)

Figure 20 displays a plot of network meta-analysis for Body Mass Index. This figure is described further in the section “Detailed Synthesis; Body Mass Index” as follows: “Averaging the baseline values in the studies, BMI at baseline was similar for programs classified as DSME (32.4 kg·m2), DSME and support (33.0 kg·m2), and lifestyle (32.9 kg·m2). The effect sizes for BMI from behavioral programs relative to usual care ranged between -1.77 kg·m2 and 3.29 kg·m2. The node with the most beneficial MD only represented one study157 evaluating a low-intensity lifestyle program with multiple brief contacts over 6 months. Nodes with rank orders 2 and 3 were both lifestyle programs of low and medium intensity, respectively. The node having the most studies (n=12) represented a DSME program of medium intensity (11–26 hours) which was delivered in person to groups; the results indicated this program to have 0 percent PB. One difference between the programs in this node and those with higher PB is that the higher PB all offered some individual delivery, rather than relying only on group delivery. Likewise, the majority of nodes having the highest MDs (i.e., 8 of the highest 10) offered some individual delivery.”

Figure 20

Plot of network meta-analysis results for effect moderation on body mass index for T2DM. DSME = diabetes self-management education; h = hours This plot depicts the results from our network meta-analysis for the outcome of body mass index (BMI) when comparing (more...)

HbA1c

Accounting for all variables of program components and delivery variables (Table 3) when creating the network was deemed not appropriate for various reasons. When choosing which variables to use, we prioritized them by considering factors including the: reliability and specificity with which we could categorize programs in each variable based on extent of reporting, overlap in meaning between variables, and the ability to inform those individuals making decisions to implement these programs in community settings. Deciding between program duration (months) and intensity (contact hours), the latter was chosen because it accounts for duration to some extent, aligns with our focus on interactive programs, and better enables one to estimate resource requirements in terms of personnel and space. Degree of tailoring was not chosen because every program incorporated this to some extent and categorizing this (e.g., minimal versus moderate in terms of content and delivery) was considered unreliable based on study reporting. Moreover, the use of technology (captured in the delivery method variable) was also considered a way to tailor the program to individuals, particularly in cases of poor access due to travel or time constraints. The level of community engagement was also not used because, when incorporated, this was largely via use of lay or peer providers which was captured in the delivery personnel variable. The remaining variables were placed in order (program components, program intensity, method of communication, method of delivery, and delivery personnel) and we then created nodes trying to incorporate as many variables as possible without having numerous nodes either empty (a theoretical grouping of variables that did not represent a studied program), or with only one or two programs. Dividing the data by the first variable of program components (DSME, DSME and support, and lifestyle) resulted in a relatively large number of DSME comparisons. For this group, we were able to use all five variables to create 24 potential nodes (18 which contained comparisons). We did not capture the variable of delivery personnel for the DSME and support, and lifestyle groups because most nodes would in this case contain at most one comparison.

When interpreting the results, we relied primarily on the relative ranking of the nodes, and looked for trends in the findings based on program variables that appeared to determine whether the effects would offer clinical benefit. Some nodes had very few studies, small sample sizes, and/or wide credibility intervals, thus we did not make any firm conclusions for a single node (or for differences in 561 potential comparisons) but rather from looking across nodes with similar features.

The results of the network meta-analysis indicated that, in comparison to the reference of usual care, 14 nodes produced MDs which fell at or above our clinically important threshold (0.4) for change in percent HbA1c. Four of these nodes represent DSME, five represent DSME and support, and five represent lifestyle programs. Six nodes represent medium-intensity programs (11–26 contact hours), six represent high-intensity programs (≥26 contact hours), and two (one DSME and support, and one lifestyle) represent low-intensity programs (≤10 contact hours). The mean contact hours for the programs represented by these effective nodes was 26.4 (range 7-40.5 hours); the mean total program duration was 8 months (range 2-12). None of the nodes representing low-intensity DSME programs showed clinically important effects; all had greater impact on HbA1c than basic educational controls, but lower impact than a stand-alone dietary or physical activity intervention. Three of four nodes representing DSME programs with MDs showing clinically important effect were delivered by health care professionals.

Eleven of the 14 nodes representing clinically important effects were delivered in person rather than incorporating some form of technology. Behavioral programs in the nodes with the highest PB (36 and 10.7 percent, respectively) were delivered in person rather than by incorporating technology. Similar observations were noted for the other four nodes having PB ≥5 percent, of which three were delivered in person and one was delivered using some form of technology; the latter group of studies provided supportive telephone calls between in-person sessions during lifestyle interventions tailored to minorities.143,160 All effective nodes representing some use of technology were of moderate or high intensity.

An outlier having an MD of 2.80 (95% CI, 1.14 to 4.48) represented a study by Brown et al.152 which found greater HbA1c reduction at end of intervention in a group receiving DSME compared with one receiving DSME with the addition of a care manager.

Body Mass Index

We created nodes using four variables for BMI (i.e., program component, program intensity, method of communication, and method of delivery). Of the 39 plausible nodes (each differing by only one level of one variable), there were studies with data to populate 26 nodes.

Averaging the baseline values in the studies, BMI at baseline was similar for programs classified as DSME (32.4 kg·m2), DSME and support (33.0 kg·m2), and lifestyle (32.9 kg·m2). The effect sizes for BMI from behavioral programs relative to usual care ranged between -1.77 kg·m2 and 3.29 kg·m2. The node with the most beneficial MD only represented one study157 evaluating a low-intensity lifestyle program with multiple brief contacts over 6 months. Nodes with rank orders 2 and 3 were both lifestyle programs of low and medium intensity, respectively. The node having the most studies (n=12) represented a DSME program of medium intensity (11–26 hours) which was delivered in person to groups; the results indicated this program to have 0 percent PB. One difference between the programs in this node and those with higher PB is that the higher PB all offered some individual delivery, rather than relying only on group delivery. Likewise, the majority of nodes having the highest MDs (i.e., 8 of the highest 10) offered some individual delivery.

KQ 6. Subgroups for Factors Moderating Effectiveness in T2DM

Key Points

Glycemic Control
  • In terms of overall effectiveness at longest followup for HbA1c, participants with suboptimal glycemic control (≥7 percent HbA1c) appear to benefit more than those with good control (<7 percent) from behavioral programs when compared to usual care and active controls. The effect sizes were not clinically important for either group.
  • Few differences were evident when evaluating potential moderation by program factors in a subgroup of studies having participants with suboptimal baseline glycemic control. Of the two nodes representing low-intensity programs that were found to have clinically important effects in the original network analysis, one was shown not effective for participants with suboptimal glycemic control. Active controls of dietary or physical activity interventions were not as effective for participants with suboptimal control.
Age
  • Older adults (≥65 years) did not benefit at longest followup in terms of reduction in HbA1c from behavioral programs in comparison with usual care or active controls. In adults <65 years, the effect size for behavioral programs compared with active controls at longest followup was clinically important.
Race/Ethnicity
  • Subgroup analysis of our meta-analyses comparing behavioral programs to usual care and active controls indicated that programs offered to predominantly minority participants (≥ 75 percent nonwhite) appear to provide more benefit than those offered to populations with a lower proportion (<75 percent) of nonwhite individuals. The effect size for minority participants reached clinical importance.
  • Based on univariate regression analyses for the subgroups based on race/ethnicity, none of the program factors (e.g., intensity, delivery personnel) reached statistical significance for influencing the effectiveness of behavioral programs compared to usual care on HbA1c. The subgroup of majority/white participants appeared to benefit more from lifestyle programs than from DSME or DSME plus support programs.
  • Glycemic control appeared to be worse for the minority (HbA1c=8.8 percent) compared with the majority/white (HbA1c=7.6 percent) subgroup.

Detailed Synthesis

As is common with systematic reviews, all of our results for this KQ relied on between-study rather than within-study comparisons, such that the effect of randomization is removed and the results are considered observational and possibly biased through confounding by other study-level characteristics.

Glycemic Control

Initially, we conducted a subgroup analysis on the outcome of HbA1c by baseline glycemic control (HbA1c <7 vs. ≥7 percent) using the pair-wise meta-analysis results for HbA1c at longest followup timepoint (data not shown). For behavioral programs compared with usual care, our meta-analysis showed a small benefit (MD, -0.12; 95% CI, -0.22 to -0.01; I2=3%) for HbA1c for participants with a baseline HbA1c <7 percent (6 trials, 1,239 subjects);194,196,223,246,249,260 the analysis showed greater benefit (although not clinically important) for participants with a baseline HbA1c ≥7 percent (76 trials; 11,086 subjects; MD, -0.32; 95% CI, -0.42 to -0.21; I2=71%). There was no difference in change in HbA1c for persons with baseline HbA1c <7 percent receiving a behavioral program compared with an active control (3 trials, 169 participants; MD, -1.43; 95% CI, -3.57 to 0.71; I2=99%);174,186,201 persons with HbA1c ≥7 percent at baseline had greater reduction in HbA1c after receiving behavioral programs compared with an active comparator (20 trials, 7,709 subjects; MD, -0.18; 95% CI -0.30 to -0.06; I2=38%), but this was not clinically important.

To explore potential moderation of effect based on the factors of interest, we performed a subgroup analysis of our network meta-analysis described in the section for KQ5. We removed the studies in which baseline HbA1c was <7 percent (n=9)174,186,194,196,201,223,246,249,260 and repeated the analysis for a subgroup with baseline HbA1c ≥7 percent; there were an insufficient number of studies with baseline HbA1c <7 percent to run the analysis using these studies, or to perform meta-regression analysis. The results are presented in Table J1 in Appendix J. The categorization of all nodes remained the same in relation to the variables of interest. The changes in this subgroup analysis include: 1) the effect sizes for nodes ranked 1 and 13 reduced substantially to ranks of 31 and 23 (from -1.37 to 0.09 and from -0.45 to -0.15, respectively), and 2) the active (dietary or physical activity) control became less effective (MD -0.14 vs. -0.39) for participants having ≥7 percent HbA1c.

Age

The same set of subgroup analyses performed for baseline glycemic control was conducted for our age subgroups; the study population in nine studies reporting on HbA1c had a mean age ≥65 years.147,155,166,196,203,218,221,223,230,236 We first performed subgroup analyses by age group (≥65 years vs. <65 years) using the pair-wise meta-analyses results for HbA1c at longest followup timepoint in comparisons between behavioral programs and both usual care and active control (data not shown). For behavioral programs compared with usual care, the meta-analysis for participants <65 years indicated that HbA1c reduced to a statistically significant extent at longest followup (76 comparisons; 11.491 subjects; MD, -0.31; 95% CI, -0.42 to -0.21; I2=72%); for older adults the results indicated no difference (7 comparisons; 734 subjects; MD, -0.24; 95% CI, -0.50 to 0.03; I2=55%). For comparisons with active controls for participants <65 years, the benefit of behavioral programs was statistically and clinically significant (26 comparisons; 7,669 subjects; MD, -0.41; 95% CI -0.70 to -0.12; I2=93%). For older adults, behavioral programs compared with an active control (3 comparisons, 206 subjects) failed to reduce HbA1c (MD, -0.23; 95% CI, -0.60 to 0.14; I2=0%).

Subsequently, we performed a subgroup analysis for populations <65 years by removing the data from the studies (n= 9)147,155,166,196,203,218,223,230,236 having mean age ≥65 from our network meta-analysis described in the section for KQ5. The results are presented in Table J2 in Appendix J. The categorization of all nodes remained the same in relation to the variables of interest. The only notable change in this subgroup analysis was that the effect size for the active control of a dietary or physical activity intervention became clinically important (MD, -0.55) although the PB remained at 0 percent.

Race/Ethnicity

We conducted subgroup analyses based on race/ethnicity (i.e. ≥75 percent nonwhite [minorities] and <75 percent nonwhite participants) for the outcome of HbA1c at longest followup for behavioral programs compared to usual care and active controls (data not shown). Using the pairwise meta-analysis for HbA1c when comparing behavioral programs to usual care, there was a clinically important effect for minority participants (33 comparisons; 4,774 participants; MD, -0.42; 95% CI -0.56 to -0.27; I2=55%)137,141,143,151,153,162,171,179,188,189,195,197,205-208,210,215-219,222,228,229,231,233,240,246,247,257,262which was greater than that seen for the comparisons with <75 percent minorities (24 comparisons; 5,110 participants; MD, -0.16, 95% CI -0.31 to 0.00; I2=75%).139,142,147,160,175-177,183,184,194,196,214,220,224,234,235,239,249,253,254,258,259 For comparisons between behavioral programs and active control groups, there was no statistically significant reduction in HbA1c among minorities (5 comparisons, 400 participants; MD, -0.32; 95% CI -0.67 to 0.04; I2=0%);164,173,182,198,230 studies with a larger proportion of white participants also showed no difference (10 comparisons, 6,214 participants; MD, -0.50; 95% CI -1.24 to 0.23; I2=99%).107,168,169,175,184,201,202,252 Glycemic control at baseline appeared to be worse for the minority (8.8 percent HbA1c) compared with the majority/white (7.6 percent HbA1c) subgroup.

We also conducted univariate meta-regressions for each race/ethnicity subgroup. For this analysis, we used outcome data for changes in HbA1c at longest followup in comparisons between behavioral programs and usual care. Table 14 shows the results for each variable examined. No statistically significant finding was generated. The subgroup of majority/white participants appeared to benefit more (with a difference near our threshold of change in HbA1c) from lifestyle programs compared with DSME or DSME plus support, but the results did not reach statistical significance.

Table 14. Results for race/ethnicity subgroups using univariate meta-regressions analyzing the association between different program factors and the effectiveness of behavioral programs compared to usual care in improving HbA1c for T2DM.

Table 14

Results for race/ethnicity subgroups using univariate meta-regressions analyzing the association between different program factors and the effectiveness of behavioral programs compared to usual care in improving HbA1c for T2DM.

Views

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...