U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Pillay J, Chordiya P, Dhakal S, et al. Behavioral Programs for Diabetes Mellitus. Rockville (MD): Agency for Healthcare Research and Quality (US); 2015 Sep. (Evidence Reports/Technology Assessments, No. 221.)

  • This publication is provided for historical reference only and the information may be out of date.

This publication is provided for historical reference only and the information may be out of date.

Cover of Behavioral Programs for Diabetes Mellitus

Behavioral Programs for Diabetes Mellitus.

Show details

Discussion

Key Findings and Discussion for Type 1 Diabetes Mellitus (Key Questions 1–4)

This section presents the main findings, followed by a discussion of the findings for key questions (KQs) 1-4 evaluating the effectiveness of behavioral programs for type 1 diabetes mellitus (T1DM). The key findings for KQs 1 and 2 include a summary of the strength of evidence (SOE) assessments. Further discussion is included in the subsequent sections of this chapter focusing on (1) the applicability of the findings, (2) contextualizing our results within previous literature, and (3) future research needs.

KQ 1. Behavioral Programs for T1DM and Behavioral, Clinical, and Health Outcomes; Diabetes-Related Health Care Utilization; and Program Acceptability

There was moderate SOE showing reduction in hemoglobin A1c (HbA1c) at 6-month postintervention followup with percent HbA1c reduced by 0.31 for individuals who were enrolled in behavioral programs compared with those receiving usual care. For all other timepoints, there was no significant difference in HbA1c; the SOE was low due to risk of bias and imprecise effect estimates. For followup timepoints of 12 months or longer, the 95% CIs included our threshold for clinical importance such that we cannot rule out benefit for behavioral programs based on the available evidence. For individuals who were enrolled in behavioral programs compared with those receiving an active control, there was moderate SOE showing a clinically important reduction in HbA1c of 0.44 percent at 6-month postintervention followup. There was no difference in HbA1c at other timepoints, however the SOE was low and we cannot rule out a benefit for behavioral programs.

There was low SOE showing no difference in adherence to diabetes self-management (i.e., frequency of blood glucose checks or overall self-management behaviors) at end of intervention and 6-month followup for comparisons with usual care. For comparisons with active controls there was insufficient SOE for adherence to diabetes self-management at all followup timepoints. There was moderate SOE of no difference at the end of intervention for generic HRQL, and insufficient evidence at longer followup. In comparisons with usual care, there was insufficient SOE to assess whether there was any effect on diabetes-specific HRQL at any timepoint, and low SOE of no difference for diabetes distress at end of intervention and 6-month followup. There were no data on HRQL for comparisons of behavioral programs with active controls. No trials reported on micro- and macrovascular complications or on all-cause mortality. The SOE grading was highly influenced by the moderate or high risk of bias (ROB) of individual studies, the imprecise estimates of effect, and (for insufficient SOE grades) the limited amount of data.

Evidence was insufficient to determine whether behavioral programs increased or decreased the number of diabetes-related hospital admissions, emergency department admissions, episodes of severe hypoglycemia, or episodes of severe hyperglycemia. Behavioral programs appear to be acceptable to patients with T1DM based on a proxy measure; our meta-analysis showed a 21 percent increased risk of attrition usual care compared with behavioral programs.

KQ 2. Subgroups for Effectiveness in T1DM

For the KQ, we examined the differential effect of patient characteristics on the effectiveness of behavioral programs for T1DM. In comparisons with usual care, results were consistent with those from KQ 1 when combining all studies of youth and adults. At 6 months, behavioral programs reduced HbA1c in youth by a statistically significant 0.28 percent and in adults by a non-statistically significant 0.38 percent. At end of intervention, the point estimates indicated greater benefit within the adult subgroup (0.28) than the youth subgroup (0.00), although neither of these values reached statistical significance. None of the point estimates exceeded the a priori established clinically important difference of 0.4 percent HbA1c.

For subgroups based on age in comparisons with active controls, the small number of studies (and sample sizes) led to wide pooled 95% CIs which in some cases included values of clinical importance both for and against behavioral programs; the SOE was thus graded as insufficient in all but two cases. In studies of youth with followup to 12 months, there was low SOE of a clinically important (reduction by 0.52) benefit for behavioral programs; in studies of adults with 6-month followup, there was low SOE of no difference in HbA1c.

KQ 3. Potential Moderation of Effectiveness for T1DM: Components, Intensity, Delivery Personnel, Method of Communication, Degree of Tailoring, and Level of Community Engagement

To assess whether program factors (i.e., intensity, delivery personnel, method of communication, degree of tailoring, and level of community engagement) moderated the effectiveness of behavioral programs for T1DM, we performed univariate meta-regressions for comparisons between behavioral programs and usual care at longest followup. Program intensity, including duration, contact hours, and frequency of contacts, appeared not to influence program effectiveness; individual delivery appeared more favorable than group delivery of programs but the results did not reach statistical significance. We did not have enough studies to perform multivariable analysis, neither did we have enough to perform the univariate regressions for outcomes other than HbA1c.

KQ4. Harms for T1DM

No studies reported on the associated harms (i.e., activity-related injury) of behavioral programs.

Discussion of Key Findings for T1DM

Overall, behavioral programs seem to have some benefit in T1DM for reducing HbA1c, when followup extends beyond the immediate postintervention period up to 6 months. The delay in benefit may in part reflect the time required for this marker of glycemic control, indicating control over the past 2-3 months, to demonstrate change. Notable though, is the large diversity in program duration whereby end of intervention was anywhere between 1.5 and 25 months. Another contributor may be that a period of time is needed to integrate newly learned self-management behaviors into one's life; however, our findings of no differences in self-management behaviors at any followup timepoint when behavioral programs were compared to usual care do not support this hypothesis. The beneficial findings for HbA1c at 6 months appear to be tempered by the findings of no difference at longer followup timepoints, although we are unable to confidently rule out benefit at long-term followup. An argument that the findings of benefit could be an artifact of differential attrition between groups—with those more motivated to or more successful in making positive changes returning for followup assessment—appears to be unlikely because of the lower (21%) attrition rate found for behavioral programs compared to usual care.

There are at least a couple reasons why our findings may underestimate the effect of these programs should they be implemented in routine practice. The usual care group in several studies received some form of attention from the investigators (e.g., periodic telephone calls to maintain contact and encourage study participation), and this may have resulted in improved glycemic control for the comparator group and reduced the relative effects of the behavioral program. Participants (or their providers) in the usual care or active control groups (not being blinded to group assignment in most studies) may have become more motivated to practice better self-management (including blood glucose regulation using insulin titrations), which could also attenuate differences between groups. Differences in the “usual care” provided may have also played a role, although this affect may be minimal considering recent evidence that variations in standard care in studies of behavioral interventions for youth with T1DM did not significantly impact study results.295

Our finding of a statistically significant and clinically important reduction by 0.44 percent HbA1c at 6-month followup for comparisons between behavioral programs and active controls is notable. As per our operational definition, behavioral programs consisted of interactive programs having a duration ≥4 weeks with the inclusion of behavior change techniques; because of this, traditional, didactic educational91,92,107,108 or support interventions87 were considered comparators rather than interventions. By offering an intervention to both study arms, these studies may have introduced less potential bias from lack of allocation concealment and blinding. Although quite promising, when drawing conclusions regarding the overall benefits of behavioral programs, this finding needs to be interpreted in light of results showing no differences for HbA1c at other timepoints and insufficient evidence to make conclusions about several other outcomes.

Many of the included studies were directed at adolescents. Self-management of T1DM during adolescence is complex, often characterized by personal challenges and uncertainty, transitions to adult care, less frequent health care visits, and diminished parental involvement; consequently, glycemic control deteriorates over the course of childhood and adolescence for many youth with T1DM.296-299 For these reasons, many of the studies included in this review aimed to prevent deterioration of glycemic control rather than to improve it. The statistically significant reductions in HbA1c at 6-month followup (versus usual care), and the clinically important reductions in HbA1c at 6- and 12-month followup (0.60 and 0.52 percent, respectively) in comparisons with active controls in youth lend substantial support for these programs. Likewise, incorporating more demanding self-management behaviors may negatively impact social and emotional functioning, such that our findings of no difference in generic HRQL at end of intervention may be interpreted as positive.

Most studies for T1DM were undertaken in populations with baseline glycemic control ≥8.5 percent HbA1c. While this may affect the applicability of the findings to some extent, clinicians may view this as highly relevant to their patient population of which many—particularly in their pubertal years—are struggling to achieve optimal control. Furthermore, the Diabetes Control and Complications Trial (DCCT)20 found that these individuals receive the greatest benefit from HbA1c reduction.

For T1DM, there was evidence that effectiveness appears not to be moderated by program intensity (i.e., duration, contact hours, or frequency of contacts), and that delivery to individuals compared with groups may be beneficial. We were unable to undertake any analysis to comment on the difference between educational and lifestyle programs, or the addition of a support component to DSME programs. Many individuals with T1DM under good glycemic control may have other risk factors (e.g., overweight, hyperlipidemia, hypertension) for which lifestyle programs may be warranted. Although some behavioral programs were of fairly long duration with highly intense contact with patients,88 only one explicitly incorporated a support component.82

Our pair-wise meta-analyses used the Hartung-Knapp-Sidik-Jonkman random effects model73-75 that typically provides a more conservative estimate of the 95% CI around pooled effect sizes than the common DerSimonian and Laird approach; the latter approach has been shown to lead to too many statistically significant results especially in the face of heterogeneity and few studies. The effect of our approach is that some results—especially those pooling few studies—are found statistically nonsignificant when another approach may find significance; moreover, the 95% CI in some cases spreads wider than those of the individual studies. For example, our reported 95% CI for the effect on HbA1c for youth receiving a behavioral program compared with an active control at 6-month followup is -2.56 to 1.36 (not significant due to inclusion of 0 [no effect]), although the DerSimonian and Laird approach provided an estimate of -0.95 to -0.25 (significant). This factor also applies those findings for T2DM on the overall effectiveness of behavioral programs across all outcomes.

Key Findings and Discussion for Type 2 Diabetes (KQs 5 and 6)

This section presents the key findings for type 2 diabetes mellitus (T2DM). We begin by summarizing the effectiveness of behavioral programs across our key outcomes, based on comparator (i.e., usual care or active controls) and followup timepoint. Thereafter, we provide a brief summary and discussion of the findings for KQs 5 and 6 evaluating the potential of program components and delivery factors to moderate the effectiveness of behavioral programs for T2DM. Further discussion is included in the subsequent sections of this chapter focusing on (1) the applicability of the findings, (2) contextualizing our results within previous literature, and (3) potential needs for future research.

Effectiveness of Behavioral Programs Across Outcomes

There is evidence showing a beneficial effect of behavioral programs, compared to both usual care and other active interventions, at end of intervention for glycemic control; however, at longer followup results were only statistically significant at 6 months for comparison with active controls, and none of the results were considered to be clinically important based on our threshold of a 0.4 percent change in HbA1c. There was substantial statistical heterogeneity in these pairwise meta-analyses, supporting our subsequent analysis for KQs 5 and 6 to determine which program factors, and population characteristics, influence (and optimize) the effects.

Behavioral programs showed some benefits in terms of reducing BMI (0.21-0.92 kg/m2 to 12-month followup), weight (1.3-1.68 kg; end of intervention) and waist circumference (3.2 cm; short term), and daily energy intake (64-150 kilocalories per day to 6 months)—mainly when compared with usual care. There was little evidence around the outcomes related to changes in physical activity and medication adherence, and findings were consistently of no difference.

Health-related quality of life was reported by fewer studies than anticipated. On average, findings of no difference were found for most studies and outcomes, except for Diabetes Distress where results favored behavioral programs compared with usual care at end of intervention but not at longer followup. Effects on diabetes complications were only reported for one study. Diabetic retinopathy was reduced by 14% and very-high-risk kidney disease by 31% in participants receiving a ≥8 year-long intensive lifestyle program compared with didactic education and support in the largest trial, conducted by the LookAHEAD research group.278,292 Mortality between behavioral programs and active control groups (5 comparisons; 6,050 participants) was 14 percent lower for those receiving behavioral programs (RR, 0.86; 95% CI, 0.77 to 0.96). There was no difference for comparisons with usual care (25 comparisons; 4,659 participants; RR, 1.28; 95% CI, 0.84 to 1.94).

KQ 5. Potential Moderation of Effectiveness for T2DM: Components, Intensity, Delivery Personnel, Method of Communication, Degree of Tailoring, and Level of Community Engagement

In a network meta-analysis with usual care serving as the main reference, programs demonstrating effect sizes for HbA1c at or above our threshold for clinical importance (i.e., 0.4 percent HbA1c difference between groups) represented all three major program component categories of diabetes self-management education (DSME), DSME and support, and lifestyle. The effect sizes of minimally intensive DSME programs (≤10 contact hours) were less than our threshold for clinical importance, but were all higher than that of educational interventions not meeting our criteria for a behavioral program (e.g., didactic education programs represented by many active controls). Programs having larger effect sizes and higher probabilities of being best (≥5 percent) were more often delivered in person rather than including technology. All effective programs using some form of technology were of moderate or high intensity.

Lifestyle programs resulted in the largest effect sizes for BMI. Program intensity appeared less important than method of delivery; providing some individual (rather than solely group-based) delivery appears beneficial for improvements in BMI at longest followup.

KQ 6. Subgroups for Factors Moderating Effectiveness in T2DM

All of our results for this KQ relied on between-study rather than within-study comparisons, such that the effect of randomization is removed and the results are considered observational and possibly biased through confounding by other study-level characteristics.

In terms of overall effectiveness at longest followup for HbA1c, participants with suboptimal or poor glycemic control (≥7 percent HbA1c) appear to benefit more than those with good control (<7 percent) from behavioral programs when compared to usual care and active controls. The effect sizes were not clinically important for either group. Few differences were evident when evaluating potential moderation by program factors after rerunning the network meta-analysis of KQ 5 with a subgroup of studies having participants with suboptimal or poor baseline glycemic control.

Older adults (≥65 years) did not benefit at longest followup in terms of reduction in HbA1c from behavioral programs in comparison with usual care or active controls. In adults <65 years, the effect size for behavioral programs compared with usual care was statistically significant (reduction of 0.31 percent) and compared with active controls at longest followup was clinically important (0.43 percent). In a subgroup analysis of our original network meta-analysis of HbA1c—removing the studies of participants with a mean age ≥65—the most noticeable change was the increase in effect size for active controls incorporating dietary or physical activity interventions, which produced clinically important effects (0.55 percent reduction in HbA1c). The active controls still showed zero probability of success, perhaps due to the heterogeneity between, or small sample sizes of, the associated comparisons.

In comparison to usual care and active controls, behavioral programs offered to predominantly minority participants (≥ 75 percent nonwhite) appear to provide more benefit for glycemic control than those offered to populations with a lower proportion (<75 percent) of nonwhite individuals. The effect size for minority participants reached clinical importance when comparing behavioral programs to usual care (0.43 percent reduction in HbA1c). Based on univariate regression analyses for the subgroups based on race/ethnicity, none of the program factors (e.g., intensity, delivery personnel) reached statistical significance for influencing the effectiveness of behavioral programs compared to usual care on HbA1c. Lifestyle programs appeared favorable over DSME or DSME plus support for the group of studies (n=24) with predominantly white individuals (p=0.07).

Discussion of Key Findings for T2DM

The focus of our review for T2DM was on identifying factors contributing to the effectiveness of multicomponent programs. Our review includes the highest number of studies to date, and focuses on programs meeting current recommendations to change patient behaviors and patient-important outcomes (e.g., HRQL). We relied on strict inclusion criteria to study interactive programs incorporating behavioral strategies aiming to change multiple behaviors, without confounding by changes to medical management (e.g., medication changes, differing frequency of provider visits). Another strength of the review is our analytical approach; the network meta-analysis enabled differentiation of the various comparators, and incorporation of comparisons (e.g., intervention vs. intervention) often not amenable to other strategies. Moderate- and high-intensity (≥11 hours contact time) programs appear to be necessary to provide individuals with clinically important effects on HbA1c; this outcome may also benefit from in-person delivery rather than using technology. For BMI, providing some individual delivery, rather than solely relying on group formats, appears beneficial.

Our review adds to previous findings in that lifestyle programs—not specifically training people in diabetes related self-care behaviors but focusing more on weight reduction and increases in physical activity—may provide similar or more benefit than DSME programs for improving glycemic control for individuals with T2DM. A feature of behavioral programs that may be particularly attractive to patients is that unlike some common drug therapies used in the management of type 2 diabetes, behavioral programs have the potential to reduce HbA1c without contributing to weight gain. Our review confirms previous suggestions that programs with an interactive nature, employing behavioral approaches and covering multiple behaviors, are beneficial when compared with didactic educational interventions. Although perhaps not to a clinically important degree for individuals, the burgeoning growth of this disease means that even small gains in glycemic control from behavioral programs may serve as a substantial benefit for public health.

Our finding that single-topic, non-educational interventions (active controls of dietary or physical activity interventions) offer more benefit than do basic education interventions, supports the need to carefully distinguish and account for different comparators during the systematic review process. We used longest followup timepoint for the analyses to answer KQ 5 and 6, which may capture the “durability” of the programs better than restricting the analysis to the immediate postintervention period.

It appears from our network meta-analysis results for HbA1c, that both individual and group delivery can be beneficial; this agrees with other work in this area300 (also see below section on Findings in Relation to What is Already Known). Our results for KQ6 suggest that other factors (or combination of factors) may influence the effects of this variable; for instance, delivery format may be highly dependent upon the population served and program content. Studies within nodes having high effect sizes which offered programs in groups tended to be those offered to minorities, including Mexican Americans,150,152,208 where support from peers was incorporated as a key program feature.

We were unable to draw any conclusions about the choice of delivery personnel from the network meta-analysis when answering KQ 5; there were too few studies in the categories of DSME and support, and lifestyle to account for this variable when creating the nodes. Drawing from the pair-wise meta-analysis results for those trials comparing two or more interventions (i.e. comparative effectiveness), there may be no difference when program delivery is conducted by heath care professionals or by lay providers (e.g., peers with diabetes, community health workers). Four trials (575 subjects) found no difference (MD, 0.00; 95% CI, -0.23 to 0.23)144,172,227,256 in effectiveness when programs were delivered by peers compared with health care professionals. One trial (72 subjects) found no difference when the support phase of DSME was provided by clinic staff compared with diabetes educators (MD, 0.02; 95% CI, -0.60 to 0.64).227 Most trials reported on extensive training programs for those delivering their programs. One reason why programs delivered by health care professionals were not superior may be that physicians, nurses, and dietitians receive little or no training in behavioral techniques as part of their formal education. This may be particularly true when extensive knowledge and expertise in theoretically guided approaches (e.g. motivational interviewing), or several behavior change techniques are required. Diabetes educators, highly regarded for their thorough knowledge and skills in diabetes education, may require substantial training and supervision when starting to apply advanced behavioral techniques such as motivational interviewing; to date this technique has shown benefit for improved glycemic control in the short term when delivered by clinical psychologists232,250 but not by diabetes educators.248 It could be speculated that the benefits for glycemic control may improve with time after those delivering the programs gain experience.

Our findings for KQ 6 suggest that people with good baseline glycemic control (<7 percent HbA1c), advancing age (≥65 years), and white/European ancestry (studies not having a majority of minority participants) may not benefit to the same extent as participants with suboptimal or poor glycemic control, racial/ethnic minorities, and those of younger age. The finding of better success for patients with poorer glycemic control has been found in previous systematic reviews (for one example see Duke et al.300). Intuitively, individuals with good glycemic control may not achieve as much benefit from behavioral programs—there is little room for improvement and good self-management behaviors may already be practiced regularly. Our findings may have been different if we had chosen a different level of glycemic control for subgroup analysis; after consultation with several experts we were unable to define a “poor control” cut-point. Some caution is warranted when considering our findings for the age subgroups; there were limited studies where the average participant age was ≥65 years, as specified for our subgroup analysis. Moreover, we relied on between-study differences for these subgroup analyses rather than within-study analysis for individual programs. Many trials included a broad range of ages up to 72 years, and the median age of the entire sample in this review was 58; the overall applicability of the results for KQ5 appear to apply to middle- and older-aged adults. Results may have differed for other patient-important outcomes such as quality of life; however, there were insufficient data for these analyses.

The findings for ethnicity need to be interpreted in light of our method of analysis and differences in baseline glycemic control between subgroups. Glycemic control appeared to be worse for the minority (HbA1c=8.80 percent) compared with the majority/white (HbA1c=7.60 percent) subgroup; it is thus hard to distinguish if ethnicity or glycemic control is more likely to have the greater influence in moderating program effectiveness. Ethnic minority groups have been shown to have higher HbA1c levels than Caucasian groups; this finding holds after adjusting for factors affecting glycemic control (i.e., age, sex, BMI, duration of disease, mean plasma glucose) and thus may not be influenced by behavioral programs.301 Conversely, a systematic review by Nam et al.302 which found benefit for culturally tailored diabetes education, found that lower baseline HbA1c levels better predicted positive responses to the programs. There are likely additional factors involved. Many investigators enrolling a large proportion of ethnic minorities in the trials included in this review adapted programs in ways to make them more culturally and linguistically acceptable—often including peers in the delivery or social support groups—which may have enhanced their effectiveness. Our reliance on study-level data to create subgroups (i.e., the entire study was delivered to minorities) may have limited our ability to capture differences in effects from programs delivered to a wider population base, which may reflect routine practice in many community health settings.

Although our discussion has centered on our findings related to our KQs, which focus on effect moderation, the important benefits shown by the LookAHEAD research group252 should be highlighted. Reduction in retinopathy by 14 percent and nephropathy by 31 percent in those participating in a long-duration, intensive lifestyle program cannot be ignored.278,293 Additionally, our findings from pairwise meta-analysis of 14 percent reduced mortality between those receiving behavioral programs and active controls was heavily influenced by the large weight (contributing to >50 percent of the pooled effect) of this study in the analysis.

Findings in Relation to What Is Already Known

For T1DM, this review provides a current examination of the effectiveness of behavioral programs for multiple outcomes and across all age groups. Few systematic reviews have been conducted over the past decade,3,5,6 and most reviews have assessed the effects of a broad range of interventions (some of which were didactic education or single topic interventions) in diverse settings.3,4,6,7 All we identified have focused on children and adolescents, and several included newly diagnosed patients. When calculated, effect sizes for glycemic control and psychosocial outcomes in general demonstrated very modest improvement at longest followup.4,5 [Of note, much previous work reports results using a standardized effect size measure, rather than an unstandardized mean difference in absolute value of percent HbA1c, as used in this review. Our results of 0.31 (vs. usual care) and 0.43 (vs. active control) percent reduction at 6-month followup represent approximately a 0.22 and 0.28 standardized effect size, respectively, which are commonly considered small].303 Our results which incorporate more recent and larger studies confirm the findings of previous reviews.

In their systematic review and meta-analysis in 2006, Murphy et al.6 called for larger, multicenter trails to better investigate the effects of psychoeducational interventions for T1DM. They also stated that no adequately powered RCT had proven effective for patients with poor glycemic control. Our review included reports from two multicentre trials (one by these authors) comparing behavioral programs (clinic-integrated group family sessions focused on family teamwork,102 and DSME with motivational interviewing and solution-focused brief therapy85) to standard care and enrolling patients with poor glycemic control (baseline HbA1c ≥9 percent in both trials).85,102 Neither study found benefit in terms of HbA1c. These authors also noted a need to determine if content or contact was what mattered most; studies (n=2) in their review that compared intervention to attention/active controls showed little effect due to improvements for the comparator group.6 Our finding of a higher effect size for comparisons with active controls than with usual care (at 6 months) suggest that content may have an effect. In a 2000 review, Hampson et al.4 noted that outcomes should be evaluated at an appropriate time to reflect the impact of the intervention. Our results for glycemic control seem to agree with this assertion; HbA1c improved at 6-month followup but not at end of intervention which may have reflected the sensitivity of this outcome marker.

Several systematic reviews have performed some form of analysis to identify factors moderating the effectiveness of self-management and educational programs for T2DM. In 2002, Norris et al.51 reported on a meta-regression examining several factors including intervention characteristics (e.g., program duration, number of contacts, contact time, group vs. individual delivery) on effectiveness of self-management education for HbA1c from 37 comparisons; the authors also evaluated the effectiveness based on baseline glycemic control and age. The only significant factor was the total contact time, with the authors concluding that HbA1c was reduced by 0.04 percent for every additional hour of contact time, over the range 1-28 hours. However, the meta-regression was conducted for comparisons of the educational interventions with a combination of usual care and active controls (“additional care delivered”)—several of which received the same contact time as the intervention group. When considering this factor, there was a nonsignificant positive relationship between the differences in contact time and improved HbA1c. Although our review took a different approach by using a network meta-analysis to incorporate a large suite of comparisons, we found very similar results—most programs showing effect sizes at longest followup (to 12-months) in the clinically important range have contact times in the moderate- or high-intensity categories (≥11h) and the mean contact time was 26.4 hours. We were also able to confirm that active controls (especially didactic educational programs) offer less benefit in reducing HbA1c than do behavioral programs meeting our operational definition.

Another group led by Norris31 undertook regression analysis to investigate similar factors for 22 weight loss interventions for people with T2DM. The authors found no significant interaction with followup interval, duration of intervention, intervention contacts, or baseline weight. Unlike the previous work, the authors separated out comparisons by comparator group and thus had little data (2-6 studies) for each analysis. Both reviews led by this author31,51 included studies evaluating interventions focusing on one behavior (e.g., diet only), and studies where the effects of the intervention could not be clearly distinguished from that of additional disease/care management components.304,305 This may explain in part why our effect sizes for HbA1c at end of intervention are smaller than that (0.76 percent) found by Norris et al.51

Shortly after the work by Norris and colleagues, another group used a similar approach to analyze which variables within an educational intervention best explained the variance in glycemic control. Evaluating HbA1c results assessed immediately after 28 interventions, Ellis et al.54 found a similar effect size as our results (0.32 percent reduction) and that face-to-face (i.e., in-person) delivery, cognitive reframing teaching method, and inclusion of exercise content collectively explained 44 percent of the variance in HbA1c. Their failure to obtain significance for the “dose” of the interventions was suggested by the authors to reflect the lack of variation in the dose of interventions; they suggested that a better marker than number of contacts or duration of intervention may have been total contact hours or a combined variable (such as our use of contacts per month for the univariate meta-regressions). Since all of the interventions examined included a diet component, the benefit from adding an exercise component would seem to suggest these were what we usually classified as lifestyle interventions. Our results for KQ 5 are similar, in that they suggest in-person (face-to-face) delivery may be more efficacious than delivery via technology for patients with T2DM.

We can also compare our findings to those of three more recent reviews. Chodosh et al.46 examined essential components of chronic disease self-management programs (diabetes, hypertension, and osteoarthritis) and found statistically significant differences for diabetes programs (n=26) that provided feedback (e.g., support after self-management program completion); this effect was consistent across the outcomes of HbA1c, blood glucose, and weight. This finding reflects our results—suggesting DSME and support programs have higher efficacy than DSME programs—although the overall effect reported by these authors (0.81 percent) is higher than ours; again this difference in effect size may reflect an overestimate of effects of self-management interventions by inclusion of studies which include changes to medical management.306,307 In a qualitative examination of 11 interventions showing beneficial effects for socially disadvantaged populations, Glazier et al.55 observed several factors contributing to effectiveness, including one-to-one interventions, providing feedback, and high intensities with >10 contact times delivered over a longer period of time (≥6 months). These are consistent with our findings. The findings for feedback, or “booster sessions”, and providing >10 contact hours were also found by Fan and Sidani48 in another qualitative comparison of effect sizes of 50 RCTs. These authors also observed that larger effect sizes were found for one-on-one or mixed formats versus group formats; our results with respect to delivery method were inconclusive.

Our findings for KQ 5 are similar to those of previous work, although we have provided some new insight from use of a larger sample of studies, exclusion of programs not meeting current recommendations or introducing possible confounding by medical care variation, and an innovative analytical approach to assess multiple variables and account for a suite of comparisons not always applicable to other techniques.

Applicability

Type 1 Diabetes

The inclusion criteria for most studies did not specify a minimum HbA1c level; however, for all studies the mean HbA1c was over 7 percent. For most (70 percent), the mean HbA1c was over 8.5 percent. The results of this report may only be applicable to individuals with suboptimal and poor glycemic control.

For studies targeting youth, the mean age across most studies ranged from 12 to 15 years. Therefore, the results should be generally applicable to older children and adolescents. One trial targeted younger children (8 to 12 years);100 it is unclear whether the results of this report are applicable to younger children.

For studies targeting adults, the mean age across studies ranged from 30 to 49 years. No studies specifically targeted older adults (≥65 years), therefore it is unclear if the results are applicable to older adults.

Approximately 50 percent of studies specified that participants have a minimum duration of T1DM of ≥1 year. For studies that targeted youth, the mean duration of diabetes ranged from 2.7 to 7.3 years. The results of this report may only be applicable to children and adolescents who have been diagnosed with T1DM for at least 2 years. For studies that targeted adults, the mean duration of diabetes ranged from 7.5-23 years. It is unclear whether the results of this report are applicable to adults whose T1DM has been recently diagnosed.

We did not find evidence to confirm or refute whether behavioral programs are more or less efficacious for other subgroups, including sex or racial or ethnic minorities.

All of the studies targeting adults were conducted in the United Kingdom, Europe, or New Zealand. It is unclear whether the results from these studies are applicable to community health settings in the United States. For youth, most studies (70 percent) were conducted in the United States; the remaining studies were conducted in Europe and Australia. Despite potential differences in settings and health systems, results were similar across the studies.

The studies were conducted primarily in outpatient diabetes clinics affiliated with a secondary or tertiary care hospital. Our findings are generally applicable to these settings in the United States.

Type 2 Diabetes

The range of baseline HbA1c in the included RCTs was 6.3-12.3 percent (median=8.0) which would appear to make the results of this review applicable to the majority of people enrolling in behavioral programs. We conducted subgroup analyses for KQ 6 based on baseline glycemic control (<7 vs. ≥7 percent HbA1c) at the study level, which provided some insight into the relative effectiveness based on this level of glycemic control. This analysis may be limited by the small number of studies in the <7 percent subgroup (n=9 RCTs) and because the analysis was based on between-study rather than within-study variability in glycemic control which may not accurately reflect differences for individual programs. The results of this report are therefore most applicable to people having HbA1c levels ≥7 percent.

The range of mean ages in the included studies was 45-72 years (median=58), therefore the results of the pairwise meta-analyses on overall effectiveness and of the analysis for KQ 5 are most applicable to middle- and older-aged adults. Our subgroup analysis for KQ 6 based on age (<65 vs. ≥65 years) provided some data on the relative effectiveness for these age groups, but similar to that for baseline HbA1c, may be limited by the small sample of studies on older adults (n=9) and our analytical approach. Our exclusion criteria related to duration of diabetes (mean <1 year)—implemented in order to capture programs providing training in ongoing self-management and lifestyle behaviors—limits the relevance of this review for newly diagnosed patients. The mean duration of diabetes ranged from 1-18 years with a median of 8.1 years. No study performed subgroup analysis based on duration of diagnosis (≤1 vs. >1 year) and we were unable to perform this at the study level because the mean in all cases was above 1 year. The results appear to be applicable to both men and women, and for people on a variety of diabetes treatment regimens (19.2 percent were on insulin). Overall, there was fairly good representation of individuals reporting a minority racial/ethnic background. Subgroup analysis based on those studies reporting of race/ethnicity (24 comparisons for <75 percent minorities vs. 33 comparisons for ≥75 percent minorities) was conducted to increase the relevancy of the findings to these population groups.

The results seem applicable to community health settings in the United States. The majority (63 percent) of trials were conducted in the United States, and based on our inclusion criteria related to Human Development Index62 all studies were performed in countries of similar development status. Some trials were conducted in academic settings in health fields—thought to have application in community health settings—although there may be some differences if these programs were delivered in different settings. Although details were reported inconsistently, health systems differences (i.e., usual care) may vary widely between study populations and could potentially influence the results obtained from behavioral programs. The effect from this difference should be minimal for this review, since we limited our results to changes from baseline between groups randomly assigned and judged to receive similar medical care.

Limitations of the Comparative Effectiveness Review Process

This review followed rigorous methodological standards, which were detailed a priori. Nevertheless, several limitations are inherent within systematic reviews in general.

First, there is a possibility of selective reporting bias (e.g., researchers only reporting positive outcomes) and publication bias, whereby unexpectedly strong results from large trials are selectively reported. In terms of selective outcome reporting, we were able to locate several trial registries and protocols to compare planned and published outcome reporting; most studies included in this review were judged as having low bias in this respect. We may have missed some reports of behavioral programs in diabetes, particularly those showing weak results. We believe publication bias is minimal: (1) our literature search was comprehensive, systematic, and included published and unpublished literature (e.g., some reports were located by contacting authors of studies published in abstract form256 or without data on our outcomes of interest);90 (2) there was large variation in effect sizes reported; and (3) we did not have a minimum sample size for inclusion, and several of the included studies were small. Visualization of funnel plots did not suggest publication bias, and using the Egger test78 for our outcome with the most data (HbA1c) resulted in no significant indication of bias for comparisons with usual care (p=0.25) or active controls (p=0.21) at end of intervention. Selected studies were confined to the English language because we felt that these reports would be most applicable to the end-users of this review who create recommendations or implement programs for people with diabetes within the United States. Moreover, effect sizes in language restricted reviews have shown to not differ significantly (overestimating effect sizes by 2 percent) from those not having restrictions.308 Study selection bias was limited by having two independent reviewers perform screening and selection; we feel confident that study exclusion was based on explicit and appropriate reasoning which was clearly understood by reviewers.

Our decisions on study design were based largely on the availability of studies employing designs having lowest potential for bias. For T1DM, we expected to have a limited amount of evidence from RCTs, so we included other controlled studies. For T2DM, we only included RCTs which may have left out some studies evaluating outcomes and issues of relevance to this review. The body of evidence from RCTs was known in advance to be large, and provided 132 primary reports of trials undertaken in many health settings with diverse populations. In addition, adding non-RCT evidence would have substantially increased the potential bias in results. Behavioral interventions are already moderately complex—in terms of variability in social and environmental contextual factors—and trials of such interventions rarely include blinded allocation or outcomes assessment; because of these factors we thought it desirable to avoid additional limitations arising from selection bias and confounding, for which non-RCTs and observational studies are more prone.

The interventions evaluated in the included trials were highly diverse in their content, delivery, and setting. Our inclusion criteria attempted to reduce some of the diversity by including studies of interventions meeting a fairly rigid operational definition of a behavioral program. We also excluded studies where the effects of the behavioral program could not be isolated (e.g., due to confounding by differences between groups in medical care management), where the patient population would not have already received previous basic education (e.g., enrollment of only newly diagnosed patients), and when the setting was not applicable to community health settings in the United States. Furthermore, we categorized the comparators into three groups to avoid further complexity in comparisons. Our categorization of the comparators and interventions was based on the factors of interest in this review, was informed by previous literature and input from our Key Informants and Technical Expert Panel, and was based in several cases on multiple reviewer deliberation and consensus. Nevertheless, we likely did not capture all factors of importance to some stakeholders. The diversity in programs and other contextual factors was apparent when considering the high heterogeneity in results from the pairwise meta-analysis for HbA1c and some other outcomes in T2DM. Our analyses in KQs 3, 5 and 6 related to factors influencing the effectiveness of behavioral programs for both T1DM and T2DM.

Our analyses for T2DM should be interpreted based on our approaches to address program durability and the relatively high-level categorization of program components. Our network meta-analyses and subgroup analyses used outcome data at longest postintervention followup, which for the majority of studies was end of intervention (i.e., after all contact between participants and program personnel ceased) or, for fewer, between 1-6 months followup. Only 8 of 112 trials had followup longer than 6 months. This approach was used to include as many studies as possible (i.e., those that did report data for end of intervention) and also to reflect the durability of the programs in terms of their potential for impacting long-term health. Our results from the pairwise meta-analyses for HbA1c in T2DM at each followup timepoint indicated reduced effectiveness at followup durations longer than end of intervention; this suggests that the mean effect sizes from our network meta-analysis at longest followup may underestimate the effects at end of intervention.

One of the reasons to differentiate between DSME and DSME plus support was to account for the variation in intensity between these categories, due to the support or maintenance phases (having lower contact frequency) in DSME plus support programs. Our definition of end of intervention was standardized for all programs, rather than taking into account any distinct phases within programs. There was large variation between programs in terms of the distribution of contacts, including the reporting of such, and attempting to capture effects based on relative intensity within programs or specific to the maintenance phase would have been difficult and unreliable. Because of this, one might have anticipated that the effects from DSME programs (without a maintenance phase) would have been higher than other programs having the same overall contact time. This does not appear to be the case, and our results would suggest that adding a support phase (often offering psychosocial support and/or behavior change strategies targeting behavior maintenance) was an important program feature of many lifestyle and DSME plus support programs regardless of the distribution of visits.

As stated in the Results chapter, we did not include program tailoring and degree of community engagement in the analysis for KQ5; these factors were considered to overlap in meaning to some extent with delivery method (e.g., use of technology enhancing tailoring) and delivery personnel (e.g., use of non-health providers providing community engagement), and the ones we used were thought to better represent the differences between the programs assessed in this review. With our focus on programs incorporating interaction with program personnel, we cannot comment on the effects of programs delivered entirely by way of technology which may provide sophisticated mechanisms to interact with and motivate participants or closely monitor disease management. Cost analysis of implementing differing behavioral programs was not addressed in this review.

Limitations of the Evidence Base

The evidence base was inadequate to fully answer the Key Questions, particularly with respect to the limited number of outcomes evaluated in several studies. We were unable to fully evaluate all outcomes of interest for several KQs. For KQ 1 for T1DM, there were limited data available to assess the SOE for many outcomes, including behavioral outcomes related to changes in dietary intake or physical activity, and clinical and health outcomes apart from HbA1c and HRQL. Our assessment of factors contributing to effectiveness of behavioral programs for T1DM (KQ 3) was limited to the outcome of HbA1c and to univariate meta-regressions (rather than network meta-analysis to simultaneously examine multiple comparisons and factors) because too few studies provided data on other outcomes. No studies contributed data for our assessment of harms (KQ 4). For KQs 5 and 6 related to T2DM, our network meta-analysis allowed for multiple comparisons but there were still too few studies reporting on outcomes besides HbA1c and BMI to enable meaningful groupings into nodes to examine multiple factors simultaneously. Considering that behavioral changes are the key mediators to achieving clinical and health outcomes, analysis based on valid outcomes of changes to physical activity or diet would be ideal; greater use of these outcomes, especially via objective means, would be beneficial. The meta-regressions used for the subgroup analysis on ethnicity in KQ 6 are limited by comparator (only usual care) and did not allow us to capture multiple variables in a single analysis. In addition, our subgroup analyses for KQ 2 and 6 were mostly limited to indirect methods (i.e., relying on between-study rather than within-study comparisons). Several outcomes of importance to patients and policymakers, such as quality of life, development of complications, and health care utilization, were reported by few studies to confidently support conclusions of effect, or to analyze in terms of moderation by program factors.

Many trials had methodological limitations introducing some ROB. Blinding of participants and personnel are arguably difficult for trials of behavioral programs especially when the comparator is usual care. According to our decision rules for assessing ROB, a low ROB for participant and personnel blinding was granted if the comparator was an attention or active control and the authors stated some means to blind the study hypothesis from participants, and if there was a structured training and protocol followed for the personnel. Participant blinding in this manner was rarely reported. Lack of blinding of participants, and their healthcare providers, may result in underestimation of the effects of behavioral programs compared to comparators, due to cointervention; adjustments of insulin or oral antidiabetic medications may have been performed to a greater extent in the comparison groups than in the intervention groups. This effect may have been heightened because none of the studies we reviewed included any limitations or restrictions on adjustment of insulin or other medications. Blinding of outcome assessors was also rarely reported, despite the high feasibility of ensuring this procedure. These two domains resulted in medium or high ROB being assigned for most trials for their subjective outcomes. For both subjective and objective outcomes, medium or high ROB was assigned in many cases from lack of intention-to-treat analysis (e.g., only reporting on results for completers) and/or from high participant attrition. Despite our inclusion of only RCTs, some studies had small sample sizes and a few failed to achieve adequate baseline comparability in demographic or clinical characteristics.

Research Gaps

Table 15 highlights some potential research needs based on our KQs.

Table 15. Potential research needs, by Key Question.

Table 15

Potential research needs, by Key Question.

Conclusions

This systematic review found that behavioral programs (especially DSME) for T1DM have some benefit on glycemic control when followup extends beyond the immediate postintervention period up to 6 months after the program. There was no significant difference at end of intervention or followup longer than 6 months, although our confidence in these findings is low and we cannot rule out benefit. There was no difference in generic HRQL at end of intervention, or in diabetes distress or self-management behaviors at up to 6-month followup, although the SOE was low for these findings with the exception of generic HRQL at end of intervention (moderate SOE). Data were insufficient to draw any conclusions for other timepoints for generic HRQL, diabetes distress, and self-management, and for other outcomes including diabetes-specific HRQL, change in body composition or lifestyle behaviors, micro- and macrovascular complications, and mortality. Encouraging patients with T1DM to participate in behavioral programs to improve outcomes apart from HbA1c is not supported by the current evidence.

For T2DM, our analyses showed limited benefit in glycemic control from DSME programs offering ≤10 hours of contact with delivery personnel, and suggested that in-person delivery of behavioral programs is more beneficial than incorporation of technology. We found that programs focused on lifestyle or on DSME can have similar benefit in terms of glycemic control, and that lifestyle programs appear better for reducing BMI. Whether the behavioral program is delivered by a health care professional or a trained lay person, or via individual or group format appears less important based on the available evidence. Behavioral programs seem to benefit individuals having suboptimal or poor glycemic control more than those with optimal control. Tailoring programs to ethnic minorities—such as offering culturally appropriate materials and incorporating group interaction with peers—appears beneficial. While efforts should be made to provide culturally sensitive programs, community health settings that serve populations that are diverse in language and ethnicity may not have the opportunity to provide this flexible programming to meet each group's needs.

The finding that behavioral programs offer some benefit in terms of glycemic control in individuals with diabetes underscores the need for care providers to be educated in behavioral techniques, and related topics such as facilitating support groups and family communication training—something that is often missing within the formal training of physicians, nurses, dietitians, and pharmacists. This review was unable to assess the differential effects on program success by single versus multiple health care providers, or by delivery teams having differing compositions of providers (including trained lay professionals)—this topic deserves further evaluation. Few trials evaluated patient-important outcomes (e.g., quality of life) in a manner to pool results across studies. Use of widely accepted quality of life measures would be beneficial.

Efforts at integrating behavioral programs into care settings that incorporate the latest management guidelines should be prioritized. Program evaluation is an important component to build into the implementation of any behavioral program for diabetes, to ensure that it is the correct fit to be effective for the population that it is attempting to serve. At this time, there remains a need for clinicians to evaluate each patient's success after participating in these programs, should additional means be necessary to control their disease more adequately to prevent devastating complications.

Views

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...