U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

ECRI Health Technology Assessment Group. Determinants of Disability in Patients With Chronic Renal Failure. Rockville (MD) : Agency for Healthcare Research and Quality (US); 2000 May. (Evidence Reports/Technology Assessments, No. 13.)

  • This publication is provided for historical reference only and the information may be out of date.

This publication is provided for historical reference only and the information may be out of date.

Cover of Determinants of Disability in Patients With Chronic Renal Failure

Determinants of Disability in Patients With Chronic Renal Failure.

Show details

Appendix E: Sample Analysis of DMMS Wave 2 Data

The purpose of this section is to illustrate the methods that could be used for analyzing data such as DMMS Wave 2 if such data were complete, reliable, and generalizable to the population of interest. For the purposes of this illustration, we have completed two analyses -- one that includes only laboratory variables, such as might be most applicable to SSA's disability sequential evaluation process, and a second that includes both laboratory and demographic variables recorded for patients at the beginning of the DMMS Wave 2 study. The outcome measures of interest for this illustration are working status 9-12 months after dialysis initiation and self-reported ability to work 9-12 months after dialysis initiation. These are used as surrogate measures of ability to work.

Death within 1 year is another outcome measure of interest to SSA and was used as the outcome measure in the illustration in the Results section of this report. Because of the potentially large number of predictor variables and the small number of patients who died within 1 year, statistical analysis results would be unreliable; and therefore this outcome measure has been omitted from further study.

General Description

There are several steps that may be taken to identify the best predictors of inability to work (as indirectly measured by employment status and self-reported ability to work). To evaluate SSA's current Listings using the DMMS Wave 2 data, the following steps would be necessary:

  1. Imputation of missing data.
  2. Recoding of data as necessary for regression analysis.
  3. Logistic regression of current Listings, with ability to work as the outcome measure and the items in the Listings as the predictor variables.
  4. Receiver operating characteristic (ROC) analysis of diagnostic performance of the current Listings, using currently defined cut-points for distinction of positive and negative cases.

However, analysis of current Listings cannot be illustrated here because of two primary limitations. First, all patients in the DMMS Wave 2 study were on dialysis, and therefore it is not possible to enter whether a patient was receiving dialysis as a predictor variable. Second, individual physiologic symptoms as contained in the Listings are not measured the same way in the DMMS Wave 2 database. For example, the current Listings require persistent elevated serum creatinine for disability determination, whereas DMMS Wave 2 measured creatinine at a few distinct points of time.

Were we to find that the current Listings did not demonstrate acceptable prediction of inability to work, we would then perform a second series of analyses on all the relevant data in the DMMS Wave 2, using the following steps:

  1. Imputation of missing data for relevant variables.
  2. Evaluation of variable interaction.
  3. Logistic regression analysis of "important" variables and interactions among variables.
  4. Exploratory analysis of significant predictor variables.
  5. ROC analysis of diagnostic efficacy of the resulting regression equation.

One could then determine the best combination of predictors of inability to work. Because employment status and self-reported inability to work are indirect measures of ability to work, such suggestions would rest on the assumption that these measures accurately represent inability to work. However, given that some patients who indicate that they are able to work are in fact not working (29 percent of those self-reportedly able to work full time), this may not be the case.

An illustrative analysis of the DMMS Wave 2 data for prediction of working status and self-reported inability to work 9 to 12 months after dialysis initiation is provided below. Although we were able to complete these analyses with the data available to us, the results should not be considered to be recommendations for changes in the Listings. Rather, at most, they may be used to guide future research.

Imputation of Missing Variables

One limitation of the DMMS Wave 2 was a large amount of missing data that made regression analyses unreliable. We initially performed sample regression analyses on randomly chosen halves of the database with death as the outcome variable. Probably because of erratically missing data for different patients in each set, the resulting regression equations for each half of the database did not contain the same predictor variables. This indicated a lack of reliability in the database, possibly because the regression process requires that all data be present for each patient.

There are, however, methods for making substitutions for missing data, referred to here as "imputation." Modern statistical computer packages (such as SPSS, used here) offer several different methods for missing variable imputation that are differentially appropriate depending on the characteristics of the variables. Listed below are some, but not all, of the options available:

  • Series mean
  • Mean of nearby points (specify # of points)
  • Median of nearby points (specify #)
  • Linear interpolation
  • Linear trend at point

For the DMMS Wave 2 database, a normal distribution of values cannot be assumed due to the heterogeneous characteristics of the patient population. Patients with ESRD may be viewed as several diverse subgroups depending on the origin of the condition (e.g., young patients with glomerulonephritis, elderly patients with diabetes). Therefore, a series mean would not be appropriate; rather, the mean or median (depending on the continuous or categorical nature, respectively, of the variable in question) of nearby points (also called "next nearest neighbor") would be most statistically meaningful. The "nearby points" referred to in these methods are other cases (patients) who exhibit similar scores on other variables to the patient for whom the data point is missing. In most statistical packages, the analyst can specify the number of nearby points to be considered (two and up). For the purpose of our illustration here, we chose to use the mean or median of two nearby points -- the mean for continuous variables (such as most laboratory measures in this database) and median for categorical ones (such as presence of comorbidities, quality-of-life, and demographic measures).

Table E-1 indicates the number of available cases before and after data imputation for the variables entered into the regression analysis. The total number of patients was 546.

It is clear that SPSS did not necessarily impute data to fill in every missing point in the database and that the amount of imputation possible varied depending on the amount of data originally available. There are several variables here for which a majority of missing data has been imputed. This is considered "extrapolation" of data, and may not be reliable. We have therefore discarded data for which more than 30 percent of data are imputed (variables originally having data for about fewer than 400 patients). The loss of these variables from analysis limits the usefulness of the resulting diagnostic test because important variables may be missing.

Even once this imputation of data was completed, regression analysis on random halves of the database showed disparate results, indicating that the small number of patients in the database, in contrast to the large number of variables, continues to be a problem. For this reason, the results presented here should not be considered accurate, only illustrative.

After discarding extrapolated data and eliminating redundancies, there were 64 variables available for regression analysis.

Recoding of Variables

For both the purposes of assessing the current Listings and for the use of regression analysis, some recoding of the existing variables in the DMMS Wave 2 questionnaire was necessary (see Appendix A and Appendix C for existing coding). Note that we have not performed all of the necessary recoding for this illustration. Because PD and HD patients are equally represented in DMMS Wave 2, it would be necessary to either "weight" the HD cases so that they counted four times for every single PD patient, or to examine PD and HD patients separately. We have not done this here for simplicity's sake. We have, however, performed other required recoding, (e.g., we recoded multilevel categorical variables into several binary variables).

Table E- 1. Variables for which data were imputedAppendix E: Sample Analysis of DMMS Wave 2 Data

Table E- 1. Variables for which data were imputedAppendix E: Sample Analysis of DMMS Wave 2 Data.

Table

Table E- 1. Variables for which data were imputedAppendix E: Sample Analysis of DMMS Wave 2 Data.

Investigation of Interacting Variables

There are many different methods for investigating possible interactions among independent variables, including a priori MANOVAs and neural networks. A full description of these methods is beyond the scope of this sample analysis.

This step was not taken for the purposes of this illustration; no interaction effects are entered into the regression equations below.

Logistic Regression Analysis

For the purposes of illustration, we have used the backwards stepwise entry method here. We have included a constant in the model due to the exploratory nature of the analysis. (The option is provided by SPSS to exclude the constant.)

The regression analysis results in an equation of the form:
Y = C + b0x0 + b1x1 + b2x2...bnxn
where Y is the outcome variable (here, full time employment coded as 0 or 1), b0..n are the coefficients for each dependent measure, and x0..n are the values of the variables included in the equation.

Analysis With Laboratory and Physiologic Measures

It was first important to assess prediction of ability to work using the variables of interest to SSA. Specifically, their Listings include physiologic and disease state measures to assess disability eligibility at step 3 of the disability sequential evaluation process. SSA does not evaluate other factors such as social, history, and demographic issues at this step of the process.

We therefore used only the 40 laboratory and physiological measurements listed in the above table for prediction of employment status and self-reported ability to work:

  1. Categorical Variables: 1 = presence of condition; 0 = Absence:
    Treatment modality Image f2966_f001.jpg Ethnicity Image f2966_f001.jpg Angina Image f2966_f001.jpg CABG Image f2966_f001.jpg Cardiac arrest Image f2966_f001.jpg Cerebrovascular disease Image f2966_f001.jpg TIA Image f2966_f001.jpg PVD Image f2966_f001.jpg Absent foot pulse Image f2966_f001.jpg Claudication Image f2966_f001.jpg Congestive heart failure Image f2966_f001.jpg Pericarditis Image f2966_f001.jpg Pulmonary edema Image f2966_f001.jpg Lung disease Image f2966_f001.jpg Neoplasm Image f2966_f001.jpg Diabetes Image f2966_f001.jpg Hypertension Image f2966_f001.jpg Glomerulonephritis Image f2966_f001.jpg Smoking status
  2. Serum Ca Image f2966_f001.jpg Phosphorus Image f2966_f001.jpg Serum icarbonate Image f2966_f001.jpg Hematocrit Image f2966_f001.jpg Hemoglobin Image f2966_f001.jpg Serum creatinine before first dialysis Image f2966_f001.jpg Serum creatinine at first dialysis Image f2966_f001.jpg BUN before first dialysis Image f2966_f001.jpg Predialysis BUN Image f2966_f001.jpg Postdialysis BUN Image f2966_f001.jpg Predialysis weight Image f2966_f001.jpg Postdialysis weight Image f2966_f001.jpg Serum aluminum Image f2966_f001.jpg Cholesterol Image f2966_f001.jpg Triglycerides Image f2966_f001.jpg Serum intact PTH Image f2966_f001.jpg Age Image f2966_f001.jpg Median predialysis SBP Image f2966_f001.jpg Median predialysis DBP Image f2966_f001.jpg Dry weight BMI Image f2966_f001.jpg Median predialysis weight

It must be remembered, when reading the results below, that the outcome measures are surrogates of true ability to work. Therefore, these equations cannot be considered representative of results one might actually use in a disability evaluation process. This is merely an illustrative example.

Prediction of full-time employment status 9 to 12 months after dialysis initiation

Using a backwards stepwise removal of variables method of regression, all variables were initially entered into the equation for prediction of full time employment status, and accounted for 27.3 percent of the variation in the value of the outcome variable (full time employment status coded 0 or 1). The regression method then removed variables accounting for the least amount of variance, until the log likelihood decreased by less than 0.01 percent. The final equation contained six variables, which accounted for only 19.2 percent of the variance in the outcome variable. If this statistic is evaluated in isolation, it would be assumed that the resulting equation does not predict full-time employment status accurately. However, this statistic can be somewhat deceptive, and the results look somewhat more positive when shown as diagnostic test characteristics, discussed later.

In the resulting equation (below), predictor variables are listed in brackets after their respective coefficient value. The categorical predictor values are binary (0 or 1), while several laboratory and physiological variables are continuous:

Y = -6.00 - 0.42[Smoking status] + 0.07[Hemoglobin] + 0.06[Creatinine] + 0.01[Serum intact PTH] - 4.41[Claudication] - 1.76[Pulmonary edema]

A negative coefficient would indicate that the presence of that condition/characteristic would make the individual more likely not to be working at followup. This is true for smoking status, presence of claudication, and presence of pulmonary edema. A positive coefficient would indicate that the presence of that condition would make the individual more likely to be working at followup. This is true for higher hemoglobin, higher serum creatinine, and higher serum intact PTH.

It is necessary that coefficients in the equation not be considered in isolation as a measure of the weight given to a particular variable. Because some variables are coded binarily (0 or 1) while others are continuous on different scales (blood pressure: 70 to 200 v. creatinine: 2-12), the relative weighting of the different variables in this equation cannot be easily compared.

Prediction of self-reported ability to work full time 9 to 12 months after dialysis initiation

For prediction of self-reported ability to work, the same processes were used as described above. The final equation accounted for just 19.4 percent of the variance in the outcome variable, and contained 10 variables:

Y = -2.21 - 0.45[Diabetes] - 0.50[Smoking status] + 0.12[Phosphorus] + 0.06[Hemoglobin] + 0.06[Serum creatinine] + 0.01[Serum intact PTH] - 0.06[Dry BMI] - 0.45[Treatment modality] - 1.11[Claudication] + 0.65[Cardiac arrest]

Diabetes, smoking status, higher dry BMI, hemodialysis, and claudication are all variables tending an individual toward reporting an inability to work full time. Higher serum phosphorus, hemoglobin, serum creatinine, serum intact PTH, and cardiac arrest tend an individual towards reporting an ability to work full time. Again, this equation may have better prediction value for working status than is indicated by the variance statistic.

The poor predictive value of these results may indicate one or both of two scenarios: that the outcome measures do not reflect ability to work, or that physiologic measures alone cannot predict patient's employment status or self-assessment of ability to work. First, the indirect measures of inability to work that we used for this illustration may be more reflective of an individual's current sociological situation. For example, it has been reported that it is easier for a patient to go on disability than to find a working situation that is sympathetic to the needs of an individual on dialysis (Friedman and Rogers, 1988; Ferrans and Powers, 1985).

The second and related issue is that there are many factors that may affect an individual's decision to work, or self-assessment of ability to work, that may be unrelated to true ability or inability to work. A physiologic condition does not determine in isolation whether an individual can perform a given task. Thus, self-reported ability to work is likely influenced by working conditions and insurance coverage (Friedman and Rogers, 1988; Ferrans and Powers, 1985).

To determine whether either or both of the above-mentioned scenarios may be plausible, it was important to include sociological and demographic variables in the equations. The results are discussed below.

Analysis With Laboratory and Demographic Measures

Illustrated below are the regression analyses performed using demographic and laboratory values for prediction of the outcome variables.

Sixty-four variables were entered into the equation at Step 1:

  1. Categorical Variables: 1 = presence of condition/characteristic 0 = Absence:
    Diabetes Image f2966_f001.jpg Hypertension Image f2966_f001.jpg Glomerulonephritis Image f2966_f001.jpg Single Image f2966_f001.jpg Married Image f2966_f001.jpg Widowed Image f2966_f001.jpg Divorced Image f2966_f001.jpg Full time Image f2966_f001.jpg Retired Image f2966_f001.jpg Disabled Image f2966_f001.jpg Clerical Image f2966_f001.jpg Professional Image f2966_f001.jpg Tradesperson Image f2966_f001.jpg Manual labor Image f2966_f001.jpg Student Image f2966_f001.jpg White Image f2966_f001.jpg Black Image f2966_f001.jpg Other race Image f2966_f001.jpg CHD Image f2966_f001.jpg Full time 2 yrs previous Image f2966_f001.jpg Part time 2 yrs previous Image f2966_f001.jpg Working part time at start of dialysis Image f2966_f001.jpg Education less than high school Image f2966_f001.jpg High school graduate Image f2966_f001.jpg College graduate Image f2966_f001.jpg Treatment modality Image f2966_f001.jpg Ethnicity Image f2966_f001.jpg Angina Image f2966_f001.jpg CABG Image f2966_f001.jpg Cardiac arrest Image f2966_f001.jpg Cerebrovascular disease Image f2966_f001.jpg TIA Image f2966_f001.jpg PVD Image f2966_f001.jpg Absent foot pulse Image f2966_f001.jpg Claudication Image f2966_f001.jpg Congestive heart failure Image f2966_f001.jpg Pericarditis Image f2966_f001.jpg Pulmonary edema Image f2966_f001.jpg Lung disease Image f2966_f001.jpg Neoplasm Image f2966_f001.jpg Independent eating Image f2966_f001.jpg Independent transferring Image f2966_f001.jpg Independent ambulating
  2. Continuous variables:
    Serum Ca Image f2966_f001.jpg Phosphorus Image f2966_f001.jpg Serum bicarbonate Image f2966_f001.jpg Hematocrit Image f2966_f001.jpg Hemoglobin Image f2966_f001.jpg Serum creatinine before first dialysis Image f2966_f001.jpg Serum creatinine at first dialysis Image f2966_f001.jpg BUN before first dialysis Image f2966_f001.jpg Predialysis BUN Image f2966_f001.jpg Postdialysis BUN Image f2966_f001.jpg Predialysis weight Image f2966_f001.jpg Postdialysis weight Image f2966_f001.jpg Serum aluminum Image f2966_f001.jpg Cholesterol Image f2966_f001.jpg Triglycerides Image f2966_f001.jpg Serum intact PTH Image f2966_f001.jpg Age Image f2966_f001.jpg Median predialysis SBP Image f2966_f001.jpg Median predialysis DBP Image f2966_f001.jpg Dry weight BMI Image f2966_f001.jpg Median predialysis weight

Prediction of full-time employment status 9 to 12 months after dialysis initiation

For the prediction of working status, backwards stepwise removal of variables resulted in the removal of all but 17 variables and the constant. These 17 variables accounted for 57.9 percent of the variance of the outcome variable. The initial equation, including all 64 variables, accounted for 64.7 percent of the variance; thus, the 54 variables deleted accounted for only 6.8 percent of the variance altogether. Note that the amount of variance accounted for is much higher for this equation than for either of the equations above that contained only physiologic values.

Y = -5.01 - 0.48[Hypertension] - 1.42[Neoplasm]-1.98[Pulmonary edema] + 1.53[Cardiac arrest] -0.78[Ethnicity] -1.02[Less than high school education] -0.59[High school education] + 2.23[Work full time at start] + 1.15[Work part time at start] - 1.62[Work part time 24-6 mos prior to dialysis] + 0.60[Professional job]- 0.27[Serum calcium]+ 0.08[Hemoglobin] + 0.01[BUN before first dialysis] - 0.01[Predialysis weight in lbs] + 0.74[Serum albumin] + 0.01[Serum intact PTH]

Presence of hypertension, pulmonary edema, low education level, neoplasm, working part time 24 to 6 months prior to dialysis, higher predialysis weight, and higher serum calcium tend an individual towards not working full time. On the other hand, if a patient was working full time at the start of dialysis, working part time at the start of dialysis, had a professional job, higher serum albumin, higher serum intact PTH, cardiac arrest, and/or higher hemoglobin levels, he or she would tend toward working full time at 9 to 12 months after the start of dialysis. Presence of demographic variables in this equation may be superfluous because they are significantly influenced by disproportionate occurrence of diseases in these populations.

Prediction of self-reported ability to work full time 9 to 12 months after dialysis initiation

For this analysis, the same 64 variables listed above were entered into the regression equation, but the outcome variable of interest this time was the binary variable "Can work full time at followup" indicating the patient's self-assessment of ability to work. In this case, the resulting equation, which included 10 variables and a constant, accounted for 42.9 percent of the variance, somewhat less than for the prediction of working status at followup. The resulting equation was as follows:

Y = -0.32 - 0.47[Diabetes] + 1.64[Full time at start of study] -1.52[Less than high school education] - 0.80[High school graduate] - 0.44[College graduate] - 0.18[Serum calcium] + 0.08[Serum creatinine before first dialysis] - 0.01[Predialysis weight] + 0.47[Serum albumin] + 1.00[Cardiac arrest]

This equation implies that patients who have diabetes, are college graduates, have higher serum calcium, or have a high school education or less are likely to not consider themselves able to work full time. Those who were working full time at the start of dialysis, who had cardiac arrest, high serum albumin, or serum creatinine tend to be more likely to consider themselves able to work. However, the equation makes clear that none of these characteristics or conditions can be considered in isolation when making a disability determination.

Summary

These illustrative regression analyses have shown that laboratory and physiological measures alone do not take into account all the factors that lead to an individual's decision whether he or she does or can work. Given the current sociological and demographic issues that confront a patient on dialysis, it may be reasonable to suggest that physiologic values alone cannot predict working status. The equations that resulted from the inclusion of both physiologic and sociodemographic variables fared much better and were able to account for about half of the variance in the outcome variables. However, because these results were not able to be replicated on random halves of the database, none of the findings can be considered definitive.

However, these results cannot be considered indicative of what would be appropriate for disability evaluation. Because the outcome measures were surrogates of ability to work that we know are not accurate substitutions, the results may be quite different if a more direct measurement were used as the outcome measure.

If these equations could be considered useful and accurate, additional steps would be necessary to identify the best predictors of ability to work. First, these equations treat continuous variables without considering any predefined "cut point" for diagnosis of positive versus negative. For example, in the current Listings, a cut point of 4 mg/dL is used for serum creatinine, above which the individual is considered disabled, and below which the individual is considered able to work (in combination with other factors). Our above regression analyses treated continuous variables as continuous, rather than redefining them as binary (positive/negative) with a cut point.

Once the regression analysis has identified key predictor variables, one may then want to do some exploratory analyses to see if even better results could be obtained were the continuous variables translated into binary ones using a diagnostic cut point. It may be useful to do an ROC analysis, like the one illustrated below, for individual predictor variables to identify the most appropriate cut point.

Second, reporting the amount of variance accounted for by these equations does not portray an accurate picture of the usefulness of these equations in a diagnostic process, because the statistic does not reflect the type of errors that are occurring when the equation is used. If the equation was resulting in too many people being considered "disabled," this may be an acceptable type of error. However, if too many people were being considered able to work, then the use of this diagnostic equation may lead to denial of disability insurance to individuals who need it. The sample ROC analysis, below, offers methods for assessing the practical accuracy of such combinations of predictor variables.

ROC Analysis

In order to best assess an equation's diagnostic capabilities, it is necessary to examine its diagnostic test characteristics in the form of ratios expressing the percentage of cases correctly classified. As we mentioned in the Descriptive Statistics section of this report in our univariate example of diabetes as a predictor of employment, diagnostic test characteristics are essential because significance tests do not accurately depict the amount of diagnostic error a test produces.

There are four types of test characteristics commonly considered:

  • True positive (TP) (patient has condition, test detects condition)
  • False negative (FN) (patient has condition, test fails to detect it)
  • False positive (FP) (patient is normal, test mistakenly detects condition)
  • True negative (TN) (patient is normal, test finds patient normal)

Appendix E: Sample Analysis of DMMS Wave 2 Data

Appendix E: Sample Analysis of DMMS Wave 2 Data.

Table

Appendix E: Sample Analysis of DMMS Wave 2 Data.

Sensitivity = TP / (TP + FN) [the proportion of patients with the disease who are detected by the test]

Specificity = TN / (TN + FP) [the proportion of patients without the disease who are correctly diagnosed as negative]

Positive predictive value (PPV) = TP / (TP + FP) [the proportion diagnosed positive that are truly positive]

Negative predictive value (NPV) = TN / (TN + FN) [the proportion diagnosed negative that are truly negative]

In the cases of the above equations, the "positive" case is when Y = 0, or the patient is not working full time (or reporting that he or she is not able to work full time). Conversely, when Y = 1, the case is considered "negative," and thus the person is self-reportedly able to work or working full time. Thus, a true positive for this equation is when the equation predicts that a patient will not be working, and indeed, the patient is not working.

It is also important to note that in the examples provided here, a "false positive" may not truly be "false" because no "gold standard" is available to determine whether the equation is right or wrong, versus whether the surrogate outcome measure is right or wrong at portraying ability to work. The test characteristics displayed below really reflect the predictive value for working status or self-reported ability to work, not true ability to work.

In the above equations, rarely does Y equal exactly 1.0 or 0.0. For each patient, the predicted value of Y ranges from 0 to 1. The equation will not work well for some patients, who may score around 0.5. It is then important to determine whether such patients are considered "positive" or "negative" when using this test. This is called the "cut value" or "threshold." The diagnostic test characteristics are also influenced by where the cut value is placed. The default value in SPSS is 0.5. The threshold can be varied from 0 to 1 and graphed in a ROC curve, as will be shown below.

The following table indicates the test characteristics for the equation above predicting working status using only physiologic variables when the cut value is 0.5:

Appendix E: Sample Analysis of DMMS Wave 2 Data

Appendix E: Sample Analysis of DMMS Wave 2 Data.

Table

Appendix E: Sample Analysis of DMMS Wave 2 Data.

Notice in this table that most of the errors result from the false-positive designations-individuals predicted by the equation as not working who are in fact working. If this equation were used in the disability evaluation process, however, this error would likely have no effect; these individuals are working and are therefore not applying for disability. The only truly effectual error is that in which patients who may be disabled are denied disability. This may happen with the false-negative cases, 14 individuals who were predicted to be working who are in fact not working1. This is 3.6 percent of the entire population who, if this were prediction of ability to work, might be receiving disability payments while no longer considered disabled.

These statistics suggest that an equation with just physiologic variables may be much more useful than indicated by the regression analysis described above. However, this assumes that working status is a true measure of ability to work, which is likely not the case.

The following table provides information on the equation that used both physiologic and sociodemographic variables when the cut value is 0.5:

Appendix E: Sample Analysis of DMMS Wave 2 Data

Appendix E: Sample Analysis of DMMS Wave 2 Data.

Table

Appendix E: Sample Analysis of DMMS Wave 2 Data.

This table shows that the inclusion of sociodemographic variables improves the specificity substantially, and thus more individuals truly able to work are classified correctly as such. The improvement comes as a result of a lower false-positive rate; few patients who are working full time would be classified as disabled. The improvement yielded by this equation, therefore, does not lead to any real improvement in disability evaluation, because these 252 "false-positive" individuals would be working and therefore not apply for disability. Only 6.9 percent (26 out of 366) of the population considered here that might be receiving disability would be moved off disability rolls (if the outcome measure were an accurate representation of ability to work). This statistic is higher than that above, when only physiologic variables are used (3.6 percent).

Assuming that the surrogate outcome measure of working status could be used as a gold standard for ability to work (which, as discussed above, it cannot), an ROC curve could then be generated that represented how the test characteristics of these equations change with different cut points. The results could then be used to choose a cut point that would best fit the needs of the diagnostic test. Most diagnostic tests prefer high sensitivity and low specificity versus the opposite, so that no positive cases are missed. Other diagnostic tests work best if a balance is reached between sensitivity and specificity. In this case, disability evaluation may work best if sensitivity is maximized.

Figures E-1 and E-2 show the regression equations for predicting working status in ROC space, where sensitivity is plotted against 1-specificity. E-1 includes only physiologic variables, while E-2 also includes sociodemographic variables. In this space, a curve that falls closer to the upper left quadrant indicates better prediction value (higher sensitivity and specificity). This curve shows that as sensitivity increases specificity decreases, and vice versa. It is therefore possible to choose a cut point/threshold that will maximize either sensitivity at the expense of specificity, or vice versa, or to balance the two out, as is done with a cut point (Y-value) of 0.5, marked on the graph. When sensitivity is maximized at the expense of specificity (as might be done with a cut point of 0.7, shown on the graphs in Figures E-1 and E-2), most patients will be assigned to the category of "Not working full time," and many of these will be erroneous assignments. Because of the small proportion of patients who are working full time at followup, this may not be considered a major error.

Figure E- 1. ROC curve of physiologic variables for prediction of working status at 9- to 12-month followup.

Figure

Figure E- 1. ROC curve of physiologic variables for prediction of working status at 9- to 12-month followup.

Figure E-2. ROC curve of demographic and physiologic variables for prediction of working status at 9- to 12-month followup.

Figure

Figure E-2. ROC curve of demographic and physiologic variables for prediction of working status at 9- to 12-month followup.

On the other hand, if specificity were maximized at the expense of sensitivity (as might be done with a cut point of 0.3, shown on the graph below), more patients would be identified as being able to work; but some of these would be individuals who are not able to work and would be erroneously denied disability.

Summary

These sample analyses have illustrated the statistical methods that may be used to determine predictors of disability when data such as that in DMMS Wave 2 are available. The results presented here cannot be considered indicative of the results that might be found if an unflawed measure of ability to work were available to use as the outcome measure.

We have illustrated the importance of having available data for all cases to be included in the regression analysis and outlined ways of identifying the best variables to enter into the regression equation. The results of the regression equation, however, should be considered a starting point for identifying the best combination of predictors and should not be considered the final solution. In particular, the results provided by regression analysis do not indicate the diagnostic value of the equations and, thus, ROC analysis should accompany the results. Additional exploratory analysis of significant predictor variables would then be prudent.

Footnotes

1

The fact that so few individuals were working also suggests that we have overfit some of the multiple regression equations. Overfitting is particularly likely when an equation consisting of many variables is based on only a few events.

2

As in the preceding example, this is a relatively small number of events; therefore,we may have overfit the data.

Views

  • PubReader
  • Print View
  • Cite this Page

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...