U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Lehmann HP, Andrews JS, Robinson KA, et al. Management of Acne. Rockville (MD): Agency for Healthcare Research and Quality (US); 2001 Sep. (Evidence Reports/Technology Assessments, No. 17.)

  • This publication is provided for historical reference only and the information may be out of date.

This publication is provided for historical reference only and the information may be out of date.

Cover of Management of Acne

Management of Acne.

Show details

3Results: Summary of Literature

Our results fall into two categories: This chapter addresses assessment of the literature and the evidence; the next chapter provides an exhaustive review of the evidence comparing specific pairs of choices of acne therapy. Chapter 5 reports our answers to the specific questions.

Literature Search and Abstract Review

Results from the searches and the abstract review process were maintained in databases developed in ProCite® (Acne Citations Database and Acne Deletions Database). A summary of the search results is provided in Table 1. The bulk of the searching was completed in January and February of 1999, with final searches of PubMed and CENTRAL completed in April 1999. A total of 4,749 citations were identified. Of these, 1,558 were uniquely identified; that is, not previously included in the Acne Citations Database. From the results of the initial PubMed and CENTRAL searches, the search director removed citations dealing only with excluded topics (e.g., rosacea) and, because resources did not allow for translation of articles, citations that were published in languages other than English. A listing of these removed citations was reviewed by one of the principal investigators. Subsequent search results were limited to English language publications and were reviewed by the principal investigators to identify citations for inclusion in the abstract review. Through these processes 829 citations were identified to be included in the abstract review process.

Table 1. Summary of search results.

Table

Table 1. Summary of search results.

We were unable to review 7 citations as they did not have abstracts and we were unable to retrieve the full articles. Of the 822 citations we could review, 460 (56 percent) were classified as eligible for full article review while 362 (44 percent) were determined to be ineligible for full article review. The majority of abstracts were excluded as not addressing management of acne (291 [80 percent]); the next most common exclusion was no original data (75 [21 percent]). Much smaller numbers of abstracts were excluded as not in English language (6), human data not included (4), only addressing excluded topics (5), and as duplicate publications not caught during searching and retrieving process (4).

Article Review

From the abstract review process, 460 citations were identified for inclusion in the article review phase. Because of this large number, and with consensus from the technical experts, it was decided to focus on the controlled trials. Eligible trials were those where the individuals followed in the trial were definitely or possibly assigned prospectively to one of two or more treatment arms using either random allocation or a quasi-random method of allocation (e.g., date of birth). Of the 460 citations, 331 were identified as reports of controlled trials by the Cochrane Collaboration (citations included in CENTRAL) or by one of the abstract reviewers (for citations from MEDLINE® and other sources).

We were unable to retrieve, and, therefore, unable to complete article review of 12 articles. Of the remaining 319 articles reviewed, 23 (7.2 percent) contained descriptions of more than one trial. Each trial was quality assessed and abstracted separately. There were 351 trials for which a review was completed.

A total of 77 trials (21.9 percent of trials) were excluded during the article review process. The reasons for exclusion are presented in Table 2. The final review thus included 274 trials from 250 articles. (See Appendix H for included and excluded trials.)

Table 2. Reasons for exclusion of trials.

Table

Table 2. Reasons for exclusion of trials.

As the article review process progressed, we realized that many studies had multiple time points of outcome assessment (e.g., 2, 4, 6, 8, 10, 12, 14, and 16 weeks in a single trial). It was not feasible to continue collection, or data entry, in this manner nor did it seem likely to add to the value of the evidence synthesis. We conferred with our technical experts on an ad hoc basis and included a proposal for dealing with the time points in a formal request for feedback. Based on this, abstraction of outcome data was collected for the time points corresponding to the following time periods:

  • Short term -- Longest time interval up to and including 6 weeks (< 6 weeks);
  • Mid term -- Longest time interval up to and including 12 weeks (>6 to 12 weeks);
  • Long term -- Longest time point reported by study, if greater than 12 weeks; and
  • Post-therapy -- Any time period reported after stopping of treatment.

Description of the Acne Literature

General Trial Information

Of the 319 articles reviewed, 296 presented single trials while 23 articles reported multiple trials: 18 articles presented two trials; three articles presented three trials; one article presented four trials; and one article presented six trials.

Figure 2 shows the count of the 274 trials included by publication year. As illustrated in the graph, the number of controlled trials published has generally increased since the mid-1970s.

Figure 2. Number of trials published per year.

Figure

Figure 2. Number of trials published per year.

Study validity is often divided into two parts: external validity and internal validity. Internal validity is the extent to which the methods of the study lead one to trust the results of the study. External validity is the extent to which one may generalize the results beyond the study.

Our method of summarizing methodological strengths and weaknesses is presented in Volume 2 (see Description of the Evidence Tables). Trials that had only strengths numbered 45 (16 percent). Trials that had only weaknesses numbered 106 (39 percent). Trials that had a mix of strengths and weaknesses numbered 107 (37 percent). The remaining 22 (8 percent) were of intermediate quality or did not provide enough information to make a determination.

Funding source is important for assessing bias and in generalizing trial results. Explicit mention of funding source was made in 145 of the trials (53 percent). Of these, 8 were federally funded, 10 reported some other source of funding, and 127 out of 274 trials (46 percent) were drug-company sponsored. We were able to determine additional types of support by drug companies in 113 of the trials, including those where funding source was not clear. Twelve (4 percent of the trials) had a drug-company employee as first author, 38 (14 percent of the trials) had a drug-company employee as co-author, 38 (14 percent of trials) supported the trial by providing medication and one trial (0.4 percent of trials) noted analytic support.

Generalizability also depends on the extent to which results are replicated in multiple locations. The trials spanned the world: Of the 274 included trials for which ata are available, continental Europe accounted for 74 (27 percent); United Kingdom, 56 (20 percent); North America,124 (45 percent -- includes United States, 118 [43 percent], Canada, 5 [2 percent], and Mexico, 1 [0.4 percent]); Asia, 10 (4 percent); Middle East, 2 (1 percent); Oceania, 5 (2 percent); Africa, 1 (0.4 percent); and Iceland, 1 (0.4 percent) for a total of 31 different countries (and one unknown). Although the whole world is represented, patients from American/British Commonwealth countries comprise two-thirds of the trials.

Generalizability is influenced by replication in multiple locations within a trial. Most trials were performed at one study site. Of the 251 trials for which data are available, 177 (71 percent) were single-site, 65 (26 percent) were multi-center, single-country, and 9 (4 percent) were multi-country.

Finally, generalizability is greatly affected by how subjects (patients) are enrolled into a trial. The further away from being a randomly chosen patient and the more selected a subject is through the clinical-care process, the less likely the study patient is to represent other patients. Of 274 trials, fully 203 (84 percent) gave no information regarding patient selection. Of those trials providing information (71), 45 (63 percent) stated that patients were "recruited" (although rarely with further details), and 19 (42 percent) stated that patients were selected as consecutive visitors to the clinic. Seven trials (3 percent) mentioned other methods. Whether subjects were recruited through a dermatology practice, a general practice, or from the general population is also an important consideration. It was rarely possible to determine this with any specificity in the trials reviewed.

Within trials, it is important that the arms be similar in arm comparability, judged on the basis of baseline patient characteristics. For 103 (38 percent) of the trials arm comparability could not be determined. In 90 trials (33 percent) the arms were definitely comparable, and 76 (28 percent) were judged as maybe comparable. Only 5 (2 percent) provided enough evidence to conclude that the arms were not comparable. Further details concerning baseline patient characteristics are provided in the next section (Patient Characteristics).

Trial design is important for internal validity as well. Most of the trials used standard, two-arm, parallel trial designs (259 [94 percent]). Only 15 (5 percent) of the trials used a cross-over design, where patients serve as their own controls over time, 30 (11 percent) were parallel trials with split-face design, where patients serve as their own controls spatially; and 22 (8 percent) were parallel trials with matched controls. While both cross-over and split-face design have the potential to control for baseline characteristics in the most patient-specific manner, it is crucial that, for cross-over studies, there be a wash-out period, and that, for split-face studies, the investigator should anticipate contamination of the opposite side of the face. 25 For the most part, we were unable to determine this type of detail from the trials.

Treatment assignment is important for internal validity, with randomized assignment being the current gold standard. Relatively few trials (56, or 20 percent) gave enough information to conclude that the patients were randomly assigned, and 175 (64 percent) were stated to be randomized, with no further details. The rest either used deterministic methods of assignment -- i.e., the treatment was assigned using an attribute known by the investigator, such as date of birth or hospital identification number (9, or 3 percent) -- or gave no information at all (34, or 12 percent).

Treatment blinding is also important for maintaining high internal validity. Only 27 trials (10 percent) gave enough details to convince the abstracters that patient blinding was properly performed; 180 (66 percent) made a statement that blinding was performed, but with no further details; and 45 (16 percent) were explicit that patients were not blinded as to their treatment. Similarly, 41 trials (15 percent) convincingly demonstrated that the enrolling or treating physician was adequately blinded, while 177 (64 percent) only said that they were blinded, and 22 (8 percent) were clearly not blinded.

Compliance of patients with therapy is crucial in determining efficacy or effectiveness, and especially so with topical treatment. Seventy-four trials (27 percent) provided evidence that patients' compliance was addressed prospectively from trial onset. A further 17 (6 percent) discussed compliance retrospectively, 17 (6 percent) made statements suggestive of their concern about compliance, and 8 (3 percent) clearly ignored compliance. Fully 148 trials (54 percent) gave no indication one way or the other of this concern, and for 9 (3 percent), compliance monitoring was not applicable.

Trials varied in mode of treatment administration: 101 trials (37 percent) included systemic therapy, 183 trials (67 percent) included topical facial therapy, and 25 (9 percent) addressed other areas, like the back and chest. (Percent sums to greater than 100 percent because trials could address multiple areas.)

Trials excluded a variety of ancillary therapies used by all trial participants. The excluded ancillary treatments included other medications (110, or 40 percent), soaps (44, or 16 percent), and diet (9, or 3 percent); 129 trials (47 percent) mentioned ancillary therapy, but gave no details. A variety of other therapies were mentioned as being explicitly excluded: radiation therapy, sun bathing, unspecified powder, scalp shampoo, among others. We judged 124 trials (45 percent) as providing evidence that subjects received similar ancillary treatment. Seven trials (3 percent) clearly provided asymmetric ancillary therapy -- that is, the groups did not receive the same reatment, besides the treatment that was the focus of the trial. Fully 119 (43 percent) gave no information at all.

In terms of study execution, 45 trials (16 percent) had major losses to followup, while 107 (39 percent) had minor losses. Ninety-one trials (33 percent) documented virtually no such losses, while 31 (11 percent) did not provide enough data for an assessment. Beyond losses to followup, 7 trials (3 percent) reported protocol departures; 44 (16 percent) had minor such departures but 122 (45 percent) provided too little information to make any assessments. We judged 203 trials (74 percent) to have followed their design in execution and only 4 (1 percent) to definitely not have followed the plan, but this could not be determined for 14 trials (5 percent).

As a summary statement regarding external validity, we judged only 4 trials (1 percent) as being generalizable to the general population; 11 trials (4 percent), to more specific populations; and 9 (3 percent), to local populations. The great majority (152, or 55 percent) do not give enough data, while the remainder can be generalized only to the referral population (22, or 8 percent) or to any group beyond the study participants (76, or 28 percent).

Patient Characteristics

For those patient characteristics determined from exclusion criteria, no study mentioned sexual maturity rating (Tanner stage) and 66 (24 percent) did not explicitly mention any exclusion criteria based on patient characteristics. Table 3 shows the breakdown of other exclusion criteria. Acne severity was the basis of exclusion most of the time: 79 trials excluded patients with "severe" acne, while 82 trials excluded patients with "mild" acne. Other morbidity, pregnancy, and young age were the other most cited exclusion criteria.

Table 3. Exclusion criteria cited by trials.

Table

Table 3. Exclusion criteria cited by trials.

Based on the patient characteristics used as exclusion criteria, all studies included adolescents and young adults (see Table 3). Specific ages for participants in trials were also collected. Pooling the studies for which mean and standard deviations are available yields a pooled mean of 21 years and a pooled standard deviation of 5 years. This mean is consistent with the age groups implied by the exclusion criteria. The average range was 15 to 32 years, the overall minimum stated in age ranges was 8 years and the maximum was 72 years.

Table 4 supplies the number of trials providing information for each patient characteristic: age, sex, race, skin type, diet, duration of disease, insurance status, prescription plan, source of care, practice setting and phase of care. One trial reported on patients' diets, none reported on insurance status or the availability of a prescription plan. Few studies reported on skin type or disease duration. Most studies reported on the sex of patients, but 70 (26 percent) omitted this essential baseline characteristic.

Table 4. Patient characteristics provided by trials.

Table

Table 4. Patient characteristics provided by trials.

Eight trials stratified their results. Trials 35, 293, 111, and 277 stratified by gender. Trials 350, 16, 222, stratified by acne severity, and Trial 55 stratified by location.

Acne Severity

As stated above, regarding exclusion criteria, acne severity was the most important patient characteristic, and it is the characteristic our experts raised most often in deciding on individual treatment profiles. In the process of reviewing the literature, 25 different methods of assessment for acne severity were identified. These methods were varied and included lesion counting on all or part of the face, comparison of the patients to a photographic standard, and comparison of the patients to a text description. Many trials did not report their method of assessment. The way in which severity was reported also differed between studies. Many used terminology, "mild, moderate or severe." Others used numerical scores such as 1-4, 0-10, etc. Neither terminology nor score was necessarily consistent across studies; so what was termed "grade 2 acne" in one trial might have been classified as "grade 6" according to a grading system in a different trial.

In our synthesis, an effort was made to standardize the reporting of acne severity so that more accurate comparisons could be made between trials. Terminology used here is the standard "mild, moderate, severe." Each trial was reviewed for the acne severity of the patients included in that study. The reported severity was converted to a "Combined Acne Severity Classification" as follows:

  1. Trials Reporting Lesion Count
    The Combined Acne Severity Classification was derived from the mean lesion counts and standard deviations reported in the studies. The definitions are:
    • Mild acne -- Fewer than 20 comedones, or fewer than 15 inflammatory lesions, or total lesion count fewer than 30.
    • Moderate acne -- 20-100 comedones, or 15-50 inflammatory lesions, or total lesion count 30-125.
    • Severe acne -- More than 5 cysts, or total comedo count greater than 100, or total inflammatory count greater than 50, or total lesion count greater than 125.
    Severities included in the trial were based on the mean lesion count plus or minus 2 standard deviations. When no standard deviation was provided in the study, then only the grade of the mean was used.
  2. Trials Reporting an Acne Severity Grade
    Two reviewers evaluated each grading system that was described in detail. These grading systems, a few of which were frequently referenced, were converted to the Combined Acne Severity Classification based on how the grading system compared with the above definitions. This conversion was straightforward in articles that used scoring based on calculations from lesion counts. Some conversions, such as the ones from photographic assessment or from text descriptions, required judgement.
  3. Trials Reporting Both Lesion Count and an Acne Severity Grade
    In some instances, studies reported both lesion count and a severity grade. In this case, the lesion count was used to determine the Combined Acne Severity Classification.
  4. Trials With No Reference to Assessment Method
    In some trials, severity terminology was used with no reference as to how severity was actually assessed. In these instances, the severity stated by the author was kept as the combined acne severity classification.
    When no other information was provided, the exclusion criteria were used to determine the Combined Acne Severity Classification. Patients included were assumed to be all of those not excluded, using the definitions above to categorize the patients.

See Table 5 for comparison of the Combined Acne Severity Classification and frequently cited severity grading systems. See Figure 3 for the distribution of the Combined Acne Severity Classification among the trials included in the review. The trials were distributed as 21 percent (58) in mild/moderate category, 49 (18 percent) in moderate/severe category and 51 (19 percent) in moderate category. Only about 5 percent of the trials had patients classified at either end of the scale, as either mild or severe. We were unable to classify severity in about 22 percent (59) of the trials.

Table 5. Comparison of combined severity grading with frequently cited grading systems.

Table

Table 5. Comparison of combined severity grading with frequently cited grading systems.

Figure 3. Distribution of trials by combined acne severity classification.

Figure

Figure 3. Distribution of trials by combined acne severity classification.

Source of Care

Practice Setting

From our model of acne care (Figure 1), we needed data conditioned on source of care or practice setting. No trial noted self-care or HMO as setting of care, 8 (3 percent) were performed at a health clinic, 21 (8 percent) at a private practice, and 195 (71 percent) of the trials cited study clinic/hospital as practice setting. Miscellaneous sites included schools or student centers and one military clinic.

Care Provider

Eight trials (3 percent) noted generalist as source of care while 68 trials (25 percent) were determined to have care provided by dermatologists. In 157 trials (57 percent) the source of care could not be determined beyond study clinic.

Outcomes

Discrete outcomes are those in which investigators report the proportion of subjects achieving a categorical outcome (i.e., "improved"). Continuous outcomes are based on a quantitative measure of an attribute in each subject (e.g., mean number of lesions). Table 6 shows the distribution of outcomes. 203 (74 percent) of trials had continuous outcomes, and 80 percent (221 trials) had discrete outcomes.

Table 6. Distribution of continuous and discrete outcomes.

Table

Table 6. Distribution of continuous and discrete outcomes.

Table 7 shows the distribution of outcomes by outcome type. There were 1,237 outcomes. Of these outcomes, 888 (72 percent) concerned the assessment of acne lesions, and 335 (27 percent) were classified as outcomes of adverse events or side effects. Almost all trials (269, or 98 percent) trials had data on acne outcome, while no trials had data on economic outcomes. Only 2 trials (0.7 percent) had data on psychological outcomes, and only 43 trials (16 percent) provided some data on treatment compliance.

Table 7. Outcome types.

Table

Table 7. Outcome types.

The 19 outcomes shown in Table 8 account for more than half the data. There were 505 acne outcomes, lumping together multiple time points for individual outcomes within a trial. Of these, 142 (28 percent) concerned severity, 211 (42 percent) concerned lesions counts, and 152 (30 percent) were overall assessments. Percentage change with respect to baseline counts accounted for 73 (36 percent) of the lesion count outcomes. Of the 83 trials (31 percent) where location of count was mentioned, 25 (29 percent) simply said "face," 11 (13 percent) stated "entire face," and the remainder used 39 other descriptors, such as "5 cm worst area" or "selected area." Table 9 lists the various descriptors of count location.

Table 8. Number of trials employing outcomes accounting for just over half of reported effectiveness outcomes.

Table

Table 8. Number of trials employing outcomes accounting for just over half of reported effectiveness outcomes.

Table 9. Location of lesion counting.

Table

Table 9. Location of lesion counting.

Treatments

For purposes of summarizing the data and for organizing the results, the treatments were divided into a number of classes. This classification was completed by consensus of our technical experts. The basis for the division was (1) similarity of pharmacological mechanism, and (2) similarity of perceived severity for which the treatment would be prescribed. For the first reason, anti-bacterials were grouped together, regardless of the specific class of antibiotic. For the second, topical and oral antibiotics were separated because most physicians would treat topically first, before using systemic therapy. Our goal in separating these classes was to find if the literature supported this established use.

The classes shown in Table 10 represent the classes, based on the data available. The classes comprising first-line therapy are Cleanser, eratolytic, Anti-bacterial (topical), and combinations. Second-line therapy includes Retinoid topical), Anti-bacterial (oral), and combinations. Referral treatment includes Retinoid (oral) and Anti-androgen. The class Other includes treatments not classifiable in the previous classes, and Inert refers to placebos, vehicles, and other interventions used as controls for presumed effective acne treatment.

Table 10. Treatment classes.

Table

Table 10. Treatment classes.

Table 11 lists all of the treatments studied in the trials included in this report. The table includes a brief description, the classification, and the availability of the treatment. Table 12 summarizes the trials as comparisons, grouped by the class assignment of the evaluated treatment: target and comparator. There were 251 comparisons in the two- and three-arm trials.

Table 11. Description of treatments in trials reviewed.

Table

Table 11. Description of treatments in trials reviewed.

Table 12. Treatment comparisons.

Table

Table 12. Treatment comparisons.

Views

  • PubReader
  • Print View
  • Cite this Page

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...