U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

National Clinical Guideline Centre (UK). Headaches: Diagnosis and Management of Headaches in Young People and Adults [Internet]. London: Royal College of Physicians (UK); 2012 Sep. (NICE Clinical Guidelines, No. 150.)

  • Update information - February 2020: A footnote was added to recommendation 1.3.17 on the potential risk of propranolol overdose in people with migraine who also have depression. November 2015: New and updated recommendations on the prophylactic treatment of migraine were added.

Update information - February 2020: A footnote was added to recommendation 1.3.17 on the potential risk of propranolol overdose in people with migraine who also have depression. November 2015: New and updated recommendations on the prophylactic treatment of migraine were added.

Cover of Headaches

Headaches: Diagnosis and Management of Headaches in Young People and Adults [Internet].

Show details

2Development of the guideline

2.1. What is a NICE clinical guideline?

NICE clinical guidelines are recommendations for the care of individuals in specific clinical conditions or circumstances within the NHS – from prevention and self-care through primary and secondary care to more specialised services. We base our clinical guidelines on the best available research evidence, with the aim of improving the quality of health care. We use predetermined and systematic methods to identify and evaluate the evidence relating to specific review questions.

NICE clinical guidelines can:

  • provide recommendations for the treatment and care of people by health professionals
  • be used to develop standards to assess the clinical practice of individual health professionals
  • be used in the education and training of health professionals
  • help people to make informed decisions
  • improve communication between patient and health professional.

While guidelines assist the practice of healthcare professionals, they do not replace their knowledge and skills.

We produce our guidelines using the following steps:

  • guideline topic is referred to NICE from the Department of Health
  • stakeholders register an interest in the guideline and are consulted throughout the development process
  • the scope is prepared by the National Clinical Guideline Centre (NCGC)
  • the NCGC establishes a guideline development group
  • a draft guideline is produced after the group assesses the available evidence and makes recommendations
  • there is a consultation on the draft guideline
  • the final guideline is produced.

The NCGC and NICE produce a number of versions of this guideline:

  • the full guideline contains all the recommendations, plus details of the methods used and the underpinning evidence
  • the NICE guideline lists the recommendations
  • information for the public (‘understanding NICE guidance’ or UNG) is written using suitable language for people without specialist medical knowledge.

This version is the full version. The other versions can be downloaded from NICE at www.nice.org.uk

2.2. Remit

NICE received the remit for this guideline from the Department of Health. They commissioned the NCGC to produce the guideline.

The remit for this guideline is:

To develop a clinical guideline for the diagnosis and management of headaches in adolescents and adults.

Who developed this guideline?

A multidisciplinary Guideline Development Group (GDG) comprising professional group members and consumer representatives of the main stakeholders developed this guideline (see section on Guideline Development Group Membership and acknowledgements).

The National Institute for Health and Clinical Excellence funds the National Clinical Guideline Centre (NCGC) and thus supported the development of this guideline. The GDG was convened by the NCGC and chaired by Professor Martin Underwood in accordance with guidance from the National Institute for Health and Clinical Excellence (NICE).

The group met every 5–6 weeks during the development of the guideline. At the start of the guideline development process all GDG members declared interests including consultancies, fee-paid work, share-holdings, fellowships and support from the healthcare industry. At all subsequent GDG meetings, members declared arising conflicts of interest, which were also recorded (Appendix B).

Members were either required to withdraw completely or for part of the discussion if their declared interest made it appropriate. The details of declared interests and the actions taken are shown in Appendix B.

Staff from the NCGC provided methodological support and guidance for the development process. The team working on the guideline included a project manager, systematic reviewers, health economists and information scientists. They undertook systematic searches of the literature, appraised the evidence, conducted meta-analysis and cost effectiveness analysis where appropriate and drafted the guideline in collaboration with the GDG.

2.3. What this guideline covers

This guideline covers the following populations:

Young people (12 years and older) and adults in all settings in which NHS healthcare is provided.

The following clinical issues are covered:

  • Diagnosis of the following primary headaches: migraine with or without aura, menstrual related migraine, chronic migraine, tension-type headache and cluster headache. Consideration will also be given to people whose headaches have characteristics of more than one primary headache disorder.
  • Diagnosis of medication overuse headache.
  • Characteristics of headaches that may be related to serious underlying disease and need specific investigations and management.
  • Acute pharmacological management of the specified primary headaches with: antiemetics, aspirin, non-steroidal anti-inflammatory drugs (NSAIDs), opioids, oxygen, paracetamol and triptans.
  • Prophylactic pharmacological treatment for specified primary headaches with: ACE inhibitors and angiotensin II receptor antagonists, antidepressants (serotonin–norepinephrine reuptake inhibitors, selective serotonin reuptake inhibitors and tricyclics), beta blockers, calcium channel antagonists, corticosteroids, lithium, melatonin, neuromodulators or antiepileptics and serotonergic modulators (for example, pizotifen).
  • Non-pharmacological treatment for the specified primary headaches with: acupuncture, dietary supplements, education and self-management programmes, imaging, lifestyle factors (dietary manipulation and exercise), manual therapies and psychological therapies.
  • Information and support for patients and carers.
  • Prevention and treatment of medication overuse headache.
  • Management during pregnancy.
  • Choice of contraception in women with migraine.

For further details please refer to the scope in Appendix A (and review questions in section 2.6).

2.4. What this guideline does not cover

This guideline does not cover:

  • Children aged under 12.
  • Management of primary headaches other than those specified in 2.3.
  • Investigation and management of secondary headache other than medication overuse headache.
  • Diagnosis and management of cranial neuralgias and facial pain.
  • Management of comorbidities.

2.5. Relationships between the guideline and other NICE guidance

Related NICE Health Technology Appraisals

Botulinum toxin type A for the prevention of headaches in adults with chronic migraine. NICE technology appraisal 260 (2012).

Related NICE Clinical Guidelines

Patient experience in adult NHS service. NICE clinical guideline 138 (2012).

The epilepsies. NICE clinical guideline 137 (2012).

Hypertension. NICE clinical guideline 127 (2011).

Anxiety. NICE clinical guideline 113 (2011).

Depression in adults. NICE clinical guideline 90 (2009).

Glaucoma. NICE clinical guideline 85 (2009).

Medicines adherence. NICE clinical guideline 76 (2009).

Stroke. NICE clinical guideline 68 (2008).

Head injury. NICE clinical guideline 56 (2007).

Referral guidelines for suspected cancer. NICE clinical guideline 27 (2005).

NICE is developing the following guidance (details available from www.nice.org.uk)

Head injury. NICE clinical guideline 56 (2007). Currently being updated. Publication tbc.

Referral guidelines for suspected cancer. NICE clinical guideline 27 (2005). Currently being updated. Publication expected March 2014.

Methods

This guidance was developed in accordance with the methods outlined in the NICE Guidelines Manual 2009181.

Particular consideration will be given to the needs of girls and women of reproductive age.

2.6. Developing the review questions and outcomes

Review questions were developed in a PICO framework (patient, intervention, comparison and outcome) for intervention reviews, a framework of population, index tests, reference standard and target condition for reviews of diagnostic test accuracy, and population, presence or absence of risk factors and list of ideal minimum confounding factors for reviews of prognostic factors. This was to guide the literature searching process and to facilitate the development of recommendations by the guideline development group (GDG). They were drafted by the NCGC technical team and refined and validated by the GDG. The questions were based on the key clinical areas identified in the scope (Appendix A). Further information on the outcome measures examined follows this section.

For questions on prognostic factors, protocols stated the risk factor that would be searched for instead of the intervention and comparison.

The review question to determine the diagnostic criteria for primary headaches was the one exception to the usual systematic review process. The GDG agreed that these criteria were well established by the International Headache Society in the International Classification of Headache Disorders criteria 106. The GDG used these criteria as a basis to form the recommendations in a format intended to be useful to a clinician. Full details are in chapter 7.

Table 1Review questions

ChapterReview questionsOutcomes
Assessment and diagnosis:
Indications for consideration of additional investigation
For young people and adults with HIV presenting with new onset headache, how common are serious intracranial abnormalities?
  • Occurrence of serious intracranial abnormalities (as reported)
For young people and adults with a history of malignancy presenting with new onset headache, how common are serious intracranial abnormalities?
  • Occurrence of serious intracranial abnormalities (as reported)
For young people and adults presenting with early morning headache or new onset frequent headache that lasts for more than one month, how common are serious intracranial abnormalities?
  • Occurrence of serious intracranial abnormalities (as reported)
Assessment and diagnosis:
Identifying people with primary headache
What is the accuracy of case finding questionnaires for diagnosing primary headache disorders and medication overuse headache?
  • Positive predictive value
  • Negative predictive value
  • Sensitivity
  • Specificity.
Assessment and diagnosis:
Headache diaries for the diagnosis and management of primary headaches and medication overuse headache
What is the clinical effectiveness of using diaries for the diagnosis of people with suspected primary headaches and medication overuse headache?
  • Number of people correctly diagnosed
  • Positive predictive value
  • Negative predictive value
  • Sensitivity
  • Specificity.
What is the clinical effectiveness, and patients’ and practitioners’ experience, of using diaries for the management of people with primary headaches and medication overuse headache?
  • Clinical headache outcomes (for RCTs)
  • Patients’ and practitioners’ experience of using diaries.
Assessment and diagnosis:
Diagnosis of primary headaches and medication overuse headache
For young people and adults with headache, what are the key diagnostic features of the following headaches:
  • Migraine with or without aura
  • Menstrual related migraine
  • Chronic migraine
  • Tension-type headache
  • Cluster headache
  • Medication overuse headache.
N/A
Assessment and diagnosis:
The role of imaging in diagnosis and management of primary headaches
Should young people and adults with suspected primary headaches be imaged to rule out serious pathology?Percent with the following serious abnormalities:
  • Tumour/neoplasm (subdivide into types)
  • Abscess
  • Subdural haematoma
  • Hydrocephalus
  • Arterio-venous malformations.
For people with the following primary headaches (migraine with or without aura, menstrual related migraine, chronic migraine, tension type headache, cluster headache), what is the clinical evidence and cost-effectiveness of imaging as a management strategy?
  • Resource use including GP consultation, A&E attendance, investigations and referral to secondary care
  • Change in headache frequency and intensity (with e.g. headache impact test or migraine disability assessment test)
  • Percentage responders with 25%, 50% and 75% reduction in baseline headache frequency
  • Change in frequency of acute medication use
  • Change in anxiety and depression (e.g. HAD)
  • Change in health related quality of life (e.g. SF-36 or EuroQoL)
  • Incidental radiological findings.
Management:
Information and support
What information and support do patients with primary headaches say they want?
  • Patients’ preferences
Management:
Acute pharmacological treatment of tension type headache
In people with tension type headache, what is the clinical evidence and cost-effectiveness for acute pharmacological treatment with: aspirin, NSAIDs, opioids and paracetamol?
  • Time to freedom from pain
  • Headache response at up to 2 hours
  • Pain free at 2 hours
  • Pain intensity difference
  • Sustained headache response at 24 hours
  • Sustained freedom from pain at 24 hours
  • Functional health status and health related quality of life (e.g. SF-36 or EuroQoL)
  • Incidence of serious adverse events.
Management:
Acute pharmacological treatment of migraine
In people with migraine with or without aura, what is the clinical evidence and cost-effectiveness for acute pharmacological treatment with: antiemetics, aspirin, NSAIDs, opioids, paracetamol, triptans, ergots and corticosteroids?
  • Time to freedom from pain
  • Headache response at up to 2 hours
  • Freedom from pain at up to 2 hours
  • Sustained headache response at 24 hours
  • Sustained freedom from pain at 24 hours
  • Headache specific quality of life
  • Functional health status and health related quality of life
  • Incidence of serious adverse events.
Management:
Acute pharmacological treatment of cluster headache
In people with cluster headache, what is the clinical evidence and cost-effectiveness for acute pharmacological treatment with: aspirin, paracetamol, oxygen, triptans, ergots, NSAIDs and opioids?
  • Time to freedom from pain
  • Headache response up to 2 hours
  • Reduction in pain at 30 minutes
  • Functional health status and health related quality of life
  • Incidence of serious adverse events.
Management:
Prophylactic pharmacological treatment of tension type headache
In people with tension type headache, what is the clinical evidence and cost-effectiveness for prophylactic pharmacological treatment with: ACE inhibitors and angiotensin II receptor antagonists (ARBs), antidepressants (SNRIs, SSRIs, tricyclics), beta blockers and antiepileptics?
  • Change in patient-reported headache days, frequency and intensity
  • Functional health status and health-related quality of life
  • Responder rate
  • Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic pharmacological treatment of migraine
In migraine with or without aura and chronic migraine, what is the clinical evidence and cost-effectiveness for prophylactic pharmacological treatment with: ACE inhibitors and angiotensin II receptor antagonists (ARBs), antidepressants (SNRIs, SSRIs, tricyclics), beta blockers, calcium channel blockers, antiepileptics and other serotonergic modulators?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic pharmacological treatment of menstrual migraine
In people with pure menstrual and menstrual related migraine, what is the clinical evidence and cost-effectiveness for prophylactic pharmacological treatment with: ACE inhibitors and angiotensin II receptor antagonists, antidepressants (SNRIs, SSRIs, tricyclics), beta blockers, calcium channel blockers, antiepileptics, triptans, other serotonergic modulators, NSAIDs and hormonal therapy (contraceptives)?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic pharmacological treatment of cluster headache
In people with cluster headache, what is the clinical evidence and cost-effectiveness for prophylactic pharmacological treatment with: calcium channel blockers, corticosteroids, lithium, melatonin, antiepileptics, triptans and other serotonergic modulators?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life
  • Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic non-pharmacological management of primary headaches with acupuncture
For people with primary headaches, what is the clinical evidence and cost-effectiveness of management with acupuncture?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life
  • Headache specific quality of life
  • Resource use, including GP consultation, A&E attendance, investigations and referral to secondary care
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic non-pharmacological management of primary headaches with manual therapies
For people with primary headaches, what is the clinical evidence and cost-effectiveness of non-pharmacological management with manual therapies?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life
  • Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic non-pharmacological management of primary headaches with psychological therapies
For people with primary headaches, what is the clinical evidence and cost-effectiveness of non-pharmacological management with psychological therapies?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life
  • Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic non-pharmacological management of primary headaches with herbal remedies and dietary supplements
For people with primary headaches, what is the clinical evidence and cost-effectiveness of management with herbal remedies?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life
  • Headache specific quality of life
  • Resource use, including GP consultation, A&E attendance, investigations and referral to secondary care
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
For people with primary headaches, what is the clinical evidence and cost-effectiveness of management with dietary supplements (e.g. magnesium, vitamin B12, coenzyme Q10 and riboflavin (vitamin B2)).
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life
  • Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic non-pharmacological management of primary headaches with exercise
For people with primary headaches, what is the clinical evidence and cost-effectiveness of non-pharmacological management with exercise programmes?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life
  • Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Incidence of serious adverse events.
Management:
Prophylactic non-pharmacological management of primary headaches with education and self-management
For people with primary headaches, what is the clinical evidence and cost-effectiveness of non-pharmacological management with education and self-management programmes?
  • Change in patient-reported headache days, frequency and intensity
  • Responder rate
  • Functional health status and health-related quality of life
  • Headache specific quality of life
  • Resource use
  • Use of acute pharmacological treatment
  • Patient’s perception of the usefulness of programmes.
Management:
Medication overuse headache
What is the clinical evidence and cost-effectiveness of withdrawal strategies (of abortive treatments), psychological therapies, corticosteroids and NSAIDs for the treatment of probable medication overuse headache (MOH)?
  • Change in acute medication use (up to 3 months)
  • Relapse back to MOH
  • Responder rate (proportion who no longer have probable MOH)
  • Change in patient reported headache days, frequency and intensity
  • Headache specific quality of life
  • Resource use
  • Functional health status and health related quality of life.
Management during pregnancy and contraceptive use:
Management of primary headaches during pregnancy
What is the evidence for adverse fetal events in females with primary headaches during pregnancy using triptans?
  • Fetal adverse events.
What is the evidence for adverse fetal events in females using oxygen or verapamil during pregnancy?
  • Fetal adverse events.
Management during pregnancy and contraceptive use:
Combined hormonal contraception use in girls and women with migraine
What risks are associated with use of hormonal contraception in females aged 12 or over with migraine?
  • Incidence of serious adverse events
  • Worsening effect on headache disorder.

2.7. Searching for evidence

2.7.1. Clinical literature search

Systematic literature searches were undertaken to identify evidence within published literature in order to answer the review questions as per The Guidelines Manual [2009]181. Clinical databases were searched using relevant medical subject headings, free-text terms and study type filters where appropriate. Studies published in languages other than English were not reviewed. Where possible, searches were restricted to articles published in English language. All searches were conducted on MEDLINE and Embase. The Cochrane Library was searched for all intervention questions. Additional subject specific databases were used for some questions: Cinahl for diaries, treatment questions and patient information; PsycINFO for education and self-management programmes, psychological therapies, medication over use headaches and patient information; AMED for non-pharmacological treatment of headaches. All searches were updated on 13 March 2012. No papers after this date were considered.

Search strategies were checked by looking at reference lists of relevant key papers, checking search strategies in other systematic reviews and asking the GDG for known studies. The questions, the study types applied, the databases searched and the years covered can be found in Appendix D.

During the scoping stage, a search was conducted for guidelines and reports on the websites listed below and on organisations relevant to the topic. A full list of websites is included in Appendix D. Searching for grey literature or unpublished literature was not undertaken. All references sent by stakeholders were considered.

2.7.2. Health economic literature search

Systematic literature searches were also undertaken to identify health economic evidence within published literature relevant to the review questions. The evidence was identified by conducting a broad search relating to the guideline population in the NHS economic evaluation database (NHS EED), the Health Economic Evaluations Database (HEED) and health technology assessment (HTA) databases with no date restrictions. Additionally, the search was run on MEDLINE, with a specific economic filter, from 2008, to ensure recent publications that had not yet been indexed by these databases were identified. Studies published in languages other than English were not reviewed. Where possible, searches were restricted to articles published in English language.

The search strategies for health economics are included in Appendix D. All searches were updated on 18 January 2012. No papers published after this date were considered.

2.8. Evidence of clinical effectiveness

2.8.1. Literature review

The process for review of evidence of effectiveness is as follows:

The Research Fellows:

  • Identified potentially relevant studies for each review question from the relevant search results by reviewing titles and abstracts – full papers were then obtained.
  • Reviewed full papers against pre-specified inclusion/exclusion criteria to identify studies that addressed the review question in the appropriate population and reported on outcomes of interest (review protocols are included in Appendix C, excluded studies lists are in Appendix O. The excluded studies list only details studies excluded after the full papers were ordered. Many would have previously been excluded when the titles and abstracts were reviewed.
  • Critically appraised relevant studies using the appropriate checklist as specified in The Guidelines Manual181.
  • Extracted key information about the study’s methods and results into evidence tables (evidence tables are included in Appendix E.
  • Generated summaries of the evidence by outcome (included in the relevant chapter write-ups) and produced evidence statements indicating the number of included studies, sample size (number randomised), direction of effect, uncertainty and GRADE quality rating:
    • Randomised studies: meta analysed, where appropriate and reported in GRADE profiles (for clinical studies) – see below for details
    • Observational studies: data presented as a range of values in adapted GRADE profiles
    • Diagnostic studies: data presented as a range of values in adapted GRADE profiles
    • Prognostic studies: data presented as a range of values in adapted GRADE profiles
    • Qualitative studies: the quality of reporting for each study was summarised for three criteria in the guideline text: population, methods and analysis.

2.8.2. Inclusion/exclusion

See the review protocols in Appendix C for full details.

Note these key points:

The age range for this guideline was 12 years and over. Studies that included people younger than 12 were included only if the mean age of the population was over 12 years.

Crossover trials were only included in the review questions for acute treatment, however they were only included if it was clear from the paper that all participants included in the analysis had treated one headache attack only with each treatment, or if the data for the first crossover period only was available, in which case the study could be analysed as a parallel trial.

Placebo controlled trials were not included for the review question on the acute treatment of migraine as the GDG agreed that people seeking medical help for a migraine attack would have already tried over the counter medications. Therefore drug trials only were included if there was a head-to-head comparison.

The GDG agreed that for the majority of intervention review questions a sample size cut-off of 50 participants (25 per arm) was appropriate due to there being sufficient evidence with sample sizes greater than 50 which would provide a better estimate of the effect size. For most prognostic and diagnostic review questions, lager sample size cut-offs were applied (Chapters 5, 24 and 25). There were some exceptions in which lower sample size cut-offs were applied, or not cut-off values, when the GDG were aware that sufficient evidence at larger sample sizes would be lacking. These were:

  • Indications for consideration of additional investigation (Chapter 4) – Minimum n=any
  • Headache diaries for the diagnosis and management of primary headaches and medication overuse headache (Chapter 6) – Minimum n=any
  • Imaging for diagnosis in people with suspected primary headache (Chapter 8.2) – Minimum n=any
  • Imaging as a management strategy for people with suspected primary headaches (Chapter 8.3) – Minimum n=20 per arm
  • Patient information and support (Chapter 9) – Minimum n=any
  • Acute pharmacological treatment of cluster headache (Chapter 12) – Minimum n=any
  • Prophylactic pharmacological treatment of cluster headache (Chapter 16) – Minimum n=any
  • Prophylactic non-pharmacological management of primary headaches with psychological therapies (Chapter 19) – Minimum n=25 total
  • Prophylactic non-pharmacological management of primary headaches with education and self management (Chapter 22) – Minimum n=25 total.

2.8.3. Methods of combining clinical studies

Data synthesis for intervention reviews

Available case analysis

Estimates of effect from individual studies were based on available case analysis (ACA) where it was possible to extract these data. ACA was defined as analysis using all participants with data available for the outcome being considered. For example, for dichotomous outcomes, the denominator is the number of participants with available data and the numerator is the number who experienced the event. Participants for whom data for that outcome were not available are assumed to be missing at random. Where ACA was not possible data were reported as in the study and this is explained in the introduction of the relevant clinical review.

Meta-analyses

Where possible, meta-analyses were conducted to combine the results of studies for each review question using Cochrane Review Manager (RevMan5.1) software (http://ims.cochrane.org/revman).

Fixed-effects (Mantel-Haenszel) techniques were used to calculate risk ratios (relative risk) for the binary outcomes: responder rate; resource use including GP consultation, accident and emergency attendance, investigations and referral to secondary care; percentage responders with 25%, 50% and 75% reduction in baseline headache frequency; incidental radiological findings; headache response up to 2 hours; freedom from pain at up to 2 hours; sustained freedom from pain at 24 hours; sustained headache response at 24 hours; acute medication use; incidence of serious adverse events.

The continuous outcomes (change in patient-reported headache days, frequency and intensity; change in anxiety and depression (e.g. HAD); change in health related quality of life (e.g. SF-36 or EuroQoL); change in headache specific quality of life) were analysed using an inverse variance method for pooling weighted mean differences and where the studies had different scales, standardised mean differences were used. Final values were reported where available for continuous outcomes in preference of change scores. However, if change scores only were available, these were reported and meta-analysed with final values.

Statistical heterogeneity was assessed by considering the chi-squared test for significance at p<0.1 or an I-squared inconsistency statistic of >50% to indicate significant heterogeneity. Where significant heterogeneity was present, we carried out predefined subgroup analyses if possible. Subgroups were: age (12<18, or 18 and over), dose or route of administration.

Assessments of potential differences in effect between subgroups were based on the chi-squared tests for heterogeneity statistics between subgroups. If no sensitivity analysis was found to completely resolve statistical heterogeneity then a random effects (DerSimonian and Laird) model was employed to provide a more conservative estimate of the effect.

The means and standard deviations of continuous outcomes were required for meta-analysis. However, in cases where standard deviations were not reported, the standard error was calculated if the p-values or 95% confidence intervals were reported and meta-analysis was undertaken with the mean and standard error using the generic inverse variance method in Cochrane Review Manager (RevMan5) software. When the only evidence was based on studies which only presented means, this information was summarised in the GRADE tables without calculating the relative and absolute effect.

For binary outcomes, absolute event rates were also calculated using the GRADEpro software using event rate in the control arm of the pooled results.

Network meta-analyses

Network meta-analysis was conducted for the review questions on the acute and prophylactic treatment of migraine. This allowed indirect comparisons of all the drugs included in the review when no direct comparison was available. A hierarchical Bayesian network meta-analysis (NMA) was performed using the software WinBUGS. We adapted a three-arm random effects model template for the networks, from the University of Bristol website (https://www.bris.ac.uk/cobm/research/mpes/mtc.html). This model accounts for the correlation between study level effects induced by multi-arm trials. The model used was based on a random effects logistic regression, with parameters estimated by Markov chain Monte Carlo simulation.

Four network meta-analyses were run for the acute treatment of migraine, each for binary outcomes: headache response at up to 2 hours; freedom from pain at up to 2 hours; sustained headache response at 24 hours and sustained freedom from pain at 24 hours. The log odds ratios were calculated and converted into relative risks for comparison to the direct comparisons. The ranking of interventions was also calculated based on their relative risks compared to the control group. For the acute treatment of migraine, one network was run for change in patient reported migraine days. The change in migraine days for each treatment was calculated, as well as the overall ranking of each treatment based on the effect size compared to placebo.

Data synthesis for prognostic factor reviews

Odds ratio, relative risks or hazard ratios, with their 95% confidence intervals, from multivariate analyses were extracted from the papers, and standard errors were calculated from the 95% confidence intervals. The log of the effect size with its standard error was entered into the generic inverse variance technique in the Cochrane Review Manager (RevMan5) software (http://ims.cochrane.org/revman). Studies were not combined in a meta-analysis for observational studies.

The quality of studies was assessed and presented in an adapted GRADE profile according to criteria stated in the methodology checklist for prognostic studies in the guidelines manual. Results were reported as ranges.

Data synthesis for diagnostic test accuracy review

Evidence for diagnostic data were evaluated by study, using version two of the Quality Assessment of Diagnostic Accuracy Studies checklists (QUADAS-2) (http://www.bris.ac.uk/quadas/quadas-2).

For diagnostic test accuracy studies, the following outcomes were reported: sensitivity, specificity, positive predictive value and negative predictive value. In cases where the outcomes were not reported, 2 by 2 tables were constructed from raw data to allow calculation of these accuracy measures. Summary receiver operative characteristic (ROC) curves, would have been generated if appropriate, however there were no data in the diagnostic reviews included in this guideline that could be combined to produce an ROC curve or diagnostic meta-analysis.

Data synthesis for qualitative review

Themes were identified from these studies by two reviewers independently, and then verified jointly. These themes were supplemented with data from surveys where available. Common themes relevant to the question are reported in a narrative in the guideline text.

Appraising the quality of evidence by outcomes

The evidence for outcomes from the included RCT and observational studies were evaluated and presented using an adaptation of the ‘Grading of Recommendations Assessment, Development and Evaluation (GRADE) toolbox’ developed by the international GRADE working group (http://www.gradeworkinggroup.org/). The software (GRADEpro) developed by the GRADE working group was used to assess the quality of each outcome, taking into account individual study quality and the meta-analysis results. The summary of findings was presented as two separate tables in this guideline. The ‘Clinical/Economic Study Characteristics’ table includes details of the quality assessment while the ‘Clinical /Economic Summary of Findings’ table includes pooled outcome data, where appropriate, an absolute measure of intervention effect and the summary of quality of evidence for that outcome. In this table, the columns for intervention and control indicate the sum of the sample size for continuous outcomes. For binary outcomes such as number of people with an adverse event, the event rates (n/N: number of people with events divided by sum of number of people) are shown with percentages. Reporting or publication bias was only taken into consideration in the quality assessment and included in the Clinical Study Characteristics table if it was apparent. Each outcome was examined separately for the quality elements listed and defined in Table 2 and each graded using the quality levels listed in Table 3. The main criteria considered in the rating of these elements are discussed below (see section 2.8.4 Grading of Evidence). Footnotes were used to describe reasons for grading a quality element as having serious or very serious problems. The ratings for each component were summed to obtain an overall assessment for each outcome.

Table 2. Description of quality elements in GRADE for intervention studies.

Table 2

Description of quality elements in GRADE for intervention studies.

Table 3. Levels of quality elements in GRADE.

Table 3

Levels of quality elements in GRADE.

2.8.4. Grading the quality of clinical evidence

After results were pooled, the overall quality of evidence for each outcome was considered. The following procedure was adopted when using GRADE:

  1. A quality rating was assigned, based on the study design. RCTs start HIGH and observational studies as LOW, uncontrolled case series as LOW or VERY LOW.
  2. The rating was then downgraded for the specified criteria: Study limitations, inconsistency, indirectness, imprecision and reporting bias. These criteria are detailed below. Observational studies were upgraded if there was: a large magnitude of effect, dose-response gradient, and if all plausible confounding would reduce a demonstrated effect or suggest a spurious effect when results showed no effect. Each quality element considered to have ‘serious’ or ‘very serious’ risk of bias were rated down −1 or −2 points respectively.
  3. The downgraded/upgraded marks were then summed and the overall quality rating was revised. For example, all RCTs started as HIGH and the overall quality became MODERATE, LOW or VERY LOW if 1, 2 or 3 points were deducted respectively.
  4. The reasons or criteria used for downgrading were specified in the footnotes.

The details of criteria used for each of the main quality element are discussed further in the following sections 2.8.5 to 2.8.8.

2.8.5. Study limitations

The main limitations for randomised controlled trials are listed in Table 4.

Table 4. Study limitations of randomised controlled trials.

Table 4

Study limitations of randomised controlled trials.

The GDG agreed that wherever possible, except for acute pharmacological treatment of migraine (see chapter 11 for more information), comparators for intervention studies should be a placebo (or an active control for the case of non-pharmacological treatments) or another active intervention in a double blind situation. The GDG accepted that there were some non-pharmacological intervention studies were participant blinding was impossible or very hard to achieve in most situations (exercise, chapter 21, manual therapy, chapter 18, and education and self-management, chapter 22). Nevertheless, open-label studies for these intervention studies were downgraded to maintain a consistent approach in quality rating across the guideline; however, with interventions where a placebo or active control was possible, open label studies would be excluded.

Table 4 lists the limitations considered for randomised controlled trials.

2.8.6. Inconsistency

Inconsistency refers to an unexplained heterogeneity of results. When estimates of the treatment effect across studies differ widely (i.e. heterogeneity or variability in results), this suggests true differences in underlying treatment effect. When heterogeneity existed (Chi square p<0.1 or I-squared inconsistency statistic of >50%), but no plausible explanation can be found, the quality of evidence was downgraded by one or two levels, depending on the extent of uncertainty to the results contributed by the inconsistency in the results. In addition to the I- square and Chi square values, the decision for downgrading was also dependent on factors such as whether the intervention is associated with benefit in all other outcomes or whether the uncertainty about the magnitude of benefit (or harm) of the outcome showing heterogeneity would influence the overall judgment about net benefit or harm (across all outcomes).

If inconsistency could be explained based on pre-specified subgroup analysis, the GDG took this into account and considered whether to make separate recommendations based on the identified explanatory factors, i.e. population and intervention. Where subgroup analysis gives a plausible explanation of heterogeneity, the quality of evidence would not be downgraded.

2.8.7. Indirectness

Directness refers to the extent to which the populations, intervention, comparisons and outcome measures are similar to those defined in the inclusion criteria for the reviews. Indirectness is important when these differences are expected to contribute to a difference in effect size, or may affect the balance of harms and benefits considered for an intervention.

In this guideline the age range was people aged 12 and older. In cases where the population in the studies included children younger than 12, the studies were included if the average age was over 12, but the evidence would be down-graded for indirectness.

If the headache population included people with mixed headache types in the intervention reviews, the evidence would also be down-graded.

2.8.8. Imprecision

Imprecision refers to the certainty in the effect for the outcome. When results are imprecise or very imprecise we are uncertain if there is an important difference between interventions or not.

Minimally important difference (MID)

The thresholds of important benefits or harms, or the MID for an outcome are important considerations for determining whether there is a “clinically important” difference between intervention and control groups and in assessing imprecision.

For continuous outcomes, the MID is defined as “the smallest difference in score in the outcome of interest that informed patients or informed proxies perceive as important, either beneficial or harmful, and that would lead the person or clinician to consider a change in the management”93,107,222,223. For dichotomous outcomes, the MID is considered in terms of changes of both absolute and relative risk.

The GDG were asked at the outset of the guideline if they were aware of any established values for MIDs for the outcomes included in the review. Two published values were highlighted for the following outcomes; migraine specific quality of life questionnaire (MSQ) and; the headache impact test. The values reported in these publications were used to determine imprecision of the point estimates for these two outcomes:

  • Migraine-Specific Quality of Life Questionnaire (MSQ)41
    • Role restrictive domain: 3.2
    • Role preventive domain: 4.6
    • Emotional functioning domain: 7.5.
  • Headache Impact Test (HIT-6)38: 2.3.

For the majority of the outcomes, there were no published MIDs. The GDG agreed that the default values stated in the GRADEpro were appropriate for these outcomes, and would account for the >20% improvement rate in placebo arms of headache trials. The default thresholds suggested by GRADE are a relative risk reduction of 25% (relative risk of 0.75 for negative outcomes) or a relative risk increase of 25% (risk ratio 1.25 for positive outcomes) for dichotomous outcomes. For continuous outcomes two approaches were used. When only one trial was included as the evidence base for an outcome, the mean difference was converted to the standardized mean difference (SMD) and checked to see if the confidence interval crossed 0.5. However, the mean difference (95% confidence interval) was still presented in the Grade tables. If two or more included trials reported a quantitative outcome then the default approach of multiplying 0.5 by standard deviation (taken as the median of the standard deviations across the meta-analyzed studies) was employed.

There was one exception, the GDG chose to apply a specific MID for change in migraine / headache days as this was deemed the most important outcome for prophylactic reviews. After discussion, the GDG agreed by informal consensus that an MID of 0.5 days was appropriate for this outcome.

Assessing imprecision

The confidence interval for the pooled or best estimate of effect was considered in relation to the MIDs to assess imprecision. If the confidence interval crossed the MID threshold, there was uncertainty in the effect estimate supporting our recommendation (because the CI was consistent with two decisions) and the effect estimate was rated as having serious imprecision. If both MIDs were crossed, the effect estimate was rated as having very serious imprecision.

Assessing clinical importance

For the purposes of this guideline, clinical importance was assessed by comparing the effect estimate against the MID and reviewing the absolute effect reported in the GRADE summary table. For example, if the effect size was small (less than the MID), this finding suggests that there may not be enough difference to recommend one intervention over the other based on that outcome, unless in exceptional circumstances, the GDG agreed that the absolute effect was great enough to reach clinical importance. An effect estimate larger than the MID is considered to be clinically important.

Figure 1 illustrates how the clinical importance of effect estimates were considered along with imprecision. This is documented in the evidence statements throughout this guideline.

Figure 1. Illustration of precise and imprecise outcomes based on the confidence interval of outcomes in a forest plot in relation to the MID.

Figure 1

Illustration of precise and imprecise outcomes based on the confidence interval of outcomes in a forest plot in relation to the MID. Source: NCGC methods manual

Evidence statements

Evidence statements were formed for each outcome indicating the quantity and quality of evidence available, and the outcome and population to which they relate. Below are some examples to illustrate how the wording indicates the imprecision (uncertainty) and clinical importance:

  • Precise, both the point estimate and confidence intervals are outside the MID :
    Xx studies with xx people showed that intervention a is more clinically effective than intervention b. [GRADE quality].
  • Precise, both the point estimate and confidence intervals are between the MID and no difference:
    Xx studies with xx people showed that intervention a is more effective than intervention b, but the effect size was too small to be clinically important. [GRADE quality].
  • Serious imprecision, point estimate outside the MID, and the confidence interval crosses the MID:
    Xx studies with xx people suggested that intervention a may be more clinically effective than intervention b, but there is some uncertainty. [GRADE quality].
  • Serious imprecision, point estimate between the MID and no difference, and the confidence interval crosses the MID:
    Xx studies with xx people suggested that intervention a may be more effective than intervention b, but the effect size is too small to be clinically important, and there is some uncertainty. [GRADE quality].
  • Very serious imprecision, point estimate outside the MID, and the confidence interval crosses the MID in both directions:
    Xx studies with xx people suggested that intervention a may be more clinically effective than intervention b, but there is considerable uncertainty. [GRADE quality].
  • Very serious imprecision, point estimate between the MID and no difference, and the confidence interval crosses the MID in both directions:
    Xx studies with xx people suggested that intervention a may be more effective than intervention b, but the effect size is too small to be clinically important, and there is considerable uncertainty. [GRADE quality].
  • Precise, point estimate close to line of no difference, confidence intervals just cross line of no difference:
    Xx studies with xx people showed that there is no difference between intervention a and intervention b. [GRADE quality].

When imprecision could not be assessed, the following statement will be used: “the difference is uncertain as no comparative analysis could be carried out”.

For diagnostic reviews, the imprecision was based on the outcome deemed to be most important, for example in cases where it was most important not to have a high number of false negative test results, the imprecision assessment would be based on specificity. No MID was defined for any of the diagnostic outcomes. The GDG were asked to review the evidence and agree the level of imprecision based on the confidence intervals around the effect size and absolute effect estimate.

2.9. Evidence of cost-effectiveness

Evidence on cost-effectiveness related to the key clinical issues being addressed in the guideline was sought. The health economist:

  • Undertook a systematic review of the economic literature
  • Undertook new cost-effectiveness analysis in priority areas.

2.9.1. Literature review

The Health Economist:

  • Identified potentially relevant studies for each review question from the economic search results by reviewing titles and abstracts – full papers were then obtained
  • Reviewed full papers against pre-specified inclusion / exclusion criteria to identify relevant studies (see below for details)
  • Critically appraised relevant studies using the economic evaluations checklist as specified in The Guidelines Manual181
  • Extracted key information about the study’s methods and results into evidence tables (evidence tables are included in Appendix E).
  • Generated summaries of the evidence in NICE economic evidence profiles (included in the relevant chapter write-ups) – see below for details.

2.9.1.1. Inclusion/exclusion

Full economic evaluations (studies comparing costs and health consequences of alternative courses of action: cost–utility, cost-effectiveness, cost-benefit and cost-consequence analyses) and comparative costing studies that addressed the review question in the relevant population were considered potentially applicable as economic evidence.

Studies that only reported cost per hospital (not per person), or only reported average cost effectiveness without disaggregated costs and effects, were excluded. Abstracts, posters, reviews, letters/editorials, foreign language publications and unpublished studies were excluded. Studies judged to have an applicability rating of ‘not applicable’ were excluded (this included studies that took the perspective of a non-OECD country).

Remaining studies were prioritised for inclusion based on their relative applicability to the development of this guideline and the study limitations. For example, if a high quality, directly applicable UK analysis was available other less relevant studies may not have been included. Where exclusions occurred on this basis, this is noted in the relevant section.

For more details about the assessment of applicability and methodological quality see the economic evaluation checklist in The Guidelines Manual181 and the health economics research protocol in Appendix C.

When no relevant economic analysis was found from the economic literature review, relevant UK NHS unit costs related to the compared interventions were presented to the GDG to inform the possible economic implication of the recommendation to make.

2.9.1.2. NICE economic evidence profiles

The NICE economic evidence profile has been used to summarise cost and cost-effectiveness estimates. The economic evidence profile shows, for each economic study, an assessment of applicability and methodological quality, with footnotes indicating the reasons for the assessment.

These assessments were made by the health economist using the economic evaluation checklist from The Guidelines Manual181. It also shows incremental costs, incremental outcomes (for example, QALYs) and the incremental cost-effectiveness ratio from the primary analysis, as well as information about the assessment of uncertainty in the analysis. See Table 5 for more details.

Table 5. Content of NICE economic profile.

Table 5

Content of NICE economic profile.

Where economic studies compare multiple strategies, results are presented in the economic evidence profiles for the pair-wise comparison specified in the review question, irrespective of whether or not that comparison was ‘appropriate’ within the analysis being reviewed. A comparison is ‘appropriate’ where an intervention is compared with the next most expensive non-dominated option – a clinical strategy is said to ‘dominate’ the alternatives when it is both more effective and less costly. Footnotes indicate if a comparison was ‘inappropriate’ in the analysis.

For particular studies or original models comparing multiple strategies, results are not reported in the standard economic profile but are instead presented at the end of the relevant chapter in a paragraph summarising the study/model as a whole.

2.9.2. Undertaking new health economic analysis

As well as reviewing the published economic literature for each review question, as described above, new economic analysis was undertaken by the Health Economist in priority areas. Priority areas for new health economic analysis were agreed by the GDG after formation of the review questions and consideration of the available health economic evidence.

Additional data for the analysis was identified as required through additional literature searches undertaken by the Health Economist, and discussion with the GDG. Model structure, inputs and assumptions were explained to and agreed by the GDG members during meetings, and they commented on subsequent revisions.

See Appendices J and L for details of the health economic analyses undertaken for the guideline.

2.9.3. Cost-effectiveness criteria

NICE’s report ‘Social value judgements: principles for the development of NICE guidance’ sets out the principles that GDG members should consider when judging whether an intervention offers good value for money180.

In general, an intervention was considered to be cost effective if either of the following criteria applied (given that the estimate was considered plausible):

  1. The intervention dominated other relevant strategies (that is, it was both less costly in terms of resource use and more clinically effective compared with all the other relevant alternative strategies), or
  2. The intervention cost less than £20,000 per quality-adjusted life-year (QALY) gained compared with the next best strategy.

If the GDG recommended an intervention that was estimated to cost more than £20,000 per QALY gained, or did not recommend one that was estimated to cost less than £20,000 per QALY gained, the reasons for this decision are discussed explicitly in the ‘from evidence to recommendations’ section of the relevant chapter with reference to issues regarding the plausibility of the estimate or to the factors set out in the ‘Social value judgements: principles for the development of NICE guidance’ 180.

2.10. Developing recommendations

Over the course of the guideline development process, the GDG was presented with:

  • Evidence tables of the clinical and economic evidence reviewed from the literature. All evidence tables are in Appendices E and F.
  • Summary of clinical and economic evidence and quality (as presented in chapters 425).
  • Forest plots (Appendix G).
  • A description of the methods and results of the cost-effectiveness analysis undertaken for the guideline (Appendices J and L).

Recommendations were drafted on the basis of the GDG interpretation of the available evidence, taking into account the balance of benefits, harms and costs. When clinical and economic evidence was of poor quality, conflicting or absent, the GDG drafted recommendations based on their expert opinion by informal consensus. The considerations for making consensus based recommendations include the balance between potential harms and benefits, economic or implications compared to the benefits, current practices, recommendations made in other relevant guidelines, patient preferences and equality issues. The consensus recommendations were formed through discussions in the GDG meetings, and voting when there was not clear agreement.

The main considerations specific to each recommendation are outlined in the linking evidence to recommendation section preceding the recommendation section.

This guideline recommends some drugs for indications for which they do not have a UK marketing authorisation at the date of publication, if there is evidence to support that use10. Where recommendations have been made for the use of drugs outside their licensed indications (‘off-label use’), these drugs are marked with a footnote in the recommendations. Drug dosages are specified in recommendations where the dosage for that indication is not included in the ‘British national formulary’.

2.10.1. Research recommendations

When areas were identified for which good evidence was lacking, the guideline development group considered making recommendations for future research. Decisions about inclusion were based on factors such as:

  • the importance to patients or the population
  • national priorities
  • potential impact on the NHS and future NICE guidance
  • ethical and technical feasibility.

2.10.2. Validation process

The guidance is subject to a six week public consultation and feedback as part of the quality assurance and peer review the document. All comments received from registered stakeholders are responded to in turn and posted on the NICE website when the pre-publication check of the full guideline occurs.

2.10.3. Updating the guideline

Following publication, and in accordance with the NICE guidelines manual181, NICE will ask a National Collaborating Centre or the National Clinical Guideline Centre to advise NICE’s Guidance executive whether the evidence base has progressed significantly to alter the guideline recommendations and warrant an update.

2.10.4. Disclaimer

Health care providers need to use clinical judgement, knowledge and expertise when deciding whether it is appropriate to apply guidelines. The recommendations cited here are a guide and may not be appropriate for use in all situations. The decision to adopt any of the recommendations cited here must be made by the practitioners in light of individual patient circumstances, the wishes of the person, clinical expertise and resources.

The National Clinical Guideline Centre disclaims any responsibility for damages arising out of the use or non-use of these guidelines and the literature used in support of these guidelines.

2.10.5. Funding

The National Clinical Guideline Centre was commissioned by the National Institute for Health and Clinical Excellence to undertake the work on this guideline.

Copyright © 2012, National Clinical Guideline Centre.

Apart from any fair dealing for the purposes of research or private study, criticism or review, as permitted under the Copyright, Designs and Patents Act, 1988, no part of this publication may be reproduced, stored or transmitted in any form or by any means, without the prior written permission of the publisher or, in the case of reprographic reproduction, in accordance with the terms of licences issued by the Copyright Licensing Agency in the UK. Enquiries concerning reproduction outside the terms stated here should be sent to the publisher at the UK address printed on this page.

The use of registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant laws and regulations and therefore for general use.

The rights of National Clinical Guideline Centre to be identified as Author of this work have been asserted by them in accordance with the Copyright, Designs and Patents Act, 1988.

Bookshelf ID: NBK327501