METHODS USED TO DEVELOP THIS GUIDELINE

National Collaborating Centre for Mental Health (UK)

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

National Collaborating Centre for Mental Health (UK). Antenatal and Postnatal Mental Health: Clinical Management and Service Guidance: Updated edition. Leicester (UK): British Psychological Society; 2014 Dec. (NICE Clinical Guidelines, No. 192.)

See 2018 Partial Update

April 2018: Footnotes and cautions have been added and amended by NICE to link to the MHRA's latest advice and resources on sodium valproate. Sodium valproate must not be used in pregnancy, and only used in girls and women when there is no alternative and a pregnancy prevention plan is in place. This is because of the risk of malformations and developmental abnormalities in the baby.

April 2018: Footnotes and cautions have been added and amended by NICE to link to the MHRA's latest advice and resources on sodium valproate. Sodium valproate must not be used in pregnancy, and only used in girls and women when there is no alternative and a pregnancy prevention plan is in place. This is because of the risk of malformations and developmental abnormalities in the baby.

Cover of Antenatal and Postnatal Mental Health

Antenatal and Postnatal Mental Health: Clinical Management and Service Guidance: Updated edition.

Show details

Contents

< Prev Next >

3METHODS USED TO DEVELOP THIS GUIDELINE

3.1. OVERVIEW

The development of this guideline followed The Guidelines Manual (NICE, 2012a). A team of health and social care professionals, lay representatives and technical experts known as the Guideline Development Group (GDG), with support from the NCCMH staff, undertook the development of a person-centred, evidence-based guideline. There are seven basic steps in the process of developing a guideline:

Define the scope, which lays out exactly what will be included (and excluded) in the guidance.
Define review questions that cover all areas specified in the scope.
Develop a review protocol for each systematic review, specifying the search strategy and method of evidence synthesis for each review question.
Synthesise data retrieved, guided by the review protocols.
Produce evidence profiles and summaries using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) system.
Consider the implications of the research findings for clinical practice and reach consensus decisions on areas where evidence is not found.
Answer review questions with evidence-based recommendations for clinical practice.

The clinical practice recommendations made by the GDG are therefore derived from the most up-to-date and robust evidence for the clinical and cost effectiveness of the interventions and services covered in the scope. Where evidence was not found or was inconclusive, the GDG discussed and attempted to reach consensus on what should be recommended, factoring in any relevant issues. In addition, to ensure a service user and carer focus, the concerns of service users and carers regarding health and social care have been highlighted and addressed by recommendations agreed by the whole GDG.

3.2. THE SCOPE

Topics are referred by the Secretary of State and the letter of referral defines the remit, which defines the main areas to be covered (see The Guidelines Manual [NICE, 2012a] for further information). The NCCMH developed a scope for the guideline based on the remit (see Appendix 1). The purpose of the scope is to:

provide an overview of what the guideline will include and exclude
identify the key aspects of care that must be included
set the boundaries of the development work and provide a clear framework to enable work to stay within the priorities agreed by NICE and the National Collaborating Centre, and the remit from the Department of Health/Welsh Assembly Government
inform the development of the review questions and search strategy
inform professionals and the public about expected content of the guideline
keep the guideline to a reasonable size to ensure that its development can be carried out within the allocated period.

An initial draft of the scope was sent to registered stakeholders who had agreed to attend a scoping workshop. The workshop was used to:

obtain feedback on the selected key clinical issues
identify which population subgroups should be specified (if any)
seek views on the composition of the GDG
encourage applications for GDG membership.

The draft scope was subject to consultation with registered stakeholders over a 6-week period. During the consultation period, the scope was posted on the NICE website (www.nice.org.uk). Comments were invited from stakeholder organisations. The NCCMH and NICE reviewed the scope in light of comments received, and the revised scope was signed off by NICE.

3.3. THE GUIDELINE DEVELOPMENT GROUP

During the consultation phase, members of the GDG were appointed by an open recruitment process. GDG membership consisted of: professionals in psychiatry, clinical psychology, nursing, health visiting, obstetrics, midwifery and general practice; academic experts in psychiatry and psychology, a mother infant specialist service users and a representative from a service user organisation. The guideline development process was supported by staff from the NCCMH, who undertook the clinical and health economic literature searches, reviewed and presented the evidence to the GDG, managed the process, and contributed to drafting the guideline.

3.3.1. Guideline Development Group meetings

Twelve GDG meetings were held between 14 March, 2013, and 2 September, 2014. During each day-long GDG meeting, in a plenary session, review questions and clinical and economic evidence were reviewed and assessed, and recommendations formulated. At each meeting, all GDG members declared any potential conflicts of interest (see Appendix 2), and service user concerns were routinely discussed as a standing agenda item.

3.3.2. Topic groups

The GDG divided its workload along clinically relevant lines to simplify the guideline development process, and GDG members formed smaller topic groups to undertake guideline work in that area of clinical practice. Topic Group 1 covered questions relating to case identification. Topic Group 2 covered psychological and psychosocial interventions, and Topic Group 3 covered pharmacological interventions. These groups were designed to efficiently manage the large volume of evidence appraisal prior to presenting it to the GDG as a whole. Each topic group was chaired by a GDG member with expert knowledge of the topic area (one of the healthcare professionals). Topic groups refined the review questions and the clinical definitions of treatment interventions, reviewed and prepared the evidence with the systematic reviewer before presenting it to the GDG as a whole, and helped the GDG to identify further expertise in the topic. Topic group leaders reported the status of the group's work as part of the standing agenda. They also introduced and led the GDG's discussion of the evidence review for that topic and assisted the GDG Chair in drafting the section of the guideline relevant to the work of each topic group.

3.3.3. Service users

Individuals with direct experience of services gave an integral service-user focus to the GDG and the guideline. The GDG included a service user and representatives of a national service user group. They contributed as full GDG members to writing the review questions, providing advice on outcomes most relevant to service users, helping to ensure that the evidence addressed their views and preferences, highlighting sensitive issues and terminology relevant to the guideline, and bringing service user research to the attention of the GDG. In drafting the guideline, they reviewed the chapter on experience of care and identified recommendations from the service user perspective.

3.3.4. Special advisors

Special advisors, who had specific expertise in one or more aspects of treatment and management relevant to the guideline, assisted the GDG, commenting on specific aspects of the developing guideline and making presentations to the GDG. Appendix 3 lists those who agreed to act as special advisors.

3.3.5. National and international experts

National and international experts in the area under review were identified through the literature search and through the experience of the GDG members. These experts were contacted to identify unpublished or soon-to-be published studies, to ensure that up-to-date evidence was included in the development of the guideline. They informed the GDG about completed trials at the pre-publication stage, systematic reviews in the process of being published, studies relating to the cost effectiveness of treatment and trial data if the GDG could be provided with full access to the complete trial report. Appendix 5 lists researchers who were contacted.

3.4. REVIEW PROTOCOLS

Review questions drafted during the scoping phase were discussed by the GDG at the first few meetings and amended as necessary. The review questions were used as the starting point for developing review protocols for each systematic review (described in more detail below). Where appropriate, the review questions were refined once the evidence had been searched and, where necessary, sub-questions were generated. The final list of review questions can be found in Appendix 8.

For questions about interventions, the PICO (Population, Intervention, Comparison and Outcome) framework was used to structure each question (see Table 2).

Table 2

Features of a well-formulated question on the effectiveness of an intervention – PICO.

Questions relating to diagnosis or case identification do not involve an intervention designed to treat a particular condition, and therefore the PICO framework was not used. Rather, the questions were designed to pick up key issues specifically relevant to clinical utility, for example their accuracy, reliability, safety and acceptability to the service user.

Where review questions about service user experience were specified in the scope, the SPICE (Setting, Perspective, Intervention, Comparison and Evaluation) format was used to structure the questions (Table 3).

Table 3

Features of a well-formulated question about the experience of care (qualitative evidence) – SPICE.

For each topic, addressed by one or more review questions, a review protocol was drafted by the technical team and finalised by the GDG. All protocols are included in Appendix 9.

To help facilitate the literature review, a note was made of the best study design type to answer each question. There are four main types of review question of relevance to NICE guidelines. These are listed in Table 4. For each type of question, the best primary study design varies, where ‘best’ is interpreted as ‘least likely to give misleading answers to the question’. For questions about the effectiveness of interventions, where randomised controlled trials (RCTs) were not available, the review of other types of evidence was pursued only if there was reason to believe that it would help the GDG to formulate a recommendation.

Table 4

Best study design to answer each type of question.

However, in all cases, a well-conducted systematic review (of the appropriate type of study) is likely to yield a better answer than a single study.

3.5. CLINICAL REVIEW METHODS

The aim of the clinical literature review was to systematically identify and synthesise relevant evidence from the literature in order to answer the specific review questions developed by the GDG. Thus, clinical practice recommendations are evidence-based, where possible, and, if evidence is not available, informal consensus methods are used to try and reach general agreement between GDG members (see Section 3.5.7) and the need for future research is specified.

3.5.1. The search process

Scoping searches

A broad preliminary search of the literature was undertaken in March 2013 to obtain an overview of the issues likely to be covered by the scope, and to help define key areas. Searches were restricted to clinical guidelines, Health Technology Assessment (HTA) reports, key systematic reviews and RCTs. A list of databases and websites searched can be found in Appendix 10.

Systematic literature searches

After the scope was finalised, a systematic search strategy was developed to locate as much relevant evidence as possible. The balance between sensitivity (the power to identify all studies on a particular topic) and specificity (the ability to exclude irrelevant studies from the results) was carefully considered, and a decision made to utilise a broad approach to searching to maximise retrieval of evidence to all parts of the guideline. Searches were restricted to certain study designs if specified in the review protocol, and conducted in the following databases:

Cochrane Database of Abstracts of Reviews of Effects (DARE)
Cochrane Database of Systematic Reviews (CDSR)
Cochrane Database of RCTs and other controlled trials (CENTRAL)
Embase (Excerpta Medica Database)
Health Management Information Consortium (HMIC)
HTA database
Medical Literature Analysis and Retrieval System Online (MEDLINE/MEDLINE In-Process)
Psychological Information Database (PsycINFO).

The search strategies were initially developed for MEDLINE before being translated for use in other databases/interfaces. Strategies were built up through a number of trial searches and discussions of the results of the searches with the review team and GDG to ensure that all possible relevant search terms were covered. In order to assure comprehensive coverage, search terms for antenatal and postnatal mental health were kept purposely broad to help counter dissimilarities in database indexing practices and thesaurus terms, and imprecise reporting of study populations by authors in the titles and abstracts of records. The search terms for each search are set out in full in Appendix 10.

Reference management

Citations from each search were downloaded into reference management software and duplicates removed. Records were then screened against the eligibility criteria of the reviews before being appraised for methodological quality (see below). The unfiltered search results were saved and retained for future potential reanalysis to help keep the process both replicable and transparent.

Search filters

To aid retrieval of relevant and sound studies, filters were used to limit a number of searches to systematic reviews, randomized controlled trials, qualitative studies, surveys and observational studies. The search filters for systematic reviews and randomized controlled trials are adaptations of filters designed by McMaster University, Ontario, Canada. The qualitative study, surveys and observational study filter were developed in-house. Each filter comprises index terms relating to the study type(s) and associated text words for the methodological description of the design(s).

Date and language restrictions

Systematic database searches were initially conducted in April 2013 up to the most recent searchable date. Search updates were generated on a 6-monthly basis, with the final re-runs carried out in April 2014 ahead of the guideline consultation. After this point, studies were only included if they were judged by the GDG to be exceptional (for example, if the evidence was likely to change a recommendation).

Although no language restrictions were applied at the searching stage, foreign language papers were not requested or reviewed, unless they were of particular importance to a review question.

Date restrictions were not applied, except for update searches which were limited to the date of the last search conducted for NICE Clinical guideline 45. In addition searches for qualitative studies and surveys were limited to the last 15 years as service user's experiences of care pre-2000 were considered to be less relevant to the current clinical context.

Other search methods

Other search methods involved: (a) scanning the reference lists of all eligible publications (systematic reviews, stakeholder evidence and included studies) for more published reports and citations of unpublished research; (b) checking the tables of contents of key journals for studies that might have been missed by the database and reference list searches; (c) contacting included study authors for unpublished or incomplete datasets (see Appendix 5). Searches conducted for existing NICE guidelines were updated where necessary. Other relevant guidelines were assessed for quality using the AGREE instrument (AGREE Collaboration, 2003). The evidence base underlying high-quality existing guidelines was utilised and updated as appropriate.

Full details of the search strategies and filters used for the systematic review of clinical evidence are provided in Appendix 10.

Study selection and assessment of methodological quality

All primary-level studies included after the first scan of citations were acquired in full and re-evaluated for eligibility at the time they were being entered into the study information database. More specific eligibility criteria were developed for each review question and are described in the relevant clinical evidence chapters. Eligible systematic reviews and primary-level studies were critically appraised for methodological quality (risk of bias) using a checklist (see The Guidelines Manual [NICE, 2012a] for templates). The eligibility of each study was confirmed by at least one member of the GDG.

Unpublished evidence

Stakeholders were approached for unpublished evidence (see Appendix 4). The GDG used a number of criteria when deciding whether or not to accept unpublished data. First, the evidence must have been accompanied by a trial report containing sufficient detail to properly assess risk of bias. Second, the evidence must have been submitted with the understanding that data from the study and a summary of the study's characteristics would be published in the full guideline. Therefore, in most circumstances the GDG did not accept evidence submitted ‘in confidence’. However, the GDG recognised that unpublished evidence submitted by investigators might later be retracted by those investigators if the inclusion of such data would jeopardise publication of their research. Any unpublished data used in the guideline will be specifically highlighted as such.

3.5.2. Data extraction

Quantitative analysis

Study characteristics, aspects of methodological quality, and outcome data were extracted from all eligible studies, using Review Manager 5.2 (The Cochrane Collaboration, 2012) and Excel-based forms (see Appendix 12 for study characteristics tables).

In most circumstances, for a given outcome (continuous and dichotomous), where more than 50% of the number randomised to any group were missing or incomplete, the study results were excluded from the analysis (except for the outcome ‘leaving the study early’, in which case, the denominator was the number randomised). Where there were limited data for a particular review, the 50% rule was not applied. In these circumstances the evidence was downgraded (see section 3.5.4).

Where possible, outcome data from an intention-to-treat analysis (ITT) (that is, a ‘once-randomised-always-analyse’ basis) were used. Where ITT had not been used or there were missing data, the effect size for dichotomous outcomes were recalculated using best-case and worse-case scenarios. Where conclusions varied between scenarios, the evidence was downgraded (see section 3.5.4).

Consultation with another reviewer or members of the GDG was used to overcome difficulties with coding. Data from studies included in existing systematic reviews were extracted independently by one reviewer and cross-checked with the existing dataset. Where possible, two independent reviewers extracted data from new studies. Where double data extraction was not possible, data extracted by one reviewer was checked by the second reviewer. Disagreements were resolved through discussion. Where consensus could not be reached, a third reviewer or GDG members resolved the disagreement. Masked assessment (that is, blind to the journal from which the article comes, the authors, the institution and the magnitude of the effect) was not used since it is unclear that doing so reduces bias (Jadad et al., 1996; Berlin, 2001).

Qualitative analysis

After transcripts/reviews or primary studies of service user experience were identified (see 3.5.1), each was read and re-read and sections of the text were collected under different headings using an Excel-based form. Initially the text from the transcripts/reviews was organised using a matrix of service user experience (see Table 5).

Table 5

Matrix of service user experience.

The matrix was formed by creating a table with the eight dimensions of patient-centred care developed by the Picker Institute Europe², down the vertical axis, and the key points on a pathway of care (as specified by the GDG) across the horizontal axis. With regard to terminology, the GDG preferred the term ‘person-centred’ rather than ‘patient-centred’, therefore the former is used in the matrix. The Picker Institute's dimensions of patient-centred care were chosen because they are well established, comprehensive, and based on research. In addition, a variation of these dimensions has been adopted by the US Institute of Medicine (Institute of Medicine, 2001).

Under the broad headings in the matrix, specific emergent themes were identified and coded by two researchers working independently. Overlapping themes and themes with the highest frequency count across all testimonies were extracted and regrouped using the matrix. The findings from this qualitative analysis can be found in Chapter 8.

3.5.3. Evidence synthesis

The method used to synthesize evidence depended on the review question and availability and type of evidence (see Appendix 12 for full details). Briefly, for questions about test accuracy, bivariate test accuracy meta-analysis was conducted where appropriate. For questions about the effectiveness of interventions or harms associated with interventions, standard meta-analysis or network meta-analysis was used where appropriate, otherwise narrative methods were used with clinical advice from the GDG. In the absence of high-quality research, an informal consensus process was used (see Section 3.5.7).

3.5.4. Grading the quality of evidence

For questions about the effectiveness of interventions, the GRADE approach³ was used to grade the quality of evidence for each outcome (Guyatt et al., 2011). For questions about the experience of care, test accuracy, and harms associated with interventions (where case-control and cohort study designs were used) methodology checklists were used to assess the risk of bias, and this information was taken into account when interpreting the evidence. The technical team produced GRADE evidence profiles (see below) using GRADEprofiler (GRADEpro) software (Version 3.6), following advice set out in the GRADE handbook (Schünemann et al., 2009). All staff doing GRADE ratings were trained, and calibration exercises were used to improve reliability (Mustafa et al., 2013).

Evidence profiles

A GRADE evidence profile was used to summarise both the quality of the evidence and the results of the evidence synthesis for each ‘critical’ and ‘important’ outcome (see Table 6 for an example of an evidence profile). The GRADE approach is based on a sequential assessment of the quality of evidence, followed by judgment about the balance between desirable and undesirable effects, and subsequent decision about the strength of a recommendation.

Table 6

Example of GRADE evidence profile.

Within the GRADE approach to grading the quality of evidence, the following is used as a starting point:

RCTs without important limitations provide high quality evidence
- observational studies without special strengths or important limitations provide low quality evidence.

For each outcome, quality may be reduced depending on five factors: limitations, inconsistency, indirectness, imprecision and publication bias. For the purposes of the guideline, each factor was evaluated using criteria provided in Table 7.

Table 7

Factors that decrease quality of evidence.

For observational studies without any reasons for down-grading, the quality may be up-graded if there is a large effect, all plausible confounding would reduce the demonstrated effect (or increase the effect if no effect was observed), or there is evidence of a dose-response gradient (details would be provided under the ‘other’ column).

Each evidence profile includes a summary of findings: number of participants included in each group, an estimate of the magnitude of the effect, and the overall quality of the evidence for each outcome. Under the GRADE approach, the overall quality for each outcome is categorised into one of four groups (high, moderate, low, very low).

3.5.5. Presenting evidence to the Guideline Development Group

Study characteristics tables and, where appropriate, forest plots generated with Review Manager Version 5.2 and GRADE summary of findings tables (see below) were presented to the GDG.

Where meta-analysis was not appropriate and/or possible, the reported results from each primary-level study were reported in the study characteristics table and presented to the GDG. The range of effect estimates were included in the GRADE profile, and where appropriate, described narratively.

Summary of findings tables

Summary of findings tables generated from GRADEpro were used to summarise the evidence for each outcome and the quality of that evidence (Table 8). The tables provide illustrative comparative risks, especially useful when the baseline risk varies for different groups within the population.

Table 8

Example of a GRADE summary of findings table.

3.5.6. Extrapolation

When answering review questions, if there is no direct evidence from a primary dataset,⁴ based on the initial search for evidence, it may be appropriate to extrapolate from another data set. In this situation, the following principles were used to determine when to extrapolate:

a primary dataset is absent, of low quality or is judged to be not relevant to the review question under consideration, and
a review question is deemed by the GDG to be important, such that in the absence of direct evidence, other data sources should be considered, and
non-primary data source(s) is in the view of the GDG available, which may inform the review question.

When the decision to extrapolate was made, the following principles were used to inform the choice of the non-primary dataset:

the populations (usually in relation to the specified diagnosis or problem which characterises the population) under consideration share some common characteristic but differ in other ways, such as age, gender or in the nature of the disorder (for example, a common behavioural problem; acute versus chronic presentations of the same disorder) , and
the interventions under consideration in the view of the GDG have one or more of the following characteristics:
-
share a common mode of action (for example, the pharmacodynamics of drug; a common psychological model of change - operant conditioning)
-
be feasible to deliver in both populations (for example, in terms of the required skills or the demands of the health care system)
-
share common side effects/harms in both populations, and
the context or comparator involved in the evaluation of the different datasets shares some common elements which support extrapolation, and
the outcomes involved in the evaluation of the different datasets shares some common elements which support extrapolation (for example, improved mood or a reduction in challenging behaviour).

When the choice of the non-primary dataset was made, the following principles were used to guide the application of extrapolation:

the GDG should first consider the need for extrapolation through a review of the relevant primary dataset and be guided in these decisions by the principles for the use of extrapolation
in all areas of extrapolation datasets should be assessed against the principles for determining the choice of datasets. In general the criteria in the four principles set out above for determining the choice should be met
- in deciding on the use of extrapolation, the GDG will have to determine if the extrapolation can be held to be reasonable, including ensuring that:
  -
  the reasoning behind the decision can be justified by the clinical need for a recommendation to be made
  -
  the absence of other more direct evidence, and by the relevance of the potential dataset to the review question can be established
  -
  the reasoning and the method adopted is clearly set out in the relevant section of the guideline.

3.5.7. Method used to answer a review question in the absence of appropriately designed, high-quality research

In the absence of appropriately designed, high-quality research (including indirect evidence where it would be appropriate to use extrapolation), an informal consensus process was adopted.

The process involved a member of the GDG or review team drafting a statement about what is known about the issue based on expert opinion from existing narrative reviews. The statement was circulated to the GDG and used as the basis of a group discussion.

3.5.8. Key principles for recommendations

In reviewing the evidence for mental health problems in pregnancy and/or the postnatal period the GDG were guided by the principle that much of the assessment and treatment of mental health problems in pregnancy and the postnatal period is not different from that at other times of a woman's life, and so should be guided by relevant NICE guidelines for the specific mental health problem. However, new recommendations were developed where there was new evidence specifically for this guideline:

for an intervention that was specific to pregnancy or the postnatal period;
that an existing recommendation needed to be clarified or modified as a result of concerns about the health of the fetus or infant;
that changes are necessary to the context in which interventions are delivered;
- that specific variations are necessitated by changes in a woman's mental or physical health linked to pregnancy and the postnatal period.

3.6. HEALTH ECONOMICS METHODS

The aim of the health economics was to contribute to the guideline's development by providing evidence on the cost effectiveness of interventions for women who have, or are at risk of, mental health problems during pregnancy and the postnatal period covered in the guideline. This was achieved by:

systematic literature review of existing economic evidence
decision-analytic economic modelling.

Systematic reviews of economic literature were conducted in all areas covered in the guideline. Economic modelling was undertaken in areas with likely major resource implications, where the current extent of uncertainty over cost effectiveness was significant and economic analysis was expected to reduce this uncertainty, in accordance with The Guidelines Manual (NICE, 2012a). Prioritisation of areas for economic modelling was a joint decision between the Health Economist and the GDG. The rationale for prioritising review questions for economic modelling was set out in an economic plan agreed between NICE, the GDG, the Health Economist and the other members of the technical team. The following economic questions were selected as key issues that were addressed by economic modelling:

cost effectiveness of formal case identification tools for depression in the postnatal period
cost effectiveness of psychological and psychosocial interventions for the treatment of women with sub-threshold/mild to moderate depression in the postnatal period.

In addition, literature on the health-related quality of life (HRQoL) of women with mental health problems in pregnancy and postnatal period was systematically searched to identify studies reporting appropriate utility values that could be utilised in a cost-utility analysis.

The rest of this section describes the methods adopted in the systematic literature review of economic studies. Methods employed in economic modelling are described in the relevant economic sections of the evidence chapters.