Bipolar Assessment and Questionnaires. Screening, Diagnosis and Outcome Monitoring. 

By Dr Ben Buchanan, Clinical Psychologist & Co-founder, NovoPsych

Bipolar disorder is among the most frequently misdiagnosed conditions in mental health practice. Approximately one in five adults presenting to mental health services with depressive symptoms in fact have bipolar or a bipolar-related disorder (Mak, 2009; Olfson et al., 2005). Despite this prevalence, the condition routinely escapes timely identification: a landmark meta-analysis reported an average gap of approximately six years between bipolar symptom onset and correct diagnosis and management (Dagani et al., 2017). This diagnostic delay carries substantial human cost — prolonged exposure to ineffective treatments, elevated relapse risk, occupational and relational disruption, and increased suicidality (Scott & Leboyer, 2011).

A core driver of this delay is clinical presentation. Because depressive episodes predominate in bipolar disorder — and because clients rarely spontaneously report prior hypomanic or manic episodes — practitioners working primarily in depressive presentations often lack the structured tools needed to systematically probe the broader mood history. In primary care and general mental health settings in particular, where assessment time is constrained, a positive screen for depression can inadvertently close off further inquiry into mania or hypomania (Hirschfeld et al., 2003).

This article introduces a comprehensive, psychometrically grounded framework for bipolar disorder assessment and monitoring, centred on three validated instruments available through NovoPsych:


Why Systematic Screening and Routine Outcome Monitoring Matter in Bipolar Disorder

The Epidemiological and Clinical Imperative

Bipolar disorder affects approximately 2–4% of the general population when bipolar spectrum conditions are included (Merikangas et al., 2011). Its heterogeneous presentation — spanning Bipolar I, Bipolar II, cyclothymia, and subsyndromal mood instability — means that it frequently masquerades as unipolar depression, borderline personality disorder, ADHD, or anxiety disorders (Hirschfeld et al., 2003). The consequences of misclassification are clinically significant: antidepressant monotherapy administered in the context of undetected bipolarity can precipitate manic switch, increase mood cycling, and worsen long-term prognosis (Pacchiarotti et al., 2013).

Residual symptoms between full mood episodes are the rule rather than the exception. Even during apparent euthymia, sub-threshold depressive and hypomanic symptoms predict shortened time to episode recurrence, impaired psychosocial functioning, and reduced quality of life (Judd et al., 2002). This means that accurate assessment must extend beyond episode identification — it must track symptom trajectories continuously over time.

The Evidence Base for Routine Outcome Monitoring

Measurement-based care (MBC), also termed routine outcome monitoring (ROM), is defined as the systematic collection of patient-reported symptom data throughout treatment, with results integrated into clinical decision-making (Scott et al., 2012; Kilbourne et al., 2018). ROM is now recognised as a core competency in evidence-based psychological practice, and its application to bipolar disorder is particularly compelling.

A growing body of evidence demonstrates that enhanced treatment programs incorporating symptom measurement for bipolar disorder yield better outcomes compared to treatment as usual (Kilbourne et al., 2018). Critically, in bipolar disorder, detecting early shifts in clinical status — including subsyndromal manic or depressive symptoms — is associated with faster clinical intervention and reduced risk of full episode relapse (Bauer et al., 2006). Patient preference data are also instructive: in a mixed-methods study, the paired PMQ-9 and PHQ-9 combination emerged as the most preferred monitoring format, with participants citing format clarity, ease of completion, and clinical relevance as key strengths (Scott et al., 2023).

NovoPsych’s remote monitoring functionality and outcome monitoring assessment library directly support this approach, enabling practitioners to schedule repeated administrations, automatically score results, and visualise longitudinal trajectories — all from within a single platform.

A Staged Assessment Model

Effective bipolar disorder assessment proceeds through three conceptually distinct stages:

  1. Screening — identifying individuals at elevated risk who warrant further assessment, particularly in primary care and general mental health populations where bipolar disorder is underdetected
  2. Diagnostic formulation — characterising the longitudinal pattern, severity, and subtype of mood symptomatology to support differential diagnosis and case conceptualisation
  3. Routine outcome monitoring — tracking manic and depressive symptom trajectories across the treatment journey to guide clinical decision-making and detect early signs of relapse

Routine Outcome Monitoring: The Paired PMQ-9 & PHQ-9

Instrument Overview

The Patient Mania Questionnaire-9 (PMQ-9) is a nine-item self-report measure assessing the frequency of manic symptoms over the past week in adults aged 18+ (Guo et al., 2021). It was intentionally designed to mirror the structure, format, and 4-point Likert response scale of the Patient Health Questionnaire-9 (PHQ-9), creating a structurally parallel pair that can be administered together as a unified bipolar symptom monitoring battery.

Both measures yield total scores on a 0–27 scale, share a clinical cut-off of 10, and can be displayed side-by-side. This allows clinicians to compare manic and depressive symptom burden directly — critical given the polarity-shifting presentations common in bipolar disorder. A distinctive feature of the PMQ-9 is its sensitivity to subthreshold manic symptoms that fall below the threshold for a full manic or hypomanic episode — making it particularly suited to ongoing monitoring where early detection of emerging mood elevation is clinically valuable.

NovoPsych Report: PMQ-9 and PHQ-9

When administered through NovoPsych, the paired assessment produces an automatically scored and visualised report. On first administration, a paired bar chart displays both total scores side-by-side with a dashed reference line at the cut-off of 10, colour-coded severity bands, and validated PHQ-9 severity descriptors.

Client scores are contextualised against normative distributions — a clinical bipolar disorder reference sample for the PMQ-9, and both community and clinical (major depressive disorder) samples for the PHQ-9.

When administred two or more times, NovoPsych will graph symptoms over time, sometimes showing a clear depressive and manic cycle. 

→ Learn more about the PMQ-9 and PHQ-9 on NovoPsych


Initial Screening: The Mood Disorder Questionnaire (MDQ)

The Mood Disorder Questionnaire (MDQ) is a brief, widely used 15-item self-report screening instrument for bipolar disorder in adults (Hirschfeld et al., 2000). It assesses the lifetime history of manic and hypomanic symptoms based on DSM criteria, along with symptom clustering and functional impairment. This three-criterion positive screen design substantially improves specificity over symptom count alone.

The MDQ is particularly well-suited to general mental health and primary care settings where bipolar disorder is most likely to go undetected — especially among clients presenting primarily with depressive symptoms.

NovoPsych Report: MDQ

The NovoPsych MDQ report presents total score against the screening threshold with automatic flagging of whether the three-criterion positive screen is met.

→ Learn more about the MDQ on NovoPsych


Comprehensive Formulation: The General Behaviour Inventory-Revised (GBI-R)

Instrument Overview

The General Behaviour Inventory-Revised (GBI-R) is a 73-item self-report measure assessing the frequency and pattern of mood symptoms across depressive and hypomanic/biphasic episodes across the lifespan, suitable for adolescents (13+) and adults (Klein et al., 1986; Pendergast et al., 2014). It measures the frequency, intensity, and duration of experiences associated with bipolar disorder Type I and II, as well as cyclothymia.

Unlike the MDQ, the GBI-R provides a comprehensive longitudinal mood history — particularly valuable at the formulation stage for understanding a client’s broader mood history, identifying cyclothymic or biphasic patterns, and distinguishing bipolar spectrum conditions from unipolar depression.

NovoPsych Report: GBI-R

The NovoPsych GBI-R report presents total and subscale scores with colour-coded risk descriptors and percentile information benchmarked against normative community and clinical bipolar disorder distributions (Bullock et al., 2011; Pendergast et al., 2014).

The report also includes horizontal comparison charts positioning each score against the IQR distributions of both a community normative sample and a diagnosed bipolar disorder clinical sample.

→ Learn more about the GBI-R on NovoPsych


An Integrated Assessment Pathway for Bipolar Disorder

These three instruments are designed to be used in sequence, each contributing distinct information at different clinical stages:

  1. Initial Screening (MDQ) — Administer as a brief first-pass screen for clients presenting with depressive symptoms or complex emotional presentations. A positive screen indicates the need for more thorough bipolar assessment and should prompt consideration of mood stabiliser consultation.
  2. Formulation and Differential Diagnosis (GBI-R) — Following a positive screen, administer the GBI-R to obtain a comprehensive longitudinal mood history. Its subscale structure, normative comparisons, and high specificity support differential diagnosis — particularly in disentangling bipolar disorder from unipolar depression, BPD, or ADHD.
  3. Routine Outcome Monitoring (PMQ-9 & PHQ-9) — Once a bipolar diagnosis or working formulation is established, transition to regular paired PMQ-9 and PHQ-9 administrations via NovoPsych’s remote monitoring functionality for real-time tracking of both mood poles across the care journey.

This sequential pathway operationalises the evidence-based recommendation that measurement-based care should be embedded systematically across the treatment lifecycle. The NovoPsych Assessment Co-Pilot can assist in identifying the most appropriate tools from the platform’s library of over 190 validated measures.

The six-year diagnostic gap in bipolar disorder represents a preventable clinical failure — one that systematic screening and routine outcome monitoring can meaningfully address. By integrating the MDQ for initial screening, the GBI-R for comprehensive formulation, and the paired PMQ-9 and PHQ-9 for ongoing measurement-based monitoring, clinicians are equipped to detect bipolar disorder earlier, characterise its presentation more accurately, and track symptom trajectories with the precision required for safe, responsive management.

All three instruments are available through NovoPsych, which automatically scores, visualizes, and stores results. As with all psychometric instruments, self-report measures should be interpreted as one component of a comprehensive clinical assessment and not as a substitute for diagnostic interview or clinical judgement.


References

Bauer, M., Grof, P., Müller-Oerlinghausen, B., Rasgon, N., Glenn, T., & Whybrow, P. C. (2006). Temporal relation between sleep and mood in patients with bipolar disorder. Bipolar Disorders, 8(2), 160–167.

Bullock, B., Murray, G., & Judd, F. (2011). Validation of the short General Behaviour Inventory. Bipolar Disorders, 13(4), 410–421.

Carpenter, J. K., Price, E. C., Tully, E. C., & Nolan-Hoeksema, S. (2020). Factor structure of the Mood Disorder Questionnaire. Psychological Assessment, 32(5), 478–488.

Dagani, J., Signorini, G., Nielssen, O., Bani, M., Pastore, A., de Girolamo, G., & Large, M. (2017). Meta-analysis of the interval between the onset and management of bipolar disorder. The Canadian Journal of Psychiatry, 62(4), 247–258.

Guo, N., Robison, J., Bhatt, M., Blunt, H., Burdick, K. E., Chen, P., Delbello, M., Frye, M., Schneck, C., & Sylvia, L. G. (2021). Patient-Reported Outcome Measures of Manic Symptoms for Bipolar Disorder in Clinical Care. Psychiatric Services, 72(10), 1200–1204.

Hirschfeld, R. M. A., Williams, J. B. W., Spitzer, R. L., Calabrese, J. R., Flynn, L., Keck, P. E., Lewis, L., McElroy, S. L., Post, R. M., Rapport, D. J., Russell, J. M., Sachs, G. S., & Zajecka, J. (2000). Development and validation of a screening instrument for bipolar spectrum disorder: The Mood Disorder Questionnaire. American Journal of Psychiatry, 157(11), 1873–1875.

Hirschfeld, R. M. A., Calabrese, J. R., Weissman, M. M., Reed, M., Davies, M. A., Frye, M. A., Keck, P. E., Lewis, L., McElroy, S. L., McNulty, J. P., & Wagner, K. D. (2003). Screening for bipolar disorder in the community. Journal of Clinical Psychiatry, 64(1), 53–59.

Judd, L. L., Akiskal, H. S., Schettler, P. J., Endicott, J., Maser, J., Solomon, D. A., Leon, A. C., Rice, J. A., & Keller, M. B. (2002). The long-term natural history of the weekly symptomatic status of bipolar I disorder. Archives of General Psychiatry, 59(6), 530–537.

Kilbourne, A. M., Beck, K., Spaeth-Rublee, B., Ramanuj, P., O’Brien, R. W., Tomoyasu, N., & Pincus, H. A. (2018). Measuring and improving the quality of mental health care: a global perspective. World Psychiatry, 17(1), 30–38.

Klein, D. N., Depue, R. A., & Slater, J. F. (1986). Inventory identification of cyclothymia IX. Archives of General Psychiatry, 43(5), 441–445.

Kroenke, K., Spitzer, R. L., & Williams, J. B. W. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606–613.

Mak, A. D. P. (2009). Prevalence and correlates of bipolar II disorder in major depressive patients. Journal of Affective Disorders, 112(1–3), 201–205.

Merikangas, K. R., Jin, R., He, J. P., Kessler, R. C., Lee, S., & Sampson, N. A. (2011). Prevalence and correlates of bipolar spectrum disorder in the world mental health survey initiative. Archives of General Psychiatry, 68(3), 241–251.

Olfson, M., Das, A. K., Gameroff, M. J., Pilowsky, D., Feder, A., Gross, R., Lantigua, R., Shea, S., & Weissman, M. M. (2005). Bipolar depression in a low-income primary care clinic. American Journal of Psychiatry, 162(11), 2146–2151.

Pacchiarotti, I., Bond, D. J., Baldessarini, R. J., Nolen, W. A., Grunze, H., & Vieta, E. (2013). The ISBD task force report on antidepressant use in bipolar disorders. American Journal of Psychiatry, 170(11), 1249–1262.

Pendergast, L. L., Youngstrom, E. A., Brown, C., Jensen, D., Abramson, L. Y., & Alloy, L. B. (2014). Predicting diagnostic status from General Behavior Inventory scores. Psychological Assessment, 26(4), 1130–1140.

Schiweck, C., Arteaga-Henriquez, G., Aichholzer, M., & Bitter, I. (2021). Comorbidity of ADHD and adult bipolar disorder: A systematic review and meta-analysis. Neuroscience & Biobehavioral Reviews, 124, 100–123.

Scott, J., & Leboyer, M. (2011). Consequences of delayed diagnosis of bipolar disorders. L’Encéphale, 37(Suppl. 3), S173–S175.

Scott, K., Lewis, C. C., & Boyd, A. (2023). Bipolar disorder symptom monitoring measures: A mixed-methods study of patient preferences. Bipolar Disorders. https://doi.org/10.1016/j.bion.2023.100063