By Dr Ben Buchanan, Clinical Psychologist & Co-founder, NovoPsych
Bipolar disorder is among the most frequently misdiagnosed conditions in mental health practice. Approximately one in five adults presenting to mental health services with depressive symptoms in fact have bipolar or a bipolar-related disorder (Mak, 2009; Olfson et al., 2005). Despite this prevalence, the condition routinely escapes timely identification: a landmark meta-analysis reported an average gap of approximately six years between bipolar symptom onset and correct diagnosis and management (Dagani et al., 2017). This diagnostic delay carries substantial human cost — prolonged exposure to ineffective treatments, elevated relapse risk, occupational and relational disruption, and increased suicidality (Scott & Leboyer, 2011).
A core driver of this delay is clinical presentation. Because depressive episodes predominate in bipolar disorder — and because clients rarely spontaneously report prior hypomanic or manic episodes — practitioners working primarily in depressive presentations often lack the structured tools needed to systematically probe the broader mood history. In primary care and general mental health settings in particular, where assessment time is constrained, a positive screen for depression can inadvertently close off further inquiry into mania or hypomania (Hirschfeld et al., 2003).
This article introduces a comprehensive, psychometrically grounded framework for bipolar disorder assessment and monitoring, centred on three validated instruments available through NovoPsych:
Bipolar disorder affects approximately 2–4% of the general population when bipolar spectrum conditions are included (Merikangas et al., 2011). Its heterogeneous presentation — spanning Bipolar I, Bipolar II, cyclothymia, and subsyndromal mood instability — means that it frequently masquerades as unipolar depression, borderline personality disorder, ADHD, or anxiety disorders (Hirschfeld et al., 2003). The consequences of misclassification are clinically significant: antidepressant monotherapy administered in the context of undetected bipolarity can precipitate manic switch, increase mood cycling, and worsen long-term prognosis (Pacchiarotti et al., 2013).
Residual symptoms between full mood episodes are the rule rather than the exception. Even during apparent euthymia, sub-threshold depressive and hypomanic symptoms predict shortened time to episode recurrence, impaired psychosocial functioning, and reduced quality of life (Judd et al., 2002). This means that accurate assessment must extend beyond episode identification — it must track symptom trajectories continuously over time.
Measurement-based care (MBC), also termed routine outcome monitoring (ROM), is defined as the systematic collection of patient-reported symptom data throughout treatment, with results integrated into clinical decision-making (Scott et al., 2012; Kilbourne et al., 2018). ROM is now recognised as a core competency in evidence-based psychological practice, and its application to bipolar disorder is particularly compelling.
A growing body of evidence demonstrates that enhanced treatment programs incorporating symptom measurement for bipolar disorder yield better outcomes compared to treatment as usual (Kilbourne et al., 2018). Critically, in bipolar disorder, detecting early shifts in clinical status — including subsyndromal manic or depressive symptoms — is associated with faster clinical intervention and reduced risk of full episode relapse (Bauer et al., 2006). Patient preference data are also instructive: in a mixed-methods study, the paired PMQ-9 and PHQ-9 combination emerged as the most preferred monitoring format, with participants citing format clarity, ease of completion, and clinical relevance as key strengths (Scott et al., 2023).
NovoPsych’s remote monitoring functionality and outcome monitoring assessment library directly support this approach, enabling practitioners to schedule repeated administrations, automatically score results, and visualise longitudinal trajectories — all from within a single platform.
Effective bipolar disorder assessment proceeds through three conceptually distinct stages:
The Patient Mania Questionnaire-9 (PMQ-9) is a nine-item self-report measure assessing the frequency of manic symptoms over the past week in adults aged 18+ (Guo et al., 2021). It was intentionally designed to mirror the structure, format, and 4-point Likert response scale of the Patient Health Questionnaire-9 (PHQ-9), creating a structurally parallel pair that can be administered together as a unified bipolar symptom monitoring battery.
Both measures yield total scores on a 0–27 scale, share a clinical cut-off of 10, and can be displayed side-by-side. This allows clinicians to compare manic and depressive symptom burden directly — critical given the polarity-shifting presentations common in bipolar disorder. A distinctive feature of the PMQ-9 is its sensitivity to subthreshold manic symptoms that fall below the threshold for a full manic or hypomanic episode — making it particularly suited to ongoing monitoring where early detection of emerging mood elevation is clinically valuable.
When administered through NovoPsych, the paired assessment produces an automatically scored and visualised report. On first administration, a paired bar chart displays both total scores side-by-side with a dashed reference line at the cut-off of 10, colour-coded severity bands, and validated PHQ-9 severity descriptors.
Client scores are contextualised against normative distributions — a clinical bipolar disorder reference sample for the PMQ-9, and both community and clinical (major depressive disorder) samples for the PHQ-9.
When administred two or more times, NovoPsych will graph symptoms over time, sometimes showing a clear depressive and manic cycle.
→ Learn more about the PMQ-9 and PHQ-9 on NovoPsych
The Mood Disorder Questionnaire (MDQ) is a brief, widely used 15-item self-report screening instrument for bipolar disorder in adults (Hirschfeld et al., 2000). It assesses the lifetime history of manic and hypomanic symptoms based on DSM criteria, along with symptom clustering and functional impairment. This three-criterion positive screen design substantially improves specificity over symptom count alone.
The MDQ is particularly well-suited to general mental health and primary care settings where bipolar disorder is most likely to go undetected — especially among clients presenting primarily with depressive symptoms.
The NovoPsych MDQ report presents total score against the screening threshold with automatic flagging of whether the three-criterion positive screen is met.
→ Learn more about the MDQ on NovoPsych
The General Behaviour Inventory-Revised (GBI-R) is a 73-item self-report measure assessing the frequency and pattern of mood symptoms across depressive and hypomanic/biphasic episodes across the lifespan, suitable for adolescents (13+) and adults (Klein et al., 1986; Pendergast et al., 2014). It measures the frequency, intensity, and duration of experiences associated with bipolar disorder Type I and II, as well as cyclothymia.
Unlike the MDQ, the GBI-R provides a comprehensive longitudinal mood history — particularly valuable at the formulation stage for understanding a client’s broader mood history, identifying cyclothymic or biphasic patterns, and distinguishing bipolar spectrum conditions from unipolar depression.
The NovoPsych GBI-R report presents total and subscale scores with colour-coded risk descriptors and percentile information benchmarked against normative community and clinical bipolar disorder distributions (Bullock et al., 2011; Pendergast et al., 2014).
The report also includes horizontal comparison charts positioning each score against the IQR distributions of both a community normative sample and a diagnosed bipolar disorder clinical sample.
→ Learn more about the GBI-R on NovoPsych
These three instruments are designed to be used in sequence, each contributing distinct information at different clinical stages:
This sequential pathway operationalises the evidence-based recommendation that measurement-based care should be embedded systematically across the treatment lifecycle. The NovoPsych Assessment Co-Pilot can assist in identifying the most appropriate tools from the platform’s library of over 190 validated measures.
The six-year diagnostic gap in bipolar disorder represents a preventable clinical failure — one that systematic screening and routine outcome monitoring can meaningfully address. By integrating the MDQ for initial screening, the GBI-R for comprehensive formulation, and the paired PMQ-9 and PHQ-9 for ongoing measurement-based monitoring, clinicians are equipped to detect bipolar disorder earlier, characterise its presentation more accurately, and track symptom trajectories with the precision required for safe, responsive management.
All three instruments are available through NovoPsych, which automatically scores, visualizes, and stores results. As with all psychometric instruments, self-report measures should be interpreted as one component of a comprehensive clinical assessment and not as a substitute for diagnostic interview or clinical judgement.
Bauer, M., Grof, P., Müller-Oerlinghausen, B., Rasgon, N., Glenn, T., & Whybrow, P. C. (2006). Temporal relation between sleep and mood in patients with bipolar disorder. Bipolar Disorders, 8(2), 160–167.
Bullock, B., Murray, G., & Judd, F. (2011). Validation of the short General Behaviour Inventory. Bipolar Disorders, 13(4), 410–421.
Carpenter, J. K., Price, E. C., Tully, E. C., & Nolan-Hoeksema, S. (2020). Factor structure of the Mood Disorder Questionnaire. Psychological Assessment, 32(5), 478–488.
Dagani, J., Signorini, G., Nielssen, O., Bani, M., Pastore, A., de Girolamo, G., & Large, M. (2017). Meta-analysis of the interval between the onset and management of bipolar disorder. The Canadian Journal of Psychiatry, 62(4), 247–258.
Guo, N., Robison, J., Bhatt, M., Blunt, H., Burdick, K. E., Chen, P., Delbello, M., Frye, M., Schneck, C., & Sylvia, L. G. (2021). Patient-Reported Outcome Measures of Manic Symptoms for Bipolar Disorder in Clinical Care. Psychiatric Services, 72(10), 1200–1204.
Hirschfeld, R. M. A., Williams, J. B. W., Spitzer, R. L., Calabrese, J. R., Flynn, L., Keck, P. E., Lewis, L., McElroy, S. L., Post, R. M., Rapport, D. J., Russell, J. M., Sachs, G. S., & Zajecka, J. (2000). Development and validation of a screening instrument for bipolar spectrum disorder: The Mood Disorder Questionnaire. American Journal of Psychiatry, 157(11), 1873–1875.
Hirschfeld, R. M. A., Calabrese, J. R., Weissman, M. M., Reed, M., Davies, M. A., Frye, M. A., Keck, P. E., Lewis, L., McElroy, S. L., McNulty, J. P., & Wagner, K. D. (2003). Screening for bipolar disorder in the community. Journal of Clinical Psychiatry, 64(1), 53–59.
Judd, L. L., Akiskal, H. S., Schettler, P. J., Endicott, J., Maser, J., Solomon, D. A., Leon, A. C., Rice, J. A., & Keller, M. B. (2002). The long-term natural history of the weekly symptomatic status of bipolar I disorder. Archives of General Psychiatry, 59(6), 530–537.
Kilbourne, A. M., Beck, K., Spaeth-Rublee, B., Ramanuj, P., O’Brien, R. W., Tomoyasu, N., & Pincus, H. A. (2018). Measuring and improving the quality of mental health care: a global perspective. World Psychiatry, 17(1), 30–38.
Klein, D. N., Depue, R. A., & Slater, J. F. (1986). Inventory identification of cyclothymia IX. Archives of General Psychiatry, 43(5), 441–445.
Kroenke, K., Spitzer, R. L., & Williams, J. B. W. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606–613.
Mak, A. D. P. (2009). Prevalence and correlates of bipolar II disorder in major depressive patients. Journal of Affective Disorders, 112(1–3), 201–205.
Merikangas, K. R., Jin, R., He, J. P., Kessler, R. C., Lee, S., & Sampson, N. A. (2011). Prevalence and correlates of bipolar spectrum disorder in the world mental health survey initiative. Archives of General Psychiatry, 68(3), 241–251.
Olfson, M., Das, A. K., Gameroff, M. J., Pilowsky, D., Feder, A., Gross, R., Lantigua, R., Shea, S., & Weissman, M. M. (2005). Bipolar depression in a low-income primary care clinic. American Journal of Psychiatry, 162(11), 2146–2151.
Pacchiarotti, I., Bond, D. J., Baldessarini, R. J., Nolen, W. A., Grunze, H., & Vieta, E. (2013). The ISBD task force report on antidepressant use in bipolar disorders. American Journal of Psychiatry, 170(11), 1249–1262.
Pendergast, L. L., Youngstrom, E. A., Brown, C., Jensen, D., Abramson, L. Y., & Alloy, L. B. (2014). Predicting diagnostic status from General Behavior Inventory scores. Psychological Assessment, 26(4), 1130–1140.
Schiweck, C., Arteaga-Henriquez, G., Aichholzer, M., & Bitter, I. (2021). Comorbidity of ADHD and adult bipolar disorder: A systematic review and meta-analysis. Neuroscience & Biobehavioral Reviews, 124, 100–123.
Scott, J., & Leboyer, M. (2011). Consequences of delayed diagnosis of bipolar disorders. L’Encéphale, 37(Suppl. 3), S173–S175.
Scott, K., Lewis, C. C., & Boyd, A. (2023). Bipolar disorder symptom monitoring measures: A mixed-methods study of patient preferences. Bipolar Disorders. https://doi.org/10.1016/j.bion.2023.100063