PHQ-9 Depression Screening: How Doctors Actually Use the Score

Reviewed by Daniel Montville, MD, Psychiatrist

SiggyMD Clinical Team · Last updated June 17, 2026

Clinical & Research PHQ-9 Depression Screening Measurement-Based Care Depression Monitoring Clinical Assessment

Key Takeaways

The PHQ-9 is a validated nine-item self-report scale with 88% sensitivity and 88% specificity for major depression at a cutoff score of 10. It can screen for depression and grade symptom severity using the same nine items.
Scores of 5, 10, 15, and 20 represent the established thresholds for mild, moderate, moderately severe, and severe depression. A score below 5 is the clinical definition of remission for patients in treatment.
A five-point change between PHQ-9 administrations is the validated minimum for a clinically meaningful shift in symptom severity. A 50% reduction from baseline indicates treatment response.
Question 9, which asks about suicidal ideation, always requires direct clinical follow-up regardless of the total score.
Most patients with moderate-to-severe depression do not receive follow-up PHQ-9 assessments after starting treatment, representing a significant gap in measurement-based care in practice.

Most people who fill out the PHQ-9 think they are checking a box. Clinicians who use it well think of it as a longitudinal data point.

The nine questions take less than three minutes to answer. They can screen for depression, establish baseline severity, guide dosing decisions, and detect whether a treatment is working, all from the same tool. Whether that data gets used depends entirely on how the prescriber treats it. A PHQ-9 score filed in a chart once a year is a different clinical instrument than one tracked every two weeks across the course of treatment.

What This Page Covers

What the PHQ-9 measures and why it was designed this way
The scoring bands and what each one means clinically
How doctors use the PHQ-9 differently from screening to treatment monitoring
Why question 9 always requires separate clinical attention
What measurement-based care actually looks like in practice

Why the PHQ-9 Exists

Depression is common and measurable, but it is not always obvious in a clinic visit. A 2001 validation study published in the Journal of General Internal Medicine found that the PHQ-9 demonstrated 88% sensitivity and 88% specificity for major depression at a cutoff score of 10, making it one of the most precisely calibrated brief screening tools in mental health practice.

The PHQ-9 was developed as a dual-purpose instrument: it can both identify likely depressive diagnoses and grade symptom severity using the same nine items. Before it existed, standard depression assessment tools required a two-step process: a screener first, then a clinician-administered interview to establish severity. Converting that process into a single self-report scale dramatically increased adoption in primary care settings where time is the binding constraint.

Each of the nine questions maps directly to a diagnostic criterion for major depressive disorder, asking about frequency over the past two weeks across domains including depressed mood, anhedonia, sleep disturbance, fatigue, appetite changes, concentration, psychomotor changes, feelings of worthlessness, and suicidal ideation. The scale has been translated into more than 80 languages and validated across primary care, obstetrics, oncology, cardiology, and specialty mental health settings.

Understanding Your Score

Scores of 5, 10, 15, and 20 represent the clinically established thresholds for mild, moderate, moderately severe, and severe depression, respectively. The full scoring ranges are:

0 to 4: Minimal or no depression
5 to 9: Mild depression
10 to 14: Moderate depression
15 to 19: Moderately severe depression
20 to 27: Severe depression

A score of 10 or above is the threshold at which active treatment is typically indicated for most patients. A score below 5 is the clinical definition of remission when a patient is in active treatment.

These cutoffs are not rigid rules. Clinical judgment, the patient’s degree of functional impairment, and the trend over time all inform how a specific score gets interpreted and acted on.

From Score to Diagnosis: Why the PHQ-9 Is Not the Final Word

The PHQ-9 identifies patients who are likely to have major depression. It does not diagnose it.

A final diagnosis requires a comprehensive clinical evaluation that assesses whether the patient’s symptoms meet duration and functional impairment criteria, excludes other conditions that can cause depressive presentations (such as thyroid disorders, anemia, or bipolar disorder), and accounts for context that a self-report questionnaire cannot capture.

Research demonstrates that how the PHQ-9 is administered can influence scoring, and that the tool’s role in diagnosis is to provide an evidentiary basis for a clinical conversation, not to replace the clinical interview. A prescriber who sees a PHQ-9 score of 12 and engages the patient in a structured clinical interview is using the tool correctly. A prescriber who sees a score of 12 and writes a prescription without further assessment is not.

Question 9: The Item That Stands Apart

The ninth question asks how often the patient has been bothered by thoughts that they would be better off dead, or thoughts of hurting themselves. Even a response of “several days” triggers required clinical follow-up.

Any positive answer to question 9 must be followed with a direct clinical interview to assess suicide risk, intent, plan, and access to means. The PHQ-9 total score does not drive that decision. The content of the answer to question 9 does.

This item cannot be processed algorithmically. It requires a human prescriber who can hear what the patient is actually saying, assess the clinical context, and respond with appropriate safety planning or escalation. Every clinical platform that incorporates the PHQ-9 is responsible for having that human layer in place.

If you are experiencing thoughts of suicide or self-harm, contact your care team immediately or call 988. If you are in immediate danger, call 911.

How Doctors Use the PHQ-9 During Treatment

Screening is the beginning, not the end, of the PHQ-9’s clinical usefulness.

When a patient starts treatment, the initial PHQ-9 score establishes the baseline. Every subsequent score tells the prescriber whether the treatment plan is working. A five-point change between PHQ-9 administrations is the validated minimum for a clinically meaningful shift in symptom severity. A patient who starts at 18 and scores 13 four weeks later has shown meaningful improvement. A patient who starts at 18 and scores 17 has not responded to treatment.

This is where measurement-based care changes clinical outcomes. When PHQ-9 scores are tracked consistently and used to drive treatment decisions rather than to satisfy documentation requirements, prescribers can catch non-response early. An adequate antidepressant trial runs four to six weeks before a switch or augmentation decision should be made. But tracking data every two weeks reveals the trajectory much sooner.

A body of evidence supports measurement-based care as a practice that improves treatment outcomes, increases patient engagement, and reduces the gap between clinical research and real-world practice. Treatment response is defined as a 50% or greater reduction in PHQ-9 score from baseline. Remission is a score below 5. A partial response may warrant augmentation rather than switching antidepressants. Non-response after four to six weeks at therapeutic dose typically calls for a different medication or class.

Why Most PHQ-9 Data Goes Unused

Despite how well-designed the PHQ-9 is, a 2022 real-world study found that most patients with moderate-to-severe depression did not receive a follow-up PHQ-9 questionnaire after starting adjunctive therapy, suggesting incomplete monitoring of treatment response in clinical practice.

The gap is structural. In a care model organized around quarterly or biannual appointments, there are not enough encounters to administer the PHQ-9 with enough frequency to use it as a treatment-tracking tool. It becomes a snapshot rather than a trend. Snapshots are less useful than trends when the clinical question is whether a medication is working.

A care model with regular check-ins, whether weekly or bi-weekly, can use the PHQ-9 the way it was designed: to show whether the patient is improving, plateauing, or declining between appointments.

How SiggyMD Uses the PHQ-9

At SiggyMD, the PHQ-9 is integrated into the clinical model from first contact through ongoing treatment.

Every intake includes standardized depression severity assessment. The licensed prescriber reviews the results before approving any treatment plan. During ongoing treatment, regular structured check-ins capture how symptoms are changing over time and surface that data in the prescriber’s dashboard.

That means a prescriber reviewing a patient’s case sees a trend, not just a single score. They can see whether PHQ-9 scores are declining steadily, plateauing, or worsening, and make informed decisions about whether to stay the course, adjust dosing, or escalate.

“A PHQ-9 score at a quarterly appointment tells me where someone is today,” says Daniel Montville, MD, Psychiatrist, of the SiggyMD clinical team. “A PHQ-9 tracked regularly tells me where they are going. Those are completely different levels of clinical information. The trend is what actually guides my prescribing decisions.”

Question 9 is handled with the same clinical seriousness in a digital setting as in a traditional one. Any indication of suicidal ideation escalates immediately to the in-house prescriber for direct clinical assessment.

What Members Are Saying

M.V., 27

Major Depressive Disorder

“I had filled out the PHQ-9 dozens of times at different clinics. No one ever showed me what my number meant or how it was changing over time. When my prescriber shared my trend with me, I realized for the first time how far I had come. The data made the progress real in a way that my own sense of how I was feeling could not.”

A.L., 52

Depression and Anxiety

“My score was 19 when I started. No one told me what that meant clinically. Six weeks in, it was down to 11. The prescriber explained that we were tracking toward remission, defined as a score below 5. Having a goal with a number attached to it changed how I engaged with the treatment.”

Member stories reflect real experiences. Names and identifying details have been changed to protect privacy. Results vary. SiggyMD is currently invite-only.

Bottom Line

The PHQ-9 is not a form. It is a clinical instrument with a validated scoring system, clear response thresholds, and strong evidence for improving treatment outcomes when used to drive decisions rather than satisfy documentation.

The gap between how well the PHQ-9 can work and how it is typically used in practice is largely structural: not enough follow-up encounters to track the trend the way the tool was designed to be tracked. A model built around regular structured check-ins changes that equation.

If you have been screened for depression and are not sure what your score means or what happens next, those questions have clinical answers. Read more about how medication tracking connects to your treatment plan or start your anonymous intake with SiggyMD to receive a clinician-reviewed plan that tracks your progress from day one.

Sources

Kroenke K, Spitzer RL, Williams JBW. The PHQ-9: Validity of a Brief Depression Severity Measure. Journal of General Internal Medicine. 2001;16(9):606-613.
Coley RY, Boggs JM, Beck A. Defining Success in Measurement-Based Care for Depression: A Comparison of Common Metrics. Psychiatric Services. 2020;71(4):312-318.
Kroenke K, Spitzer RL. Enhancing the Clinical Utility of Depression Screening. CMAJ. 2012.
Forma F, Liberman JN, Rui P. Measuring Response to Adjunctive Therapy Among Individuals with Major Depressive Disorder. Neuropsychiatric Disease and Treatment. 2022;18:2467-2475.
Antaki C, et al. Use of the Patient Health Questionnaire (PHQ-9) in Practice: Interactions between Patients and Physicians. Social Science and Medicine. 2021.
AACAP. PHQ-9 Modified for Teens: GLAD-PC Scoring Guide. 2010.

Frequently Asked Questions

What is a normal PHQ-9 score?

A score of 0 to 4 indicates minimal or no depression. Scores of 5 to 9 indicate mild depression, 10 to 14 moderate depression, 15 to 19 moderately severe depression, and 20 to 27 severe depression. A score below 5 is considered remission for patients currently in treatment for depression.

What PHQ-9 score indicates major depression?

A score of 10 or above is the validated cutoff for a positive screen for major depression, with 88% sensitivity and 88% specificity. However, the PHQ-9 identifies patients who may have major depression; it does not diagnose it. A formal diagnosis requires a comprehensive clinical evaluation to confirm symptom duration, functional impairment, and rule out other conditions.

How often should the PHQ-9 be administered during treatment?

For patients in active treatment, every 2 to 4 weeks during the acute phase is clinically appropriate. This frequency allows prescribers to detect non-response early and make treatment adjustments before more time is lost. For patients in remission or on maintenance therapy, every 3 to 6 months may be sufficient to monitor for relapse.

What does a 5-point change on the PHQ-9 mean?

A 5-point change is the minimum clinically meaningful shift. A patient who moves from 18 to 13 has shown meaningful improvement. A patient who moves from 18 to 17 has not. A 50% reduction from baseline, for example from 18 to 9, is defined as treatment response. Clinicians use these thresholds to decide whether to continue the current plan, increase the dose, switch medications, or add an augmentation agent.

Is the PHQ-9 the same as a depression diagnosis?

No. The PHQ-9 is a screening and severity measurement tool. It identifies patients likely to have depression and quantifies symptom burden. A formal diagnosis of major depressive disorder requires a comprehensive clinical evaluation that assesses duration and functional impairment criteria, rules out other conditions, and involves a qualified clinician's judgment. A PHQ-9 score initiates the clinical conversation; it does not end it.

What happens when question 9 is answered positively?

Any positive answer to question 9, which asks about thoughts of being better off dead or of hurting oneself, requires immediate clinical follow-up regardless of the total score. The prescriber assesses suicidal ideation, intent, plan, and access to means, and takes appropriate action including safety planning or escalation. This step cannot be automated. It requires a human clinician.

Mental healthcare should stay with you between appointments.

SiggyMD combines daily check-ins with clinician-supervised care so your treatment plan can respond to what is actually happening.

SiggyMD is currently invite-only. A real doctor reviews every clinical decision. HIPAA-compliant.

Join the SiggyMD Waitlist