Outcome Measures Documentation: PHQ-9, GAD-7, PCL-5 & More
What Are Outcome Measures?
Outcome measures are standardized, validated instruments that quantify a client's symptom severity, functional status, or therapeutic progress at a given point in time. In mental health practice, the most commonly used outcome measures include the PHQ-9 (depression), GAD-7 (anxiety), PCL-5 (PTSD), ORS/SRS (general functioning and therapeutic alliance), PHQ-A (adolescent depression), and the Edinburgh Postnatal Depression Scale (perinatal depression).
Measurement-based care — the systematic use of outcome measures to track progress, guide treatment decisions, and detect deterioration — is supported by substantial evidence. Research demonstrates that therapists who use routine outcome monitoring achieve better client outcomes, identify clients at risk of deterioration earlier, and make more responsive treatment adjustments. Despite this evidence, studies consistently find that fewer than 20% of practicing therapists use outcome measures routinely.
Documenting outcome measures in clinical records serves multiple functions: it provides objective evidence of treatment progress for insurance and utilization review, establishes a quantitative baseline against which change is measured, supports clinical decision-making, and protects the clinician by demonstrating evidence-based practice.
When You Need It
- At intake to establish baseline symptom severity and guide diagnosis
- At regular intervals during treatment to track symptom trajectory (every session or every 2-4 sessions)
- Before and after specific treatment phases (e.g., before and after completing an exposure hierarchy)
- When making treatment decisions such as changing modality, adjusting frequency, or recommending medication evaluation
- When a client reports subjective worsening or improvement — to quantify the change
- When completing treatment summaries, discharge summaries, or reports for insurance utilization review
- When a client's treatment progress is being reviewed by a supervisor, consultant, or multidisciplinary team
Key Components
PHQ-9 (Patient Health Questionnaire-9)
The PHQ-9 is a 9-item self-report measure of depressive symptom severity based on DSM criteria for major depressive disorder. Each item rates the frequency of a symptom over the past two weeks on a scale from 0 (not at all) to 3 (nearly every day). Total scores range from 0 to 27.
Scoring and Interpretation:
| Score Range | Severity | Clinical Action |
|---|---|---|
| 0-4 | Minimal or no depression | No treatment indicated; monitor if appropriate |
| 5-9 | Mild depression | Watchful waiting; consider follow-up |
| 10-14 | Moderate depression | Treatment plan indicated (therapy, medication, or both) |
| 15-19 | Moderately severe depression | Active treatment recommended (therapy and/or medication) |
| 20-27 | Severe depression | Immediate treatment; consider medication, intensive services |
Key clinical notes: A score of 10 or above has a sensitivity of 88% and specificity of 88% for major depressive disorder. A change of 5 or more points is considered reliable change. Item 9 assesses suicidal ideation — any score above 0 on this item requires immediate clinical assessment regardless of the total score.
GAD-7 (Generalized Anxiety Disorder-7)
The GAD-7 is a 7-item self-report measure of anxiety severity. Items are rated on the same 0-3 scale as the PHQ-9, covering a two-week timeframe. Total scores range from 0 to 21.
Scoring and Interpretation:
| Score Range | Severity | Clinical Action |
|---|---|---|
| 0-4 | Minimal anxiety | No treatment indicated |
| 5-9 | Mild anxiety | Monitor; consider treatment if functionally impaired |
| 10-14 | Moderate anxiety | Treatment plan indicated |
| 15-21 | Severe anxiety | Active treatment recommended |
Key clinical notes: While developed for generalized anxiety disorder, the GAD-7 has good sensitivity for detecting panic disorder, social anxiety disorder, and PTSD. A score of 10 or above is the standard clinical threshold for treatment consideration. The PHQ-9 and GAD-7 are frequently administered together because depression and anxiety are highly comorbid — up to 60% of clients with major depression also meet criteria for an anxiety disorder.
PCL-5 (PTSD Checklist for DSM-5)
The PCL-5 is a 20-item self-report measure that assesses the 20 DSM-5 symptoms of PTSD. Items are rated from 0 (not at all) to 4 (extremely), covering the past month. Total scores range from 0 to 80. The PCL-5 can also be scored by symptom cluster: intrusions (items 1-5), avoidance (items 6-7), negative alterations in cognition and mood (items 8-14), and arousal and reactivity (items 15-20).
Scoring and Interpretation:
| Score Range | Interpretation | Clinical Action |
|---|---|---|
| 0-30 | Below clinical threshold | PTSD unlikely; monitor if clinically indicated |
| 31-33 | Provisional PTSD cutoff | Clinical interview recommended to confirm diagnosis |
| 34-80 | Above clinical threshold | Probable PTSD; trauma-focused treatment indicated |
Key clinical notes: The recommended clinical cutoff is 31-33. A change of 5-10 points is considered reliable change, and a change of 10-20 points is considered clinically meaningful change. The PCL-5 is freely available from the National Center for PTSD. It is typically administered every 2-4 weeks during trauma-focused treatment to monitor progress.
ORS/SRS (Outcome Rating Scale / Session Rating Scale)
The ORS and SRS, developed by Scott D. Miller and Barry L. Duncan, are ultra-brief measures designed for every-session administration. The ORS measures general client functioning across four domains (individual, interpersonal, social, overall), while the SRS measures the therapeutic alliance across four dimensions (relationship, goals and topics, approach or method, and overall).
ORS Scoring and Interpretation:
| Score | Interpretation |
|---|---|
| 0-24 | Below clinical cutoff — client is experiencing clinically significant distress |
| 25-40 | Above clinical cutoff — client is in the functional range |
| Clinical cutoff: 25 | Scores below 25 suggest the client is experiencing significant difficulties |
Each of the four ORS items is scored on a 10cm visual analogue scale, with marks converted to scores from 0 to 10. Total possible score is 40. A reliable change index of 5 points has been established — a change of 5 or more points indicates statistically reliable improvement or deterioration.
SRS Scoring and Interpretation: The SRS uses the same 4-item visual analogue format with a total possible score of 40. A cutoff score of 36 has been established — scores below 36 suggest a problem with the therapeutic alliance that should be addressed directly with the client.
PHQ-A (Patient Health Questionnaire — Adolescent)
The PHQ-A is a modified version of the PHQ-9 adapted for adolescents ages 11-17. It uses the same 9-item, 0-3 scoring format, with language adjusted for developmental appropriateness.
Scoring and Interpretation:
| Score Range | Severity |
|---|---|
| 0-4 | None/minimal depression |
| 5-9 | Mild depression |
| 10-14 | Moderate depression |
| 15-19 | Moderately severe depression |
| 20-27 | Severe depression |
Key clinical notes: A score of 11 or higher has a sensitivity of 89.5% and specificity of 77.5% for major depression in adolescents. The American Academy of Pediatrics recommends annual depression screening for adolescents age 12 and older. As with the PHQ-9, any positive response on item 9 (suicidal ideation) requires immediate clinical assessment.
Edinburgh Postnatal Depression Scale (EPDS)
The EPDS is a 10-item self-report screening tool for depression during pregnancy and the postpartum period. Items are scored 0-3, with total scores ranging from 0 to 30. Unlike the PHQ-9, several EPDS items are reverse-scored.
Scoring and Interpretation:
| Score Range | Interpretation | Clinical Action |
|---|---|---|
| 0-9 | Depression unlikely | Routine monitoring |
| 10-12 | Possible depression | Further evaluation recommended |
| 13-30 | Probable depression | Diagnostic assessment indicated |
Key clinical notes: A cutoff of 13 or above has been validated for detecting probable major depression in the postnatal period. Some clinical guidelines use a lower cutoff of 10 for screening purposes. Item 10 assesses thoughts of self-harm — any non-zero response requires immediate clinical assessment regardless of total score. The EPDS can be used during pregnancy as well as postpartum, and it screens for anxiety symptoms in addition to depression.
Quick Reference Table
| Measure | Items | Score Range | Mild | Moderate | Severe | Clinical Cutoff | Reliable Change |
|---|---|---|---|---|---|---|---|
| PHQ-9 | 9 | 0-27 | 5-9 | 10-14 | 20-27 | 10 | 5 points |
| GAD-7 | 7 | 0-21 | 5-9 | 10-14 | 15-21 | 10 | 4 points |
| PCL-5 | 20 | 0-80 | — | — | — | 31-33 | 5-10 points |
| ORS | 4 | 0-40 | — | — | — | 25 | 5 points |
| SRS | 4 | 0-40 | — | — | — | 36 | — |
| PHQ-A | 9 | 0-27 | 5-9 | 10-14 | 20-27 | 11 | 5 points |
| EPDS | 10 | 0-30 | — | 10-12 | 13+ | 13 | — |
Documenting Outcome Measures in a Progress Note
Session Date: 03/20/2026 | Session #: 12 | Client: K.P.
Outcome Measures Administered:
PHQ-9: Score 10 (moderate depression), decreased from 15 (moderately severe) at session 8. This represents a 5-point decrease, meeting the threshold for reliable change. Client denied suicidal ideation (item 9 = 0). Highest-scoring items: anhedonia (3), fatigue (2), sleep disturbance (2). Results discussed with client, who reported the improvement is consistent with her experience — "I feel like I'm coming out of the fog a little."
GAD-7: Score 8 (mild anxiety), decreased from 13 (moderate) at session 8. Client noted that worry about work has decreased but worry about her daughter's school performance remains elevated. Highest-scoring items: uncontrollable worry (2), restlessness (2).
ORS: Total score 28.4 (above clinical cutoff of 25), increased from 19.2 at intake. Improvement is most notable in the interpersonal domain (from 3.1 to 7.8). Social domain remains the lowest (5.6), consistent with client's continued social withdrawal.
SRS: Total score 38.2 (above alliance cutoff of 36). Client rated all four domains positively. No alliance concerns identified.
Clinical Integration: Outcome data indicates a positive treatment trajectory with clinically reliable improvement in both depression and anxiety over the past four sessions. The decline in PHQ-9 corresponds temporally with the introduction of behavioral activation assignments targeting anhedonia (initiated in session 9). The persistent elevation in GAD-7 items related to worry about her daughter suggests this may be a useful focus for cognitive restructuring in upcoming sessions. The improvement in ORS interpersonal functioning is encouraging and likely reflects the client's re-engagement with her partner, which she identified as a treatment goal. Social functioning remains an area for continued work. Treatment plan remains appropriate; no modifications indicated at this time.
This is a sample for educational purposes only — not real patient data.
How to Use It Step by Step
Step 1: Select measures appropriate to your client population and presenting problems. Use the PHQ-9 and GAD-7 as general screeners for most adult clients. Add the PCL-5 when PTSD is suspected or being treated. Use the EPDS for perinatal clients. Use the PHQ-A for adolescents. Consider the ORS/SRS for every-session outcome and alliance monitoring regardless of diagnosis.
Step 2: Administer at intake to establish baseline scores. Document the baseline score, severity category, and any clinically significant item responses. This baseline becomes the reference point for all subsequent administrations.
Step 3: Administer at regular intervals throughout treatment. Brief measures (PHQ-9, GAD-7, ORS, SRS) can be administered at every session — they take one to two minutes each and provide continuous tracking data. Longer measures (PCL-5) can be administered every 2-6 sessions.
Step 4: Score and interpret immediately. Score the measure before or at the beginning of the session so you can discuss the results with the client. Waiting to score measures after the session eliminates the clinical feedback loop that makes them valuable.
Step 5: Document in the progress note. Include the measure name, total score, severity category, comparison to the previous score (direction and magnitude of change), any item-level responses of clinical significance, and the client's subjective reaction to the results.
Step 6: Use the data clinically. Discuss the results with the client. Explore discrepancies between scores and subjective report. Use upward trends to explore potential causes of deterioration. Use downward trends to reinforce progress and identify effective strategies. Adjust the treatment plan when outcome data suggests the current approach is not working.
Step 7: Track longitudinally. Maintain a running record of scores across sessions — either in a graph, a spreadsheet, or your EHR's outcome tracking system. Longitudinal tracking reveals patterns (e.g., seasonal worsening, score increases correlated with specific stressors) that individual data points do not.
Common Mistakes
-
Administering measures without using the data. Collecting outcome measures and filing them without reviewing the scores, discussing them with the client, or integrating them into clinical decision-making is a compliance exercise, not measurement-based care. If you are not going to use the data, you are wasting the client's time.
-
Documenting only the total score. A total PHQ-9 score of 12 tells part of the story. Documenting which items are most elevated, whether item 9 (suicidal ideation) was endorsed, and whether the score increased or decreased from the previous administration tells the full story.
-
Treating a single score as definitive. A single elevated score does not confirm a diagnosis, and a single low score does not indicate recovery. Scores should be interpreted in context — considering the client's baseline, the trajectory over time, current life circumstances, and the phase of treatment.
-
Not addressing SRS scores below the alliance cutoff. When the SRS score drops below 36, many clinicians ignore it or assume the client made an error. Research shows that addressing alliance ruptures directly — by asking the client what could be improved about the therapeutic relationship — is one of the most powerful predictors of treatment outcome. Low SRS scores are clinical data, not criticism.
-
Using outcome measures only for insurance. If the primary reason you administer outcome measures is to satisfy insurance requirements, you are missing their clinical value. Measures that are administered, scored, discussed, and used to guide treatment decisions improve outcomes. Measures that are administered and filed improve documentation but not care.
Writing a clinical document right now?
My Clinical Writer helps you generate clinical documents from your session details in under 60 seconds.
Try My Clinical Writer Free →myclinicalwriter.ai
Frequently Asked Questions
Related Templates
External Resources
Authoritative references and tools related to this documentation type.
Stop spending hours on documentation
My Clinical Writer uses AI to help you draft clinical notes, treatment plans, and reports in minutes — not hours.
Get Started at myclinicalwriter.ai →