You have five lead vendors. Each one sends a monthly invoice and a report full of metrics that make them look good. None of those reports tell you cost per signed case. So every budget conversation is a negotiation based on partial information — and the vendors who argue loudest tend to win.
A vendor scorecard fixes that. It evaluates every vendor in your portfolio against the same metrics, weights them by importance, and produces a grade you can act on. Budget conversations stop being debates. They become decisions grounded in your data, not theirs.
This guide covers the complete build: which metrics to include, how to weight them, and how to attach a decision rule to every grade so the scorecard actually changes how you spend.
Related guide: See our complete guide to evaluating PI lead vendors — the 7 metrics that define vendor quality and how to build a vendor scorecard.
Why Most Vendor Evaluations Fail Before They Start
The typical PI firm vendor review goes like this: someone pulls invoices, scans lead volume, and makes a call based on whether the vendor “feels” productive. It's fast, informal, and wrong in two specific ways.
First, it evaluates in isolation. Vendor A this month versus Vendor A last month — not Vendor A versus B, C, and D on identical terms. You can't optimize a portfolio one position at a time.
Second, it measures the wrong thing. Cost per lead shows up on an invoice. Cost per case — the number that actually tells you whether spend produced value — requires connecting invoice data to intake and case management records. Informal reviews skip that step every time.
A scorecard is designed to solve both problems simultaneously.
Step 1: Choose Your Scorecard Metrics
Keep reading
Five to seven metrics is the right range. Fewer than five and you miss meaningful signal. More than seven and the scorecard becomes noise. Here are the metrics that matter most for PI vendor evaluation:
Cost Per Signed Case
Your primary metric. Total spend divided by signed cases attributed to that vendor in the measurement window. It collapses lead volume, CPL, and conversion rate into one number you can compare across every vendor in the portfolio. Every other metric on the scorecard adds context to this one.
Lead-to-Case Conversion Rate
Signed cases divided by total leads, expressed as a percentage. A vendor sending 100 leads that produce 10 signed cases is fundamentally different from a vendor sending 100 leads that produce 3 — even if their cost per lead is identical. This is your clearest window into actual lead quality.
Rejection Rate
Rejected or declined leads divided by total leads received. Above 20–25% is a yellow flag. Above 35% is a serious problem — the vendor is likely sourcing outside your agreed case criteria, outside your geography, or from channels that attract the wrong claimants.
Conversion Trend (3-Month Direction)
Is this vendor's conversion rate moving up, flat, or declining? A vendor at 9% conversion with a three-month decline needs a different response than a vendor at 9% and improving. Trend data separates a short-term dip from a structural problem — which is a completely different conversation.
Case Severity Distribution
What share of signed cases are high-severity (catastrophic, surgical, significant soft tissue) versus low-severity (minor soft tissue, disputed liability)? Include this if your case management system exports case type. Vendors that consistently deliver low-severity cases will look deceptively efficient on cost-per-case — because those cases settle at far lower values. Case-level analytics surface this severity data by vendor without manual lookups.
Cost Per Lead (Contextual Only)
Include CPL as reference data, not a scored metric. It helps you understand a vendor's economics, but it shouldn't influence the grade. CPL is an input. Cost per case is the output that matters.
Step 2: Assign Weights to Each Metric
Not all metrics deserve equal pull on the final grade. A practical weighting model for PI vendor scorecards:
- Cost per signed case: 35%
- Lead-to-case conversion rate: 25%
- Rejection rate: 20%
- Conversion trend: 15%
- Case severity distribution: 5% (raise this when you have reliable severity data)
This weighting treats financial efficiency as the primary signal, with lead quality and funnel health as secondary indicators. It works for most PI firms running mixed lead portfolios.
Weight
If your firm prioritizes case quality over volume — for example, you only take catastrophic injury cases — shift weight toward severity distribution and reduce the weight on raw cost per case.
Step 3: Build the Scoring Scale
Each metric needs a 1–5 scale. Anchor it to your firm's own data — a score of 3 should mean “at the firm average,” not some arbitrary industry benchmark you don't actually know.
For cost per signed case, calculate your firm's blended average first. Then score each vendor relative to that number:
- 5: More than 25% below firm average cost per case
- 4: 10–25% below firm average
- 3: Within 10% of firm average (either direction)
- 2: 10–35% above firm average
- 1: More than 35% above firm average
Use the same relative logic for conversion rate. For rejection rate — where lower is better — a score of 5 means below 10%, and a score of 1 means above 35%.
Step 4: Calculate the Weighted Score
Multiply each metric score by its weight, then add the results. A vendor that scores 4 on cost per case (35% weight), 3 on conversion (25%), 4 on rejection rate (20%), 3 on trend (15%), and 3 on severity (5%) earns a weighted score of 3.60. Out of 5.
Convert the weighted score to a letter grade for easy communication:
- A: 4.0–5.0
- B: 3.0–3.9
- C: 2.0–2.9
- D: 1.0–1.9
Step 5: Attach a Budget Decision Rule to Each Grade
A scorecard without a decision protocol is just a report. The final step is defining what each grade requires at the next budget cycle — before you run the scorecard, not after.
- A vendors: Eligible for budget increase of up to 20%. No additional review needed until the next monthly cycle.
- B vendors: Maintain current budget. Monitor for movement and re-score next month.
- C vendors: Budget freeze. Schedule a vendor conversation within two weeks. Present the performance data. Re-score after 60 days on flat budget.
- D vendors:25–50% budget reduction. The vendor gets a defined 60-day improvement window. If the score doesn't reach C or above, end the contract.
The decision rule strips the politics out of vendor management. You're not rewarding tenure or a friendly rep. You're executing a protocol grounded in performance data — the same data for every vendor, every month.
How Often to Run the Scorecard
Monthly is the right cadence for most firms. You get enough data volume for meaningful comparisons, and you catch trend changes fast enough to act before they cost you a quarter of marketing budget.
Use a rolling 90-day window, not a calendar-month snapshot. The 90-day window smooths out short-term noise — a slow intake week, a holiday-related lead dip — without burying real performance changes. Slide the window forward by one month each time you score.
| Grade | Score Range | Budget Action | |
|---|---|---|---|
| A — Outperforming | 4.0–5.0 | Eligible for 20% budget increase | |
| B — On Track | 3.0–3.9 | Maintain current budget | |
| C — Below Threshold | 2.0–2.9 | Budget freeze, vendor conversation in 2 weeks | |
| D — Underperforming | 1.0–1.9 | 25–50% budget reduction, 60-day window |
Common Mistakes to Avoid
These patterns show up consistently in firms that build scorecards but don't get the results they expected:
- Using vendor-reported data:Your intake system and case management platform are authoritative. Vendor-provided “quality scores” and “verified lead” counts belong in the reference section — not the graded metrics. Vendors grade themselves generously.
- Grading on a 30-day window: One month produces too much noise. You end up making budget decisions on a few weeks of data that may not reflect how a vendor actually performs over time.
- Adding too many metrics: More than seven dilutes the scoring and makes it harder to diagnose why a vendor is underperforming. If everything signals a problem, nothing does.
- Ignoring the decision rules when the results are inconvenient: The most common failure mode — building a scorecard, running it for a few months, then reverting to gut instinct when the data says something unwelcome about a long-standing vendor. Trust the system.
Building Toward Automation
A manual scorecard is a real improvement over informal vendor evaluation. It's also two to three hours a month of data pulling and reconciliation. That's a reasonable investment for a firm managing four to six vendors.
As your portfolio grows, that maintenance cost grows with it. A revenue intelligence platform automates the data collection and calculates scorecard metrics in real time. The logic stays identical — the platform just eliminates the assembly work the manual approach requires every month.
Either way, the most important investment is the habit: every vendor reviewed on identical terms, every month, with budget decisions driven by what the data says. Build that discipline first. The tooling can catch up.
Related guide: See our complete guide to lead source tracking for law firms — the 4-level attribution chain, 8 data points, and 5-step tracking system every PI firm needs.
Related guide:For the complete category guide, see ourdefinitive guide to Revenue Intelligence for Personal Injury Law Firms — the four intelligence layers, the maturity model, and the 90-day path from spreadsheets to a connected revenue engine.
