Merit Engine - Project Charter

Published

June 29, 2026

internal

Merit Engine: AI Public Safety Promotional Readiness Tutor

Project Charter - Version 1.0

Prepared by: Tonya R. Dawson, Fairlawn Strategy Partners (FSP) Date: June 29, 2026 Status: In Development - Beta Design Phase

1. Executive Overview

The Merit Engine is an IRT-driven (Item Response Theory) AI tutoring platform built specifically for public safety promotional candidates - police officers, firefighters, and command staff preparing for rank advancement exams. It is not a study guide. It is a psychometric intelligence engine that diagnoses an officer’s latent ability (theta, or θ), targets their specific knowledge decay points, and validates their readiness to command before they ever sit for an oral board or written exam.

Core Value Proposition: > “We aren’t helping officers pass a test. We are validating their readiness to lead.”

Business Model: License the platform to public safety agencies (department-wide contracts) at tiered pricing. Where candidates fall short in specific competency areas, offer supplemental e-learning led by Tonya R. Dawson or Caraline Malloy.

2. Problem Statement

What is broken in the current market

Pain Point	Current Reality	Merit Engine Fix
Content volume	Officers study 1,000+ pages of SOPs, penal codes, fire tactics	AI prioritizes by criticality weight and knowledge decay
Prep equity	Study groups favor “insiders” - an informal inner circle	24/7 AI access levels the field for all shifts and units
Validation gap	Standard exams measure test-taking, not leadership aptitude	IRT measures latent ability (θ), not raw score percentage
Litigation exposure	Unvalidated promotional processes invite EEOC complaints	Bias-resistant audit trail is built into every session
Succession planning	Chiefs promote who tested well, not who is field-ready	Bench Strength Heat Map predicts leadership readiness

3. The Science Foundation: IRT vs. Classical Test Theory

Understanding this distinction is what separates FSP from every competitor.

Classical Test Theory (CTT) - What competitors use

Counts correct answers as a percentage
Every question is treated as equal
Cannot distinguish a “lucky guesser” from a true master
Does not adapt to the learner

Item Response Theory (IRT) - The Merit Engine standard

Measures the latent trait (θ) behind the answers
Every item has a calibrated difficulty parameter (b)
The 3-Parameter Logistic (3PL) model filters out lucky guesses via a pseudo-guessing parameter (c)
The engine knows not just that an officer got a question wrong, but why and at what difficulty threshold they break down

The practical difference: Two officers both score 80%. Officer A got the hard Use of Force and Search & Seizure questions right and missed easy administrative items. Officer B got the easy items right and guessed through the hard ones. Under CTT: tie game. Under IRT: Officer A is identified as the higher-potential leader. The Merit Engine tells a Chief which officer to promote.

4. Platform Architecture (Conceptual)

Layer 1: Knowledge Ingestion

Ingest agency-specific SOPs, General Orders, Penal/Fire Codes, POST standards
Tag each policy item with a Criticality Weight (e.g., Use of Force = 10, administrative leave policy = 2)
Build a Difficulty Matrix calibrated to the specific promotional exam format

Layer 2: The IRT Engine

Adaptive Item Selection: Each question served is chosen based on the candidate’s current estimated θ
Real-time θ recalculation after every response
False Confidence Detection: Flags items the officer answers quickly but incorrectly (a key I-O insight competitors miss)
Spaced Repetition scheduling based on the Ebbinghaus Forgetting Curve - re-tests items at the exact moment cognitive decay is predicted to begin

Layer 3: Scenario Simulation

Voice-AI oral board simulation: AI role-plays as a hostile subordinate, angry citizen, or superior officer
Decision Latency measurement: tracks the gap between receiving a scenario prompt and committing to a decision (2-second vs. 10-second response is field-significant)
Quizzing under artificial stressors (simulated radio chatter, time pressure) to build cognitive load resilience

Layer 4: Analytics Dashboard

Candidate View: Personal θ curve, 60-day progress, knowledge heat map, predicted final score range
HR/Command View: Force-wide anonymized readiness, unit-level bench strength, policy blind spot reports
Predictive Success Score: Final probability of mastery per candidate, per competency domain

Layer 5: E-Learning Bridge (Revenue Lever)

When a candidate’s θ falls below threshold in a domain (e.g., Constitutional Law, Budget Management), they are flagged
FSP offers targeted e-learning modules taught live by Tonya R. Dawson or Caraline Malloy
This converts a software license into an ongoing services revenue stream

5. The 60-Day Command Sprint

Phase 1: Diagnostic Foundation (Days 1-15)

Goal: Establish the ability baseline and identify knowledge decay

Period	Focus	AI Activity	I-O Metric
Day 1	Full Diagnostic	100-question adaptive exam across all SOPs and Law	Initial θ Baseline
Days 2-7	Gap Mapping	AI identifies “False Confidence” - areas answered fast but incorrectly	Discrimination Index
Days 8-15	Foundation Building	Personalized high-yield summaries of bottom 20% scoring areas	Knowledge Decay Map

Phase 2: Targeted Mastery (Days 16-45)

Goal: Close gaps, build long-term retention

Period	Focus	AI Activity	I-O Metric
Weeks 3-4	High-Yield Drills	15-minute daily micro-quizzes on highest-criticality statutes	Spaced Repetition curve
Weeks 5-6	Scenario Simulation	Voice-AI role-plays disgruntled officer in disciplinary scenario	Behavioral Consistency
Week 6	Stress Testing	Quizzing with environmental distractors (simulated radio traffic/noise)	Cognitive Load Resilience

Phase 3: Peak Performance (Days 46-60)

Goal: Strategic test-taking stamina and anxiety regulation

Period	Focus	AI Activity	I-O Metric
Week 7	Mock Simulation	4-hour proctored exams with dept-weighted scoring identical to actual exam	Predictive Validity
Week 8	Sniper Drills	Quizzing only on the final 5% of missed items	Mastery Ceiling
Day 60	Taper and Reset	High-level philosophy review, go/no-go readiness visualization	Peak Readiness State

6. Business Model: Tiered Pricing

Tier	Service Level	Target Client	Price Range	E-Learning Upsell
Bronze: The SaaS	Access to agency-tuned AI study tutor with basic SOP training	Individual officers or small-volume departments	$199-$350/user (one-time license)	Optional add-on
Silver: Path to Mastery	Personalized 60-day plan, weekly AI performance audits, high-fidelity oral board simulations	Mid-sized departments (50-200 sworn)	$15k-$40k/dept contract	Included at trigger thresholds
Gold: Leadership Pipeline	Full IRT-validated diagnostic for whole dept, succession planning analytics for Chiefs, “Success Prediction” reports	Large municipalities, state agencies	$100k+ (multi-year strategic partnership)	Included, custom curriculum

The E-Learning Revenue Loop

When a candidate scores below a department-defined threshold in a competency area: 1. Merit Engine flags the gap automatically 2. Candidate is offered a targeted 4-6 hour e-learning module 3. Module is taught live (Tonya or Caraline) or on-demand 4. Completion is documented in the candidate’s Merit Engine profile as a validated remediation

7. Competitive Positioning

Feature	All-Write Testing	Truleo / PowerDMS	NotebookLM	Merit Engine (FSP)
Core Function	Static exam prep	Policy management and analysis	Document synthesis chatbot	IRT-driven psychometric tutor
Logic Engine	Linear quizzing	Administrative workflow	General LLM	Adaptive IRT with 3PL model
Personalization	One-size-fits-all	Search-based	User-fed prompts	Theta-calibrated per candidate
I-O Validation	Content-based only	None (operational)	None (general)	Criterion validity, bias audit trail
Strategic Goal	Passing the test	Compliance	Information retrieval	Predicting and validating leadership readiness
Exclusivity	Mass market	Enterprise software	Public consumer	Agency-sovereign instances

Key differentiator: Competitors sell content delivery. FSP sells validated intelligence. Content is cheap. Calibration is expensive. That is the price justification.

8. Go-to-Market Strategy

Primary Buyers

Training Officers / HR Directors - operational pain: inconsistent prep, oral board liability
Police/Fire Chiefs - strategic pain: succession gaps, litigation exposure, retention
Graduate School Program Managers - channel partner: executive leadership programs that train Chiefs

Target Market Entry

Pilot: Mid-sized departments (50-200 sworn) at Silver Tier - best balance of revenue and proof-of-concept data
SDPD and comparable “Big 10” agencies - aspirational anchor clients that validate the platform nationally
University partnerships - embed Merit Engine into executive leadership curriculum as a “Digital Teaching Assistant”

Exclusivity Model (Selective Engagement Policy)

Maximum 3 marquee agencies or elite institutions per region per calendar year
Agency data is never used to train models for competing jurisdictions (Sovereign Instance model)
FSP reserves the right to decline engagements where culture does not align with Ethical AI and Objective Merit

9. Assumptions Being Challenged

The following are working assumptions that should be tested with beta users before hardening the product:

Assumption 2: Officers will engage with daily 15-minute micro-sessions voluntarily

Challenge: Shift work, overtime, and fatigue are real. Voluntary adoption may be low without departmental mandate or incentive. Mitigation to explore: Gamification layer, shift-aware scheduling that pushes sessions during known downtime, and union engagement strategy.

Assumption 3: IRT calibration is achievable without large historical item-response datasets

Challenge: IRT item calibration requires substantial pilot data (typically 200-500 responses per item) to accurately estimate difficulty parameters. A new agency with limited prior exam data will have uncalibrated items. Mitigation to explore: Use pre-calibrated item banks from POST standards and published public safety exams as a starting scaffold, then recalibrate over time with actual response data.

Assumption 4: Voice-AI oral board simulation will feel authentic enough to build real preparation

Challenge: Current voice-AI has latency, unnatural cadence, and limited emotional range. Officers may dismiss it as a gimmick. Mitigation to explore: Frame it explicitly as a knowledge stress-test (not a human substitute), and measure decision latency and content accuracy rather than conversational naturalness.

Assumption 5: A $15k-$100k price point is accessible within standard department training budgets

Challenge: Many mid-sized departments operate on thin training line items and require city council or grant approval for new technology spend above a certain threshold. Mitigation to explore: Grant writing support (COPS Office, BJA), per-officer pricing at bronze tier as a foot-in-the-door, and framing Gold Tier as litigation risk insurance.

Assumption 6: Tonya and Caraline can personally scale e-learning delivery as a revenue stream

Challenge: Live instruction is the highest-margin service but the least scalable. If 20 agencies trigger the e-learning upsell simultaneously, capacity breaks. Mitigation to explore: Build asynchronous on-demand versions of core modules early. Reserve live instruction for high-complexity domains or Gold Tier clients only.

11. Key Personnel

Role	Person	Responsibility
Founder / I-O Lead	Tonya R. Dawson	Industrial Organizational Psychology Professional; psychometric framework, client relationships, e-learning instruction
Co-Founder / E-Learning Lead	Caraline Malloy	Industrial Organizational Psychology Professional, researcher, and e-learning instructor; leads live and on-demand coaching layer
Software Engineer (TBD)	Open	IRT engine, adaptive quiz platform, dashboard
Psychometrician Advisor (TBD)	Open	IRT item calibration, validity studies

12. Success Metrics (Beta Pilot)

Metric	Target
Knowledge Acquisition	15%+ average increase in baseline-to-peak assessment scores
Candidate Engagement	More than 80% participation rate in daily AI micro-sessions
Administrative ROI	Reduction in internal training man-hours by minimum 40 hours per promotional cycle
Predictive Accuracy	Predicted score range within +/- 5 points of actual exam score
E-Learning Conversion	More than 25% of flagged candidates enroll in supplemental module

13. Open Questions for Beta Users

These are the questions to actively probe during initial client conversations and pilot programs:

What promotional cycle are you running and what is the timeline?
How many pages is your current SOP/General Orders manual?
Do candidates currently receive any department-funded prep support, or is it self-directed?
What is the promotional exam format - written only, oral board only, or combined?
Has your department faced EEOC complaints or grievances related to a promotional process in the last 5 years?
What is your data governance policy for third-party AI tools?
Who owns the promotional process - HR, the Chief, Civil Service Commission, or all three?
What would a measurable “win” look like for you after one cycle?

Document Version 1.0 - June 29, 2026 Fairlawn Strategy Partners, LLC, an affiliate of the Institute for Transformative Change - Confidential and Proprietary Contact: Tonya R. Dawson | tonya@fairlawnstrategy.com

Merit Engine - Project Charter

Merit Engine: AI Public Safety Promotional Readiness Tutor

Project Charter - Version 1.0

1. Executive Overview

2. Problem Statement

What is broken in the current market

3. The Science Foundation: IRT vs. Classical Test Theory

Classical Test Theory (CTT) - What competitors use

Item Response Theory (IRT) - The Merit Engine standard

4. Platform Architecture (Conceptual)

Layer 1: Knowledge Ingestion

Layer 2: The IRT Engine

Layer 3: Scenario Simulation

Layer 4: Analytics Dashboard

Layer 5: E-Learning Bridge (Revenue Lever)

5. The 60-Day Command Sprint

Phase 1: Diagnostic Foundation (Days 1-15)

Phase 2: Targeted Mastery (Days 16-45)

Phase 3: Peak Performance (Days 46-60)

6. Business Model: Tiered Pricing

The E-Learning Revenue Loop

7. Competitive Positioning

8. Go-to-Market Strategy

Primary Buyers

Target Market Entry

Exclusivity Model (Selective Engagement Policy)

9. Assumptions Being Challenged

Assumption 2: Officers will engage with daily 15-minute micro-sessions voluntarily

Assumption 3: IRT calibration is achievable without large historical item-response datasets

Assumption 4: Voice-AI oral board simulation will feel authentic enough to build real preparation

Assumption 5: A $15k-$100k price point is accessible within standard department training budgets

Assumption 6: Tonya and Caraline can personally scale e-learning delivery as a revenue stream

10. Technical Build Roadmap (Iteration 1)

Phase 0: Foundation (Months 1-2)

Phase 1: MVP (Months 3-5)

Phase 2: Intelligence Layer (Months 6-9)

Phase 3: Scale (Months 10-18)

11. Key Personnel

12. Success Metrics (Beta Pilot)

13. Open Questions for Beta Users

Merit Engine: AI Public Safety Promotional Readiness Tutor

Project Charter - Version 1.0

1. Executive Overview

2. Problem Statement

What is broken in the current market

3. The Science Foundation: IRT vs. Classical Test Theory

Classical Test Theory (CTT) - What competitors use

Item Response Theory (IRT) - The Merit Engine standard

4. Platform Architecture (Conceptual)

Layer 1: Knowledge Ingestion

Layer 2: The IRT Engine

Layer 3: Scenario Simulation

Layer 4: Analytics Dashboard

Layer 5: E-Learning Bridge (Revenue Lever)

5. The 60-Day Command Sprint

Phase 1: Diagnostic Foundation (Days 1-15)

Phase 2: Targeted Mastery (Days 16-45)

Phase 3: Peak Performance (Days 46-60)

6. Business Model: Tiered Pricing

The E-Learning Revenue Loop

7. Competitive Positioning

8. Go-to-Market Strategy

Primary Buyers

Target Market Entry

Exclusivity Model (Selective Engagement Policy)

9. Assumptions Being Challenged

Assumption 1: Departments will share SOP data with a third-party AI vendor

Assumption 2: Officers will engage with daily 15-minute micro-sessions voluntarily

Assumption 3: IRT calibration is achievable without large historical item-response datasets

Assumption 4: Voice-AI oral board simulation will feel authentic enough to build real preparation

Assumption 5: A $15k-$100k price point is accessible within standard department training budgets

Assumption 6: Tonya and Caraline can personally scale e-learning delivery as a revenue stream

10. Technical Build Roadmap (Iteration 1)

Phase 0: Foundation (Months 1-2)

Phase 1: MVP (Months 3-5)

Phase 2: Intelligence Layer (Months 6-9)

Phase 3: Scale (Months 10-18)

11. Key Personnel

12. Success Metrics (Beta Pilot)

13. Open Questions for Beta Users