Services

We help organizations test AI and plan for what's next. On the evaluation side, we pressure-test models against your actual use cases, data, and edge conditions, so you see real outcomes and risks before you buy. On the strategy side, we help mission-driven organizations build strategic plans that are grounded in evidence, shaped by stakeholders, and designed to be implemented. We bring real-world experience from government, regulators, startups nonprofits, and industry to aspects of our work.

See what we've been up to.

Schedule Consultation

Strategic Planning

We help mission-driven organizations build strategic plans that are grounded in evidence, shaped by stakeholders, and designed to be implemented.

What we do:

Facilitate inclusive strategic planning processes with staff, board, funders, and external stakeholders
Conduct organizational assessments and comparative ecosystem analyses
Develop multi-year strategic plans with measurable goals and implementation roadmaps
Refresh mission, vision, and values language
Apply the Prosci change management framework to build organization-wide buy-in
Create product and project roadmaps that can be operationalized
Lead Theory of Change assessments

Deliverable: Multi-year strategic plan with implementation roadmap, annual planning framework, and stakeholder summary

Who this is for: Nonprofits, government agencies, and mission-driven organizations navigating technology, policy, and social impact

Pricing model: Project-based, fixed fee (3-8 months)

Pre-Procurement AI Safety & Bias Testing

Test vendor systems before you sign the contract.

What we do:

Independent safety evaluation of AI systems before purchase
Test across multiple languages and harm categories
Compare vendor claims against actual performance
Provide objective pass/fail data for purchasing decisions

Deliverable: Written report with test results, pass rates, and procurement recommendation (2-3 weeks)

Who this is for: Procurement officers, compliance teams, department heads evaluating AI vendors

Pricing model: Fixed fee per system evaluation

Multilingual AI Safety & Bias Audits

Find vulnerabilities in languages other than English.

What we do:

Test AI systems in 9+ languages
Identify where safety filters fail in non-English languages
Document the language gap in your specific deployment
Provide evidence for vendor negotiations

Deliverable: Multilingual safety report with language-specific failure rates and recommendations

Who this is for: Organizations serving multilingual users, international deployments, government agencies

Pricing model: Per-language testing or multi-language packages

Large-Scale Crowdsourced Testing

We are pioneers in running large-scale crowdsourced testing.

What we do:

Recruit domain experts (healthcare, education, legal, etc.)
Coordinate systematic testing across 100-500+ testers
Aggregate findings across diverse perspectives
Identify patterns technical testing misses

Deliverable: Comprehensive vulnerability report with prioritized findings and remediation roadmap

Who this is for: AI developers, high-risk deployments (medical, financial, educational), organizations requiring regulatory compliance

Pricing model: Project-based (3-6 months)

Internal Testing Capacity Building

Train your team to test AI safety.

What we do:

Train your staff to conduct ongoing AI safety testing
Develop custom test suites for your specific use case
Establish testing processes and documentation standards
Provide templates and tools for sustainable testing

Deliverable: Trained internal team + custom test suite + process documentation

Who this is for: Organizations deploying multiple AI systems, institutions wanting internal expertise, compliance teams

Pricing model: Training program + ongoing support retainer

AI Safety Standards Development

Build safety standards for your industry or region.

What we do:

Develop industry-specific safety testing standards
Create procurement frameworks for AI evaluation
Design multilingual safety benchmarks
Collaborate with regulators and policymakers

Deliverable: Published standards, testing frameworks, implementation guides

Who this is for: Industry associations, government agencies, regulatory bodies, large organizations setting internal standards

Pricing model: Consulting engagement (6-12 months)