Services

Services
Photo by Charles Forerunner / Unsplash

We help organizations test AI and plan for what's next. On the evaluation side, we pressure-test models against your actual use cases, data, and edge conditions, so you see real outcomes and risks before you buy. On the strategy side, we help mission-driven organizations build strategic plans that are grounded in evidence, shaped by stakeholders, and designed to be implemented. We bring real-world experience from government, regulators, startups nonprofits, and industry to aspects of our work.

See what we've been up to.

Strategic Planning

We help mission-driven organizations build strategic plans that are grounded in evidence, shaped by stakeholders, and designed to be implemented.

What we do:

  • Facilitate inclusive strategic planning processes with staff, board, funders, and external stakeholders
  • Conduct organizational assessments and comparative ecosystem analyses
  • Develop multi-year strategic plans with measurable goals and implementation roadmaps
  • Refresh mission, vision, and values language
  • Apply the Prosci change management framework to build organization-wide buy-in
  • Create product and project roadmaps that can be operationalized
  • Lead Theory of Change assessments

Deliverable: Multi-year strategic plan with implementation roadmap, annual planning framework, and stakeholder summary

Who this is for: Nonprofits, government agencies, and mission-driven organizations navigating technology, policy, and social impact

Pricing model: Project-based, fixed fee (3-8 months)


Pre-Procurement AI Safety & Bias Testing

Test vendor systems before you sign the contract.

What we do:

  • Independent safety evaluation of AI systems before purchase
  • Test across multiple languages and harm categories
  • Compare vendor claims against actual performance
  • Provide objective pass/fail data for purchasing decisions

Deliverable: Written report with test results, pass rates, and procurement recommendation (2-3 weeks)

Who this is for: Procurement officers, compliance teams, department heads evaluating AI vendors

Pricing model: Fixed fee per system evaluation


Multilingual AI Safety & Bias Audits

Find vulnerabilities in languages other than English.

What we do:

  • Test AI systems in 9+ languages
  • Identify where safety filters fail in non-English languages
  • Document the language gap in your specific deployment
  • Provide evidence for vendor negotiations

Deliverable: Multilingual safety report with language-specific failure rates and recommendations

Who this is for: Organizations serving multilingual users, international deployments, government agencies

Pricing model: Per-language testing or multi-language packages


Large-Scale Crowdsourced Testing

We are pioneers in running large-scale crowdsourced testing.

What we do:

  • Recruit domain experts (healthcare, education, legal, etc.)
  • Coordinate systematic testing across 100-500+ testers
  • Aggregate findings across diverse perspectives
  • Identify patterns technical testing misses

Deliverable: Comprehensive vulnerability report with prioritized findings and remediation roadmap

Who this is for: AI developers, high-risk deployments (medical, financial, educational), organizations requiring regulatory compliance

Pricing model: Project-based (3-6 months)


Internal Testing Capacity Building

Train your team to test AI safety.

What we do:

  • Train your staff to conduct ongoing AI safety testing
  • Develop custom test suites for your specific use case
  • Establish testing processes and documentation standards
  • Provide templates and tools for sustainable testing

Deliverable: Trained internal team + custom test suite + process documentation

Who this is for: Organizations deploying multiple AI systems, institutions wanting internal expertise, compliance teams

Pricing model: Training program + ongoing support retainer


AI Safety Standards Development

Build safety standards for your industry or region.

What we do:

  • Develop industry-specific safety testing standards
  • Create procurement frameworks for AI evaluation
  • Design multilingual safety benchmarks
  • Collaborate with regulators and policymakers

Deliverable: Published standards, testing frameworks, implementation guides

Who this is for: Industry associations, government agencies, regulatory bodies, large organizations setting internal standards

Pricing model: Consulting engagement (6-12 months)