Services
We help organizations test AI and plan for what's next. On the evaluation side, we pressure-test models against your actual use cases, data, and edge conditions, so you see real outcomes and risks before you buy. On the strategy side, we help mission-driven organizations build strategic plans that are grounded in evidence, shaped by stakeholders, and designed to be implemented. We bring real-world experience from government, regulators, startups nonprofits, and industry to aspects of our work.
Strategic Planning
We help mission-driven organizations build strategic plans that are grounded in evidence, shaped by stakeholders, and designed to be implemented.
What we do:
- Facilitate inclusive strategic planning processes with staff, board, funders, and external stakeholders
- Conduct organizational assessments and comparative ecosystem analyses
- Develop multi-year strategic plans with measurable goals and implementation roadmaps
- Refresh mission, vision, and values language
- Apply the Prosci change management framework to build organization-wide buy-in
- Create product and project roadmaps that can be operationalized
- Lead Theory of Change assessments
Deliverable: Multi-year strategic plan with implementation roadmap, annual planning framework, and stakeholder summary
Who this is for: Nonprofits, government agencies, and mission-driven organizations navigating technology, policy, and social impact
Pricing model: Project-based, fixed fee (3-8 months)
Pre-Procurement AI Safety & Bias Testing
Test vendor systems before you sign the contract.
What we do:
- Independent safety evaluation of AI systems before purchase
- Test across multiple languages and harm categories
- Compare vendor claims against actual performance
- Provide objective pass/fail data for purchasing decisions
Deliverable: Written report with test results, pass rates, and procurement recommendation (2-3 weeks)
Who this is for: Procurement officers, compliance teams, department heads evaluating AI vendors
Pricing model: Fixed fee per system evaluation
Multilingual AI Safety & Bias Audits
Find vulnerabilities in languages other than English.
What we do:
- Test AI systems in 9+ languages
- Identify where safety filters fail in non-English languages
- Document the language gap in your specific deployment
- Provide evidence for vendor negotiations
Deliverable: Multilingual safety report with language-specific failure rates and recommendations
Who this is for: Organizations serving multilingual users, international deployments, government agencies
Pricing model: Per-language testing or multi-language packages
Large-Scale Crowdsourced Testing
We are pioneers in running large-scale crowdsourced testing.
What we do:
- Recruit domain experts (healthcare, education, legal, etc.)
- Coordinate systematic testing across 100-500+ testers
- Aggregate findings across diverse perspectives
- Identify patterns technical testing misses
Deliverable: Comprehensive vulnerability report with prioritized findings and remediation roadmap
Who this is for: AI developers, high-risk deployments (medical, financial, educational), organizations requiring regulatory compliance
Pricing model: Project-based (3-6 months)
Internal Testing Capacity Building
Train your team to test AI safety.
What we do:
- Train your staff to conduct ongoing AI safety testing
- Develop custom test suites for your specific use case
- Establish testing processes and documentation standards
- Provide templates and tools for sustainable testing
Deliverable: Trained internal team + custom test suite + process documentation
Who this is for: Organizations deploying multiple AI systems, institutions wanting internal expertise, compliance teams
Pricing model: Training program + ongoing support retainer
AI Safety Standards Development
Build safety standards for your industry or region.
What we do:
- Develop industry-specific safety testing standards
- Create procurement frameworks for AI evaluation
- Design multilingual safety benchmarks
- Collaborate with regulators and policymakers
Deliverable: Published standards, testing frameworks, implementation guides
Who this is for: Industry associations, government agencies, regulatory bodies, large organizations setting internal standards
Pricing model: Consulting engagement (6-12 months)