Skip to content

MukundaKatta/agentcast

agentcast

npm version npm downloads CI License: MIT Node Tests runtime deps

📖 Part of the agent-stack — 5 tiny libraries to stop AI agents from misbehaving in production.

Structured output for any LLM call. Validate the model's response, retry with the validation error as feedback, return typed data or throw after N attempts. Bring your own LLM, bring your own validator (zod, valibot, JSON Schema, plain predicate). Zero runtime dependencies.

npm install @mukundakatta/agentcast
import { cast, adapters } from '@mukundakatta/agentcast';
import { z } from 'zod';
import Anthropic from '@anthropic-ai/sdk';

const client = new Anthropic();

const productSchema = z.object({
  name: z.string(),
  price: z.number(),
  in_stock: z.boolean(),
  tags: z.array(z.string()),
});

const product = await cast({
  llm: async (messages) => {
    const r = await client.messages.create({
      model: 'claude-sonnet-4-6',
      max_tokens: 1024,
      messages: messages.map((m) => ({ role: m.role, content: m.content })),
    });
    return r.content[0].text;
  },
  validate: adapters.zod(productSchema),
  prompt: 'Generate one example product.',
  maxRetries: 3,
});

// product is { name, price, in_stock, tags } — guaranteed shape, or it threw.

If the model returns prose-wrapped JSON, returns the wrong type for a field, forgets a field, or refuses to return JSON at all, cast() extracts what it can, runs your validator, and feeds the validation error back to the model on the next attempt. After maxRetries it throws CastError with the full attempt history.

TypeScript types ship in the box.

See it in action

git clone https://github.com/MukundaKatta/agentcast && cd agentcast
node examples/demo-retry.js

Stubbed "LLM" returns invalid JSON twice and clean JSON the third time. Watch the message history grow as feedback gets appended.

Why

Every agent that returns structured data hits the same problems:

  • The model wraps JSON in Sure! Here you go: prose
  • Wraps it in ```json fences
  • Returns "thirty" when you needed 30
  • Returns the wrong shape entirely
  • Refuses with "I cannot help with that"

Ad-hoc retry loops sprawl through agent codebases. cast() is the small, focused primitive that handles the standard pattern in one place: extract → validate → feedback → retry → throw.

Other libraries that touch this problem:

  • Vercel ai's generateObject — great if you're in Vercel's ecosystem; this is the standalone version for any LLM call.
  • Python's instructor — same pattern, different language. This is the JS sibling.

API

cast(opts) → Promise<T>

The retry loop.

await cast({
  llm: async (messages) => '...',           // required: your LLM call
  validate: (value) => ({ valid: true }),   // required: validation function
  prompt: 'give me data',                   // required: the user prompt
  system: 'You are precise.',               // optional: system message
  maxRetries: 2,                            // optional: total attempts = 1 + maxRetries (default 2)
  onAttempt: (info) => console.log(info),   // optional: per-failed-attempt callback
});

Behavior:

  • Calls llm(messages) and expects a string back.
  • Pulls JSON via extractJson() (handles whole-text JSON, ```json fences, and the largest balanced {...}/[...] substring in prose).
  • Calls validate(value). If { valid: true, value }, returns value (or the original if value is undefined).
  • If extraction or validation fails, appends the assistant's response and a feedback message asking it to try again, then re-calls the LLM.
  • After maxRetries failures, throws CastError with the full attempt history.

extractJson(text) → any | null

Pulls JSON out of an LLM response. Tries: whole text, ```json fence, plain ``` fence, largest balanced {...}/[...] substring. Returns the parsed value or null. Useful standalone if you want to handle the retry yourself.

adapters.zod(schema)

Bridge for any validator with a safeParse() method (zod, valibot, etc.).

import { z } from 'zod';
import { adapters } from '@mukundakatta/agentcast';

const validate = adapters.zod(z.object({ name: z.string() }));

adapters.fn(predicate, errorBuilder?)

For ad-hoc predicate validation, no schema lib needed.

const validate = adapters.fn(
  (v) => typeof v?.score === 'number' && v.score >= 0 && v.score <= 1,
  (v) => `score must be between 0 and 1, got ${v?.score}`
);

adapters.shape(spec)

Tiny built-in shape checker for when you want zero deps end-to-end.

const validate = adapters.shape({
  name: 'string',
  age: 'number',
  active: 'boolean?',  // suffix '?' for optional
  tags: 'array',
  meta: 'object',
});

Not a full JSON Schema validator — just enough to gate basic shapes. For richer constraints, use zod.

CastError

Thrown when retries are exhausted.

import { CastError } from '@mukundakatta/agentcast';

try {
  await cast(...);
} catch (err) {
  if (err instanceof CastError) {
    console.error('failed after', err.attempts.length, 'attempts');
    console.error('last error:', err.lastError);
    console.error('last text:', err.lastText);
    console.error('last parsed:', err.lastParsed);
    console.error('full history:', err.attempts);
  }
}

Recipes

With the OpenAI SDK

import OpenAI from 'openai';
const openai = new OpenAI();

await cast({
  llm: async (messages) => {
    const r = await openai.chat.completions.create({
      model: 'gpt-5',
      messages: messages.map((m) => ({ role: m.role, content: m.content })),
    });
    return r.choices[0].message.content;
  },
  validate: adapters.shape({ summary: 'string', sentiment: 'string' }),
  prompt: 'Analyze this review: ...',
});

Without any schema library (pure predicate)

const isISO8601 = (s) => typeof s === 'string' && /^\d{4}-\d{2}-\d{2}T/.test(s);

const event = await cast({
  llm: myLLM,
  prompt: 'Extract the event date as ISO 8601.',
  validate: adapters.fn(
    (v) => isISO8601(v?.date),
    (v) => `expected { date: ISO8601 string }, got ${JSON.stringify(v)}`
  ),
});

Logging every retry

await cast({
  llm,
  validate,
  prompt,
  maxRetries: 5,
  onAttempt: ({ attempt, text, error }) => {
    console.log(`[cast attempt ${attempt} failed]`, { error, text });
  },
});

CLI

@mukundakatta/agentcast ships an agentcast binary for one-shot extract/validate work — no LLM call needed, useful for testing prompts or sanity-checking captured outputs:

# Pull the JSON value out of a prose-wrapped LLM response
echo 'I think the answer is {"answer":"42"}.' | \
  npx -p @mukundakatta/agentcast agentcast extract -

# Validate a JSON value against an inline shape
echo '{"name":"alice"}' | npx -p @mukundakatta/agentcast agentcast validate - \
  --shape '{"name":"string","age":"number"}' --pretty

Output is JSON to stdout (the extracted value for extract; {valid, value|error} for validate). Exit code is 0 on success, 1 when no JSON could be extracted or the value fails validation, 2 on usage errors. Run agentcast --help for the full subcommand reference.

What this is not

  • Not a model client. You bring the LLM call. Works with any SDK, any HTTP client, any local model.
  • Not a prompt framework. It just handles the validate-and-retry loop. Compose with your existing prompt code.
  • Not a tool-call validator. This is for the model's output shape, not the tool calls it makes mid-conversation. For tool-call regressions, see @mukundakatta/agentsnap.

Sibling libraries

Part of the agent reliability stack — all @mukundakatta/* scoped, all zero-dep:

Natural pipeline: fit → guard → snap → vet → cast.

Status

v0.1.2 — tooling polish. Core API stable. TypeScript types included. 44/44 tests, CI on Node 20/22/24.

v0.2 plans (post-real-world-feedback):

  • Streaming variant (parse JSON as it streams in, fail fast on bad shape)
  • JSON-mode hint generation (auto-build the OpenAI/Anthropic JSON-mode params if user opts in)
  • Cost-aware max budget (skip retry if next attempt would exceed a $/token cap)

License

MIT

Repository Health

This repository includes a dependency-free health check for core documentation, metadata, and CI wiring. Run it locally before publishing changes:

python3 scripts/check_repository_health.py

The same check runs in GitHub Actions on pushes and pull requests.

About

Structured output for any LLM call — validate, retry with feedback, return typed data or throw. Zero deps. Sibling to agentsnap + agentguard.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors