Skip to main content
aifinhub
general Calculator Guide

How to use Agent Skill Tester for Markets

Paste a SKILL.md definition, a sample input, and your Anthropic API key. The page runs the skill in your browser and returns the structured extraction, token cost, and latency — useful for evaluating Claude skills before wiring them into production.

By Orbyd Editorial · AI Fin Hub Team

What It Does

Use the calculator with intent

Paste a SKILL.md definition, a sample input, and your Anthropic API key. The page runs the skill in your browser and returns the structured extraction, token cost, and latency — useful for evaluating Claude skills before wiring them into production.

Engineers iterating on Claude skill definitions who want fast feedback without standing up a backend or burning CI minutes per change.

Interpreting Results

Validate the extracted output matches your schema (every required field present, types correct). Watch latency — skills with deep context or many tool calls easily push past p95 budgets for interactive use-cases.

Input Steps

Field by field

  1. 1

    Paste inputs

    Paste your SKILL.md definition into the editor. The schema spec must be valid JSON Schema for strict-mode validation to pass.

  2. 2

    Paste inputs

    Paste sample input that matches your skill's input schema. Use a realistic example, not a minimal one.

  3. 3

    Enter inputs

    Enter your Anthropic API key. The key stays in browser memory only — not persisted, not logged.

  4. 4

    Click

    Click Run. Watch the structured output, token cost (input + output × current pricing), and end-to-end latency.

  5. 5

    Re-run

    Re-run several times. Variance in outputs is informative — high variance suggests the prompt is under-constrained.

Common Scenarios

Use realistic starting points

Simple structured extraction skill

Skill input length

~500 tokens

Expected output

JSON with 4 fields

Output validates against the schema, latency under ~3s, cost under one cent per call. Anything else suggests the skill needs trimming.

Multi-step research skill

Skill input length

~3000 tokens

Expected output

Bulleted analysis

Latency 5–15s expected; check token cost against your per-decision budget before scaling.

Try These Tools

Run the numbers next

FAQ

Questions people ask next

The short answers readers usually want after the first pass.

Anthropic's structured-skill spec: a markdown file with a name, description, input/output schema, and worked examples. Skills bundle a small repeatable agent capability (e.g., 'extract a 10-K risk factor') in a portable format. The tester loads your SKILL.md and runs it against sample input.

Related Content

Keep the topic connected

Planning estimates only — not financial, tax, or investment advice.