Changelog

All notable changes to Attest are documented here. Versions follow Semantic Versioning.

v0.7.0 — 2026-02-27

Production Hardening — test coverage, TS CLI, documentation

Features

Go engine test coverage — Integration tests for evaluate_batch, submit_plugin_result, and shutdown JSON-RPC methods. Coverage for concurrent request handling, schema compiler cache, and trace validation.
TypeScript CLI — npx attest init scaffolds a vitest-based test project with @attest-ai/core and @attest-ai/vitest pre-configured.
TypeScript examples — Four ported examples (quickstart, openai-adapter, schema-assertions, content-assertions) in the attest-examples repo.
Documentation site — Updated changelog, configuration reference, migration guide (v0.5 to v0.6), and SDK reference pages covering all v0.5–v0.7 features.

Testing

Engine: integration tests for all JSON-RPC methods, concurrent request handling, error paths
Python SDK: ExpectChain.plugin(), aggregate_latency_under(), all_tools_called() assertion tests
TypeScript SDK: plugin system, continuous eval runner, branded types, discriminated union specs

Upgrade

# Python
uv add attest-ai@latest

# TypeScript
pnpm add @attest-ai/core@latest @attest-ai/vitest@latest

v0.6.1 — 2026-02-26

TypeScript Parity — full feature alignment with Python SDK

Features

Discriminated union specs — Step type uses a kind discriminant (llm_call | tool_call | retrieval | agent_call) for exhaustive switch handling without type guards.
Branded types — TraceId, AssertionId, AgentId newtypes prevent string mixing at compile time.
Plugin system (TS) — PluginRegistry and AttestPlugin interface for registering custom assertion plugins in TypeScript. Matches the Python attest.plugins entry point API.
Continuous eval (TS) — ContinuousEvalRunner, Sampler, and AlertDispatcher ported from Python. Supports sampling strategies and alerting via webhooks.
LangChain.js adapter — @attest-ai/core/adapters/langchain captures traces from LangChain.js agents with automatic callback instrumentation.
TraceAdapter type safety — Adapter traceFromResponse() return type is Trace (not unknown), enabling end-to-end type inference.
CJS dual output — tsup-based build produces both ESM and CommonJS bundles. package.json exports map resolves the correct format automatically.
Python TraceTree analytics — TraceTree.summary() returns aggregate metrics (total cost, tokens, latency, agent count, max depth) across the full delegation tree.

Breaking Changes

Adapter API rename — capture() is replaced by traceFromResponse() on all TypeScript adapters. The old name is removed (no deprecation shim).
CJS consumers — If you previously used a bundler workaround for ESM-only @attest-ai/core, remove it. The package now ships dual ESM/CJS.

Upgrade

pnpm add @attest-ai/core@latest @attest-ai/vitest@latest

v0.6.0 — 2026-02-25

Performance — engine-only optimizations

Features

Schema compiler cache — JSON Schema compilation results are cached per-schema hash. Eliminates recompilation on repeated evaluate_batch calls with the same schema.
Trace validation optimization — Trace validation short-circuits on first error in non-verbose mode, reducing per-evaluation overhead.
SQL query optimizations — History store queries use covering indexes for list_results and drift_query. Batch inserts use prepared statements.
Prepared statement pooling — Frequently-used SQL statements are prepared once and reused across evaluations.
Result pagination — list_results supports limit/offset for large history stores.
segmentio/encoding — JSON codec switched from encoding/json to segmentio/encoding/json for ~2x faster marshal/unmarshal on evaluation payloads.

Notes

Engine-only release. No SDK changes. Update the engine binary via ATTEST_ENGINE_PATH or let auto-download fetch the new version.

v0.5.1 — 2026-02-24

Robustness — timeouts, bounded resources, new assertions

Features

Budget tracking — BudgetTracker enforces per-evaluation cost limits. Set ATTEST_BUDGET_MAX_COST to cap total spend across judge and embedding assertions.
Concurrent request support — Engine handles multiple evaluate_batch requests concurrently with per-request isolation.
Configurable judge cache — Judge response cache size is configurable via ATTEST_JUDGE_CACHE_MAX_MB (default: 100 MB). Cache eviction uses LRU.
History retention policy — ATTEST_HISTORY_MAX_ROWS and ATTEST_HISTORY_MAX_AGE_DAYS control automatic cleanup of old evaluation results.
Engine read timeout — ATTEST_ENGINE_TIMEOUT (default: 30s) prevents SDK hangs when the engine process stalls.
Bounded continuous eval queue — ATTEST_CONTINUOUS_QUEUE_SIZE (default: 1000) caps the evaluation queue. Overflow uses backpressure instead of unbounded growth.
ExpectChain.plugin() — Chain custom plugin assertions alongside built-in ones: expect(result).output_contains("ok").plugin("my_plugin", config).
Simulation mode (TS) — ATTEST_SIMULATION=1 works in the TypeScript SDK, returning deterministic mock results without an engine process.

Fixes

Engine read timeout — SDK no longer hangs indefinitely if the engine process crashes or stalls mid-response.
History store cleanup — Unbounded row growth in SQLite history is now capped by retention policy.

Environment Variables

Variable	Purpose	Default
`ATTEST_BUDGET_MAX_COST`	Maximum USD spend per evaluation	unset (unlimited)
`ATTEST_JUDGE_CACHE_MAX_MB`	Judge response LRU cache size	`100`
`ATTEST_HISTORY_MAX_ROWS`	Maximum rows in history store	`10000`
`ATTEST_HISTORY_MAX_AGE_DAYS`	Auto-delete results older than N days	`90`
`ATTEST_ENGINE_TIMEOUT`	Engine response timeout (seconds)	`30`
`ATTEST_CONTINUOUS_QUEUE_SIZE`	Continuous eval queue capacity	`1000`

Upgrade

# Python
uv add attest-ai@latest

# TypeScript
pnpm add @attest-ai/core@latest

v0.5.0 — 2026-02-23

Correctness & Safety — adapter fixes, engine hardening

Engine Fixes

submit_plugin_result — Implement the previously-stubbed submit_plugin_result JSON-RPC method. Plugin evaluations now round-trip correctly through the engine.
Trace ID validation — Engine rejects traces with missing or malformed trace_id fields instead of silently accepting them.
Step type validation — Unknown step types (kind field) return a typed error instead of being silently dropped.
Assertion ID uniqueness — Engine enforces unique assertion IDs within a batch. Duplicate IDs return an error.
Error response codes — All engine errors use spec-compliant JSON-RPC error codes (-32600 to -32603).
Shutdown draining — shutdown waits for in-flight evaluations to complete (5s timeout) before exiting.

Adapter Fixes

OpenAI — Tool call arguments are parsed from JSON string to object. Previously, function.arguments was passed as a raw string, causing schema assertions on tool args to fail.
Ollama — Empty tool call arrays are normalized to undefined instead of []. Prevents phantom “0 tool calls” in trace summaries.
Gemini — Token count extraction reads usage_metadata.total_token_count (was missing, reported as 0).
LangChain — Callback handler implements full BaseCallbackHandler protocol including ignore_chat_model and ignore_retriever.
Anthropic — System prompt is captured as a separate step when present.

TypeScript Fixes

delegate() — parent_trace_id is now set correctly on child traces, fixing broken TraceTree traversal for multi-agent scenarios.

Testing

Adapter integration tests — Each adapter has a dedicated test suite verifying trace capture, token counting, and tool call extraction.

Upgrade

# Python
uv add attest-ai@latest

# TypeScript
pnpm add @attest-ai/core@latest

v0.4.2 — 2026-02-22

SDK patch release — adapter fixes and async compatibility

Fixes

LangChain adapter — Add missing callback protocol attributes (ignore_agent, ignore_retry, raise_error) required by LangChain’s BaseCallbackHandler interface. Handle LangGraph AIMessage and ToolMessage output formats so traces capture tool-call responses correctly.
expect() DSL — Accept Trace directly in addition to AgentResult. Auto-wraps into AgentResult for manual adapter workflows that build traces via TraceBuilder without going through a provider adapter.
Plugin fixture — Run the engine event loop in a background daemon thread with run_coroutine_threadsafe() bridge. Fixes Future attached to a different loop errors when pytest-asyncio tests (e.g., google-adk) call into the engine from a separate event loop.

Upgrade

uv add attest-ai@latest

Notes

No engine changes. The Go engine binary remains at v0.4.0. ENGINE_VERSION is unchanged; auto-download continues to fetch v0.4.0 binaries.

v0.4.1 — 2026-02-21

SDK patch release — engine auto-download

Features

Engine auto-download — Both Python and TypeScript SDKs now automatically download the attest-engine binary from GitHub Releases on first use. No manual binary setup required after uv add attest-ai or pnpm add @attest-ai/core.
SHA256 verification — Downloaded binaries are verified against checksums-sha256.txt from the release. Checksum mismatch aborts the download with a clear error.
Version-pinned cache — Binaries are cached at ~/.attest/bin/ with a .engine-version marker. SDK version mismatch triggers automatic re-download.

Discovery chain — Engine binary resolution follows a predictable order:

ATTEST_ENGINE_PATH env var
  → PATH lookup
  → ~/.attest/bin/ (shared cache, version-checked)
  → ../../bin/ (monorepo dev layout)
  → ./bin/ (local)
  → auto-download from GitHub Releases
  → actionable error message

Opt-out — Set ATTEST_ENGINE_NO_DOWNLOAD=1 to disable network access. The error message explains alternative installation methods.

Fixes

pytest plugin — pytest.skip() replaced with pytest.fail() when the engine binary is missing. With auto-download in place, silent skipping is no longer appropriate; real errors are now surfaced.
TypeScript VERSION — Corrected from 0.3.0 to 0.4.1.

Other

Examples moved — All example projects relocated to attest-framework/attest-examples for independent versioning. A redirect README remains in the main repo.

Install / Upgrade

Python

uv add attest-ai

TypeScript

pnpm add @attest-ai/core

Environment Variables

Variable	Purpose	Default
`ATTEST_ENGINE_PATH`	Absolute path to engine binary — skips all discovery	unset
`ATTEST_ENGINE_NO_DOWNLOAD`	`1` / `true` / `yes` disables auto-download	unset (enabled)

v0.4.0 — 2026-02-20

Production & Polish

Result history with SQLite storage
Drift detection (σ-based statistical thresholds)
Continuous eval runner with sampling and alerting
Plugin system (attest.plugins entry point group)
CrewAI adapter (11 adapters total)
CLI init and validate commands
MkDocs documentation site

v0.3.0 — 2026-02-20

Simulation & Multi-Agent

Layers 7-8: simulation runtime, multi-agent testing
TypeScript SDK (first npm publish: @attest-ai/core, @attest-ai/vitest)
Framework adapters: LangChain, Google ADK, LlamaIndex

v0.2.0 — 2026-02-19

Semantic & Judge Layers

Layers 5-6: ONNX local embeddings, LLM-as-judge
Soft failure support
OTel adapter
setup-attest GitHub Action

v0.1.0

Foundation

Layers 1-4: schema validation, cost/performance, trace structure, content validation
Python SDK with pytest plugin
4 provider adapters: OpenAI, Anthropic, Gemini, Ollama
PyPI + GitHub release