Run any Skill in Manus with one click

$pwd:

testing-assistant-conversations

Name: Testing Assistant Conversations
Author: dxos

// Test assistant conversations, agents, and blueprints using AssistantTestLayer, Effect/vitest, ECHO types, and memoized LLM fixtures. Use when writing or fixing assistant-toolkit tests, blueprint.operation tests, AiSession flows, or when CI fails on missing memoized conversations.

Run Skill in Manus

$ git log --oneline --stat

stars:506

forks:44

updated:May 29, 2026 at 23:39

SKILL.md

readonly

related-skills.json

same repository

composer-plugins.md

from "dxos/dxos"

Use when working on files in packages/plugins/, adding new plugins, refactoring plugin components/containers, writing storybooks for plugins, or wiring capabilities like react-surface or operation-resolver.

2026-05-30506

land.md

from "dxos/dxos"

Land an existing PR — finds it, fixes CI failures iteratively, keeps the branch up to date with main, subscribes to PR events for continuous autofixing, and adds to merge queue. Use when the user says "/land <PR number or URL>" or asks to land/ship an existing PR. Accepts optional extra instructions after the PR reference.

2026-05-30506

composer-plugin-dev.md

from "dxos/dxos"

Author DXOS Composer plugins — primarily community plugins built in their own repo (Vite + composerPlugin, GitHub release, registered via dxos/community-plugins), with notes on how the in-repo workflow differs. Use when scaffolding a new Composer plugin, wiring capabilities (surfaces, operations, blueprints), exposing operations to AI agents, integrating external services, testing with the composer testing harness, or publishing to the community registry.

2026-05-28506

composite-components.md

from "dxos/dxos"

Use when authoring or refactoring Radix-style composite React components in `@dxos/react-ui` and sibling UI packages — namespaced primitives like `Foo.Root` / `Foo.Trigger` / `Foo.Content` built around `forwardRef`, `Slot`, and a `tx()` theme function.

2026-05-24506

effect.md

from "dxos/dxos"

Guides working with Effect-TS in TypeScript codebases. Use when writing Effect programs, defining services/layers, handling errors, running effects, or when code uses effect, Context, Layer, Effect.gen, or related Effect patterns.

2026-05-21506

subduction-policy.md

from "dxos/dxos"

Reference for `SubductionPolicy` (the four hooks `authorizeConnect`, `authorizeFetch`, `authorizePut`, `filterAuthorizedFetch` passed via `Subduction.hydrate(..., policy)` or `new Repo({ subductionPolicy })`). Use when designing client-side access control over Subduction-replicated data, choosing which hook to deny in, or debugging why a doc did or did not replicate.

2026-05-21506

package.json

"author": "dxos"

"repository": "dxos/dxos"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software Quality Assurance Analysts and TestersComputer and Mathematical Occupations15-1253L4

name	testing-assistant-conversations
description	Test assistant conversations, agents, and blueprints using AssistantTestLayer, Effect/vitest, ECHO types, and memoized LLM fixtures. Use when writing or fixing assistant-toolkit tests, blueprint.operation tests, AiSession flows, or when CI fails on missing memoized conversations.

Testing assistant conversations, agents, and blueprints

This guide matches patterns in packages/core/assistant-toolkit and related packages (assistant, plugin-markdown, plugin-assistant). For regenerating *.conversations.json only, prefer the focused skill regenerate-memoized-llm.

AssistantTestLayer

Import from @dxos/assistant/testing.

AssistantTestLayer composes:

AI — TestAiService (memoized by default; see below), default model @anthropic/claude-opus-4-6.
Tool execution — ToolExecutionServices and OpaqueToolkit.providerLayer.
Blueprint registry — Blueprint.RegistryService seeded with optional blueprints.
Operations — operationHandlers passed to OperationHandlerSet.provide(...); ProcessManager wires Operation.Service for tool execution (see AssistantTestLayer in packages/core/assistant/src/testing/layer.ts).
ECHO test DB — TestDatabaseLayer with types you register.
Credentials — CredentialsService.configuredLayer(credentials) (often [] in tests).
Tracing — noop | console | pretty.

Use AssistantTestLayerWithTriggers when the scenario uses scheduled triggers (manual time control, in-memory trigger state). Example: packages/core/assistant-toolkit/src/blueprints/project/blueprint.test.ts.

Important options

Option	Role
`operationHandlers`	`OperationHandlerSet` (or merged sets) registered via `OperationHandlerSet.provide` so `Operation.invoke` resolves your operations.
`types`	Every ECHO entity type the test creates or queries (`Blueprint.Blueprint`, plugin types, `Message.Message`, etc.). Missing types break DB/schema expectations.
`blueprints`	Optional registry seed when code reads blueprints from `Blueprint.RegistryService` instead of only binding at runtime.
`toolkits`	Extra toolkits (e.g. `OpaqueToolkit.make(WebSearchToolkit, Layer.empty)`).
`aiServicePreset`	`'direct'` \| `'edge-local'` \| `'edge-remote'` — where real LLM calls go when generation is allowed. Use `'edge-remote'` to route LLM calls through the DXOS Edge service so no Anthropic API key is required locally.
`tracing: 'pretty'`	Useful locally to see tool traces.
`disableLlmMemoization: true`	Skips memo wrapper; use only when you fully stub `AiService` / `LanguageModel` and do not need recorded conversations.

Implementation reference: packages/core/assistant/src/testing/layer.ts.

Model memoization and `ALLOW_LLM_GENERATION`

AssistantTestLayer includes memoization internally — you do not need to set up MemoizedAiService yourself. The layer wraps the AI service with MemoizedAiService.layerTest automatically (unless disableLlmMemoization: true).

Default test AI goes through MemoizedAiService.layerTest, which:

Writes/reads <test-file>.conversations.json next to the test (path from TestContextService).
Without ALLOW_LLM_GENERATION: replays only; missing matching prompt → error telling you to regenerate.
With ALLOW_LLM_GENERATION=1 (or true): calls the real model when no match exists and updates the JSON.

CI stays deterministic because it uses committed fixtures, not live LLM calls.

Requirements for regeneration

Credentials — API keys must be in the environment. In this repo, load 1Password-injected env from the workspace root:
- fish: eval (pnpm -ws 1p-credentials)
- bash/zsh: eval "$(pnpm -ws 1p-credentials)"
The script is the 1p-credentials package script (runs op inject against .env.1password).
Run tests with generation:
```
ALLOW_LLM_GENERATION=1 moon run assistant-toolkit:test
```
Or all memoized-LLM packages: ALLOW_LLM_GENERATION=1 moon run '#memoized-llm:test'.
Commit updated *.conversations.json files.

Packages that participate are tagged memoized-llm in their moon.yml (e.g. assistant-toolkit, assistant, ai, plugin-markdown, plugin-assistant).

Timeouts

LLM conversation tests should use a longer timeout to account for generation. Pattern: { timeout: 60_000 } or MemoizedAiService.isGenerationEnabled() ? 240_000 : 30_000. Note that MemoizedAiService is only needed as an import for the timeout helper — the layer already handles memoization internally.

`TestHelpers.provideTestContext`

Effects that use memoization must end with TestHelpers.provideTestContext (from @dxos/effect/testing) so the memo layer knows the current test file path. Typical pipe:

Effect.fnUntraced(..., Effect.provide(TestLayer), TestHelpers.provideTestContext).

Using `edge-remote` to avoid local API keys

Set aiServicePreset: 'edge-remote' to route LLM calls through the DXOS Edge service instead of calling Anthropic directly. This means no local Anthropic API key is required. Works for both direct operation invocations and full conversation tests. Example: packages/core/assistant-toolkit/src/blueprints/blueprint-manager/blueprint.test.ts.

General test structure

Vitest + Effect

Use @effect/vitest (describe, it.effect, it.scoped) and Effect.fnUntraced for generator bodies.

Determinism

Many tests call EntityId.dangerouslyDisableRandomness() at module scope for stable IDs.

Database and invocation flow

yield* Database.add(...) / Obj.make(...) for fixtures.
yield* Database.flush() before invoking functions or conversations that read persisted state.
Call Operation.invoke(Operation, input) for direct operation tests, or AiSessionService.run, new AiSession, AiRequest, etc., depending on the layer under test.

Registering blueprints in tests

Two common patterns:

Registry at layer build — pass blueprints: [SomeBlueprint.make(), ...] into AssistantTestLayer when services read from the registry.
Runtime bind — addBlueprints from packages/core/assistant-toolkit/src/blueprints/testing.ts loads definition make() objects into the DB and calls AiContextService.bindContext({ blueprints: [...] }). Used with AiSessionService.layerNewFeed().pipe(Layer.provideMerge(TestLayer)) in memory blueprint tests.

You still pass the blueprint’s operations (handler set) into AssistantTestLayer({ operationHandlers: ... }) so tools actually execute.

Types list

Include every ECHO type instances may have: blueprint metadata types, domain objects (Message, Person, plugin documents), Blueprint.Blueprint, Trigger.Trigger, queues, etc. If in doubt, mirror imports from a similar test in the same blueprint folder.

Quick checklist

AssistantTestLayer (or WithTriggers) with correct operationHandlers and types.
Effect.provide(TestLayer) + TestHelpers.provideTestContext for memoized LLM tests.
New/changed prompts → regenerate with ALLOW_LLM_GENERATION=1 + 1p-credentials, commit *.conversations.json.
Package has memoized-llm tag if tests use memoization (for CI grouping).

testing-assistant-conversations

More from this repository

More from this repository

Testing assistant conversations, agents, and blueprints

AssistantTestLayer

Important options

Model memoization and ALLOW_LLM_GENERATION

Requirements for regeneration

Timeouts

TestHelpers.provideTestContext

Using edge-remote to avoid local API keys

General test structure

Vitest + Effect

Determinism

Database and invocation flow

Registering blueprints in tests

Types list

Quick checklist

Testing assistant conversations, agents, and blueprints

AssistantTestLayer

Important options

Model memoization and ALLOW_LLM_GENERATION

Requirements for regeneration

Timeouts

TestHelpers.provideTestContext

Using edge-remote to avoid local API keys

General test structure

Vitest + Effect

Determinism

Database and invocation flow

Registering blueprints in tests

Types list

Quick checklist

Model memoization and `ALLOW_LLM_GENERATION`

`TestHelpers.provideTestContext`

Using `edge-remote` to avoid local API keys

Model memoization and `ALLOW_LLM_GENERATION`

`TestHelpers.provideTestContext`

Using `edge-remote` to avoid local API keys