Skip to main content
Stoneturner sits between your external data sources and your agents. It pulls raw data in, distills it into structured markdown, vectorizes it, and serves it back over MCP.

The end-to-end flow

1

Connect an integration

Provide API credentials via the web UI or API. Each integration declares how it authenticates — API key, basic token, or OAuth.
2

Sync

Stoneturner fetches data, parses it into markdown artifacts, and indexes them into vector tables.
3

Search

Agents query the MCP server using semantic_search and other tools to find relevant context.

The sync pipeline

Every integration follows the same three-stage pipeline:
sync-data (parallel fetches) → parse (LLM-extracted insights) → index-vector (embed + upsert)
  • sync-data — fetches raw data from the integration’s API, often via several parallel calls.
  • parse — uses an LLM to extract insights from each artifact: key points, questions answered, and entities, alongside the full markdown content.
  • index-vector — embeds the content and upserts it into the vector tables.
Each step writes syncTask rows so progress is observable from the web UI.
All network and LLM calls are wrapped in retry logic with quadratic backoff. Syncs are fire-and-forget from the HTTP handler — the request returns immediately and the pipeline runs in the background.

What gets indexed

The parse step produces three vectorized representations of each artifact, so semantic search can match on any of them:
EmbeddingWhat it captures
contentEmbeddingThe full markdown content of the artifact
keyPointsEmbeddingLLM-extracted key points
questionsAnsweredEmbeddingQuestions the artifact answers
Artifacts themselves are stored in mdArtifacts with their content, key points, questions, entities, and metadata — retrievable in full via get_md_artifact_by_id.

Serving agents over MCP

The MCP server is exposed as Streamable HTTP at /mcp. Agents call standard MCP tools to search across all indexed content, retrieve raw artifacts, run read-only SQL, list integrations, and trigger syncs.
For the full tool list and parameters, see the Overview.