Skip to content

What is archAIc

archAIc is a probabilistic search and intelligence layer for handwritten archival documents.

The product takes scanned manuscripts, produces a multi-hypothesis transcription rather than a single best guess (this is the Probabilistic Indexing, or PrIx, foundation), and exposes the result as a probabilistically searchable index plus a graph of structured entities, events, and relationships anchored to specific lines on specific pages.

The pieces:

  • PrIx generation. HTR over degraded handwriting plus a probabilistic indexing pipeline that preserves alternative readings with their confidence scores, instead of collapsing to a single best guess.
  • The search engine. Probabilistic keyword search, semantic search, and a structured-document layer (Document Logical Units) all built on top of the same PrIx data.
  • The interface surfaces. A web application, an MCP server for AI agents, and IIIF endpoints for integration with existing archival infrastructure.

archAIc is sold and run self-hosted today, packaged as a Docker bundle for the client’s own servers.

This is a placeholder concept page; more detailed concept documentation lands as the site grows.