Firecrawl is one example of a broader class. Anything that does structured
output, structured input, memory capture, or document interpretation can
consume a Penumbra shape — the Workbench’s
shape_workbench_export_schema
exports a shape as a provider-ready schema for Firecrawl, OpenAI, Anthropic,
or Gemini. The crawl below runs through Penumbra’s own Firecrawl integration,
so you do not need a Firecrawl key for it; to use Firecrawl in your own
pipelines, get one at firecrawl.dev.Crawl a site into a proposed shape
| Field | Description |
|---|---|
source.type | "web" to crawl a URL, or "file" to read a document. |
source.url | The site to crawl (web source). Penumbra crawls up to 100 pages. |
name | An optional name for the proposed shape. |
extractionPrompt | Optional guidance for what to focus on while extracting. |
From a file instead
The same endpoint accepts a document instead of a URL:Refine and materialize
A proposal is a starting point, not a finished shape. Open it in the Shapes Workbench to adjust types, properties, and relationships, compile and preview it, then materialize it so the runtime can use it.This endpoint authenticates with your API key and consumes credits (a crawl
spans many pages). It returns a draft shape; it does not write to your graph.