- A persistent Catalogue — your products, suppliers, SKUs, and the relationships between them, with full attribute schemas, history, and provenance.
- Operations that run on the Catalogue — enrichment, validation, normalization, deduplication, taxonomy generation, monitoring, and sync — each producing auditable changes with confidence scores attached.
Core concepts
Catalogue
A Catalogue is a typed object — Products, Suppliers, Line Items, Variants, etc. Each object has:- Attributes — the schema (typed fields: text, number, image, date, enum, reference).
- Records — the actual rows.
- Data Sources — the upstream feeds, files, APIs, or scrapes that populate it.
- Operations — the running and scheduled transformations against it.
- Config — access, retention, write-back rules.
Product.supplier → Supplier).
Operations
Operations are the verbs of the platform. They read records, optionally call models or external systems, and write back changes. Every operation produces a run with inputs, outputs, confidence per change, and a reversible diff. Operations can be triggered manually, scheduled, or chained into pipelines.Confidence and review
Every model-driven write is scored. Each operation has a configurable confidence threshold:- Above threshold → auto-apply
- Below threshold → queue for human review in the Notifications surface
Provenance
Every value in the Catalogue carries the source of the last write — which operation, which run, which underlying data source, which model, which reviewer. Roll-back is per-field.Taxonomy
Taxonomies are first-class objects, not free-text labels. They are hierarchical, versioned, and can be generated from your data, imported from a standard (GS1, Google, internal), or hand-edited. Taxonomy assignments are themselves tracked operations with confidence scores.Similarity & Duplicate graph
Claro maintains a similarity graph across records using a combination of embedding similarity and rule-based matching. Duplicate detection, supplier match, variant grouping, and entity resolution all read from this graph.Platform layout
The product is organized into three groups in the sidebar:What changed from the previous version
If you used Claro before, the most important changes:- Catalogues are now the center of the platform. What used to be one-off datasets generated by Research Agents now sit alongside persistent catalogues with attributes, records, data sources, operations, and config.
- Operations are first-class. Every transformation — enrichment, validation, normalization, deduplication, taxonomy assignment, sync — is an auditable run with confidence scores and a reversible diff.
- The sidebar is restructured into Core (persistent state), Modules (operational stages: Onboard, Analyse, Monitor, Distribute), and Research Agents (ad-hoc tasks).
- Continuous monitoring is built in. Price, competitor, and validation operations can run on a schedule and alert into Notifications.
- Confidence and review are explicit. Every operation has thresholds; uncertain changes queue for human approval rather than failing silently or shipping unreviewed.