Data Schemas
What is a data schema?
Every workflow at Kadoa needs a data schema. It tells Kadoa:
- what entity to look for in source data (such as products, jobs, news)
- what fields to extract for the selected entity (such as title, price, description)
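As a rough mental model, a schema pairs one entity with a list of fields. The sketch below is illustrative only: `DataSchema` and `SchemaField` are hypothetical names chosen for this example and are not part of Kadoa's API.

```python
from dataclasses import dataclass, field


@dataclass
class SchemaField:
    """One field to extract (hypothetical structure, for illustration)."""
    name: str          # e.g. "title"
    description: str   # what the field holds
    example: str = ""  # optional sample value


@dataclass
class DataSchema:
    """An entity plus the fields to extract for it (hypothetical structure)."""
    entity: str                                   # e.g. "products", "jobs", "news"
    fields: list[SchemaField] = field(default_factory=list)

    def field_names(self) -> list[str]:
        # Names of all fields, in the order they were defined.
        return [f.name for f in self.fields]


product_schema = DataSchema(
    entity="products",
    fields=[
        SchemaField("title", "Name of the product"),
        SchemaField("price", "Price of the product", "999.99"),
        SchemaField("description", "Full product description"),
    ],
)

print(product_schema.entity, product_schema.field_names())
```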
How do I manage my data schemas?
Kadoa offers two ways to create and manage data schemas:
1. Create a custom schema from scratch
- Create a new custom workflow
- Configure the workflow to extract the desired data
- Click the “…” menu of the workflow on the dashboard and select “Save as schema”. Note: This option is currently not available for workflows that already use a pre-defined schema.
- Give your schema a name and description
- Your custom schema will be available on the data schemas page
2. Customize a pre-defined schema
- Navigate to the data schemas page on your dashboard
- Click “Add new data schema”
- Choose a pre-defined schema as a starting point, based on your use case (e.g., products, jobs, news)
- Customize the schema by adding, removing, or modifying fields
- Save your customized schema
- Create a new workflow and select your customized schema
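Conceptually, customizing a pre-defined schema means deriving a new set of fields from a base set. The following is a minimal sketch of that idea, assuming a schema can be modeled as a mapping from field name to description. The function `customize_schema` and the `availability` field are hypothetical and do not reflect how Kadoa stores schemas internally.

```python
def customize_schema(base, add=None, remove=(), modify=None):
    """Return a new field mapping derived from `base` (name -> description).

    The base mapping is left untouched, so the pre-defined schema
    stays available as a starting point for other customizations.
    """
    removed = set(remove)
    result = {name: desc for name, desc in base.items() if name not in removed}

    # Modify descriptions of fields that already exist.
    for name, desc in (modify or {}).items():
        if name not in result:
            raise KeyError(f"cannot modify unknown field: {name}")
        result[name] = desc

    # Add new fields, refusing to silently overwrite existing ones.
    for name, desc in (add or {}).items():
        if name in result:
            raise ValueError(f"field already exists: {name}")
        result[name] = desc

    return result


# A few fields taken from the ecommerce schema table.
ecommerce_base = {
    "name": "Name of the product",
    "brand": "Brand name of the product",
    "price": "Price of the product",
    "reviews": "Number of reviews for the product",
}

custom = customize_schema(
    ecommerce_base,
    add={"availability": "Whether the product is in stock"},  # hypothetical field
    remove=["reviews"],
    modify={"price": "Price of the product, excluding tax"},
)

print(list(custom))
```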
Using your schemas
When you create a new workflow, you can choose to use one of your custom or customized schemas.
Pre-defined data schemas
Kadoa offers pre-defined schemas for common use cases. These schemas come with a set of standard fields that Kadoa will extract, but you can still customize them based on your specific needs.
Currently available data schemas:
- Job postings
- Ecommerce
- News article
More schemas are coming soon.
Job postings schema
We offer a dedicated API endpoint optimized for fetching structured job posting data. When you create a new workflow using this schema, Kadoa will extract the following common fields:
Ecommerce schema
When you create a new workflow using this schema, Kadoa will extract the following common fields:
Name | Description | Example |
---|---|---|
link | URL link to the product page | https://example.com/product/samsung-galaxy-s23 |
name | Name of the product | Samsung Galaxy S23 256GB |
brand | Brand name of the product | Samsung |
price | Price of the product | 999.99 |
priceCurrency | Currency code for the product price | USD |
specs | Technical specifications of the product | {"Processor": "Snapdragon 8 Gen 1", "RAM": "8GB"} |
description | Full product description | The latest Samsung Galaxy S23 comes with a Snapdragon 8 Gen 1 processor, 8GB RAM, and 256GB of storage. |
sku | Stock Keeping Unit identifier for the product | SGS23-256GB-BLK |
gtin | Global Trade Item Number for the product | 00012345678905 |
ean | European Article Number for the product | 1234567890123 |
images | Array of image URLs of the product | ["https://example.com/images/product/samsung-galaxy-s23-front.jpg", "https://example.com/images/product/samsung-galaxy-s23-back.jpg"] |
reviews | Number of reviews for the product | 102 |
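
To make the table concrete, here is a sketch of one extracted record and a small check that it contains every field listed above. The value types are assumptions inferred from the example column (for instance, `gtin` and `ean` are treated as strings so leading zeros survive). The actual output format is not specified here, and `check_record` is a hypothetical helper.

```python
# Assumed Python types per field, inferred from the example column above.
ECOMMERCE_FIELD_TYPES = {
    "link": str,
    "name": str,
    "brand": str,
    "price": (int, float),
    "priceCurrency": str,
    "specs": dict,
    "description": str,
    "sku": str,
    "gtin": str,
    "ean": str,
    "images": list,
    "reviews": int,
}


def check_record(record):
    """Return a list of problems; an empty list means the record matches."""
    problems = []
    for name, expected in ECOMMERCE_FIELD_TYPES.items():
        if name not in record:
            problems.append(f"missing field: {name}")
        elif not isinstance(record[name], expected):
            problems.append(f"{name}: unexpected type {type(record[name]).__name__}")
    return problems


# Sample record built from the example values in the table.
sample = {
    "link": "https://example.com/product/samsung-galaxy-s23",
    "name": "Samsung Galaxy S23 256GB",
    "brand": "Samsung",
    "price": 999.99,
    "priceCurrency": "USD",
    "specs": {"Processor": "Snapdragon 8 Gen 1", "RAM": "8GB"},
    "description": "The latest Samsung Galaxy S23 comes with a Snapdragon 8 Gen 1 "
                   "processor, 8GB RAM, and 256GB of storage.",
    "sku": "SGS23-256GB-BLK",
    "gtin": "00012345678905",
    "ean": "1234567890123",
    "images": [
        "https://example.com/images/product/samsung-galaxy-s23-front.jpg",
        "https://example.com/images/product/samsung-galaxy-s23-back.jpg",
    ],
    "reviews": 102,
}

print(check_record(sample))
```

A check like this can be handy after customizing a schema: update the type mapping to match your fields and run it over a few extracted records before relying on the data downstream.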
News article schema
When you create a new workflow using this schema, Kadoa will extract the following common fields: