What is a schema?

Every workflow at Kadoa needs a schema. It tells Kadoa:

  • what entity to look for in source data (such as products, jobs, news)
  • what fields to extract for the selected entity (such as title, price, description)
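
As a rough mental model, a schema can be pictured as an entity name plus a list of fields. The sketch below is only an illustration in Python: the `Schema` and `Field` classes and the `data_type` labels are hypothetical names chosen for this example and are not Kadoa's actual schema format or API.

```python
from dataclasses import dataclass, field


@dataclass
class Field:
    """One field to extract (hypothetical structure, for illustration only)."""
    name: str
    description: str
    data_type: str = "string"  # assumed type label, not an official Kadoa value


@dataclass
class Schema:
    """An entity plus the fields to extract for it (hypothetical structure)."""
    entity: str
    fields: list[Field] = field(default_factory=list)

    def field_names(self) -> list[str]:
        """Return the field names in the order they were declared."""
        return [f.name for f in self.fields]


# Entity and field names taken from the examples in the list above.
product_schema = Schema(
    entity="products",
    fields=[
        Field("title", "Name of the product"),
        Field("price", "Price of the product", data_type="number"),
        Field("description", "Full product description"),
    ],
)

print(product_schema.entity, product_schema.field_names())
```

Keeping the entity and its fields together in one object mirrors the idea of a template: the same definition can be handed to many workflows without being rewritten.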

What if you need the same schema for many workflows? The answer is templates.

How to create templates

Kadoa offers two ways to create and manage templates:

1. Create a template from scratch

  • Create a new workflow using Auto-detect
  • Configure the workflow schema
  • Go to the My Templates tab on your dashboard and click ‘Add New Template’. You can now reuse the schema from an existing workflow. Note: This option is currently not available for workflows that already use a pre-defined template.
  • Give your template a name and description
  • That’s it! You can now create workflows with this template, and the data will be automatically extracted and transformed into your desired format, regardless of the source.
  • Note: you can also do this via the Dashboard or the workflow page.

2. Customize a pre-defined template

  • Navigate to the My Templates page on your dashboard
  • Click “Add new template”
  • Choose a pre-defined template as a starting point, based on your use case (e.g., products, jobs, news)
  • Customize the template by adding, removing, or modifying fields
  • Save your customized template

Using your templates

When you create a new workflow, you can choose to use one of your templates.

Ready-made templates

Kadoa provides ready-made templates for popular web sources. These templates come with a set of standard fields that Kadoa will extract, but you can still customize them based on your specific needs.

There are also generic templates, not limited to any specific source. The currently available ones are described in the sections that follow, with more to come soon!

Job postings

We offer a dedicated API endpoint optimized for fetching structured job posting data. When you create a new workflow using this template, Kadoa will extract the following common fields:

| Name | Description | Example |
| --- | --- | --- |
| description | Job description including roles, responsibilities, and company overview | Responsible for developing and maintaining web applications… |
| jobTitle | The title of the job being posted | Senior Software Engineer |
| datePosted | The date when the job was posted, formatted as YYYY:MM:DD:hh:mm:ss | 2023:11:20:10:45:00 |
| applyUrl | URL where applicants can apply for the job | https://example.com/apply |
| url | Direct URL to the job posting | https://example.com/job-posting |
| jobLocation | Structured location data of the job including city, country code, and postal code | {"@type": "Place", "address": {"@type": "PostalAddress", "addressLocality": "San Francisco", "addressCountry": "US", "postalCode": "94103"}} |
| baseSalary | Salary range for the job, including currency and timeframe | {"@type": "MonetaryAmount", "currency": "USD", "value": {"minValue": 100000, "maxValue": 120000, "unitText": "ANNUALLY", "@type": "QuantitativeValue"}} |
| workHours | Typical working hours for the job | 9am to 5pm, Monday to Friday |
| jobBenefits | Array of benefits offered by the company | ["Health insurance", "Retirement plan"] |
| qualifications | Array of required qualifications other than work experience | ["Bachelor's degree in Computer Science", "Strong problem-solving skills"] |
| experienceRequirements | Array of work experience requirements | ["At least 5 years of experience in software development"] |
| recruiterEmail | Email address of the recruiter or hiring manager | recruiter@example.com |
| occupationalCategory | The job category as per a predefined classification | Software and Web Developers |
| applicationDeadline | The deadline for job application submissions, formatted as YYYY:MM:DD:hh:mm:ss | 2024:01:15:23:59:59 |
| language | The primary language of the job posting | English |
| logo | URL to the company's logo | https://example.com/logo.png |
| employmentType | Type of employment offered in the job posting (e.g., FULL_TIME, PART_TIME) | FULL_TIME |
| id | Unique identifier for the job posting | abc123xyz |
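
The colon-separated date format and the nested `jobLocation` object usually need a little handling before the data can be used downstream. The following is a minimal sketch of one way to do that in Python, using the example values from the table. The helper names `parse_timestamp` and `city_of` are hypothetical, and it is an assumption that `jobLocation` may arrive either as a JSON string or as an already-parsed object.

```python
import json
from datetime import datetime
from typing import Optional

# strptime pattern for the YYYY:MM:DD:hh:mm:ss format described in the table.
DATE_FORMAT = "%Y:%m:%d:%H:%M:%S"


def parse_timestamp(value: str) -> datetime:
    """Convert a colon-separated timestamp into a datetime object."""
    return datetime.strptime(value, DATE_FORMAT)


def city_of(job_location) -> Optional[str]:
    """Return the city from a jobLocation value, or None if it is absent.

    Assumption: the value is either a dict or a JSON-encoded string.
    """
    if isinstance(job_location, str):
        job_location = json.loads(job_location)
    return job_location.get("address", {}).get("addressLocality")


# Sample record built from the example column above.
record = {
    "jobTitle": "Senior Software Engineer",
    "datePosted": "2023:11:20:10:45:00",
    "applicationDeadline": "2024:01:15:23:59:59",
    "jobLocation": {
        "@type": "Place",
        "address": {
            "@type": "PostalAddress",
            "addressLocality": "San Francisco",
            "addressCountry": "US",
            "postalCode": "94103",
        },
    },
}

posted = parse_timestamp(record["datePosted"])
deadline = parse_timestamp(record["applicationDeadline"])
days_open = (deadline - posted).days  # whole days between posting and deadline
city = city_of(record["jobLocation"])

print(posted.isoformat(), days_open, city)
```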

Ecommerce

When you create a new workflow using this template, Kadoa will extract the following common fields:

| Name | Description | Example |
| --- | --- | --- |
| link | URL link to the product page | https://example.com/product/samsung-galaxy-s23 |
| name | Name of the product | Samsung Galaxy S23 256GB |
| brand | Brand name of the product | Samsung |
| price | Price of the product | 999.99 |
| priceCurrency | Currency code for the product price | USD |
| specs | Technical specifications of the product | {"Processor": "Snapdragon 8 Gen 1", "RAM": "8GB"} |
| description | Full product description | The latest Samsung Galaxy S23 comes with a Snapdragon 8 Gen 1 processor, 8GB RAM, and a 256GB storage. |
| sku | Stock Keeping Unit identifier for the product | SGS23-256GB-BLK |
| gtin | Global Trade Item Number for the product | 00012345678905 |
| ean | European Article Number for the product | 1234567890123 |
| images | Array of image URLs of the product | ["https://example.com/images/product/samsung-galaxy-s23-front.jpg", "https://example.com/images/product/samsung-galaxy-s23-back.jpg"] |
| reviews | Number of reviews for the product | 102 |
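
When extracted product records feed into pricing or analytics code, it can help to convert them into typed values first. The sketch below shows one possible approach in Python, using the example values from the table. The `Product` class and the `to_product` helper are hypothetical, and it is an assumption that `price` and `reviews` may arrive as either numbers or strings.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class Product:
    """Typed subset of the ecommerce fields (hypothetical helper class)."""
    name: str
    brand: str
    price: Decimal
    price_currency: str
    reviews: int


def to_product(record: dict) -> Product:
    """Build a Product from an extracted record.

    Going through str() before Decimal avoids binary floating-point
    artifacts when the price arrives as a float.
    """
    return Product(
        name=record["name"],
        brand=record["brand"],
        price=Decimal(str(record["price"])),
        price_currency=record["priceCurrency"],
        reviews=int(record["reviews"]),
    )


# Sample record built from the example column above.
record = {
    "name": "Samsung Galaxy S23 256GB",
    "brand": "Samsung",
    "price": 999.99,
    "priceCurrency": "USD",
    "reviews": 102,
}

product = to_product(record)
label = f"{product.name}: {product.price} {product.price_currency}"
print(label)
```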

News article

When you create a new workflow using this template, Kadoa will extract the following common fields:

| Name | Description | Example |
| --- | --- | --- |
| title | The title of the news article | Reading news improves cognitive function |
| fullText | Full text of the article | This is a full article about something interesting - hopefully |
| author | Author of the news article | John Doe |
| datePosted | The date when the news article was posted | 2023:11:20:10:45:00 |
| readTime | Expected reading time for this article | 24min |
| language | The primary language of the article | EN |
| dateScraped | The date when the news article was extracted | 2023:11:20:10:45:00 |
| url | Direct URL to the news article | https://example.com/news |
| scraperName | Name of the scraper from the dashboard | News Site Name |
| id | Unique identifier for the news posting | abc123xyz |
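
The `readTime` value in the example is a string such as `24min`, so sorting or filtering by reading time requires turning it into a number. The sketch below is one possible way to do that in Python. The helper name `read_time_minutes` is hypothetical, and it assumes the value always has the form of digits followed by `min`, which the table's single example suggests but does not guarantee.

```python
import re
from typing import Optional


def read_time_minutes(value: str) -> Optional[int]:
    """Extract the number of minutes from a value like '24min'.

    Returns None when the value does not match the assumed
    '<digits>min' pattern, so callers can decide how to handle it.
    """
    match = re.fullmatch(r"\s*(\d+)\s*min\s*", value)
    return int(match.group(1)) if match else None


# Sample records; the first uses the example values from the table above.
articles = [
    {"title": "Reading news improves cognitive function", "readTime": "24min"},
    {"title": "A shorter piece", "readTime": "3min"},
]

# Sort articles from shortest to longest reading time.
by_length = sorted(articles, key=lambda a: read_time_minutes(a["readTime"]) or 0)
print([a["title"] for a in by_length])
```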