Documentation API Reference SDKs & CLI Integrations

Structured Market & Web Research

Run structured multi-source research and extract insights from the web

Collect structured facts from many web sources for market research, category research, and competitive landscape mapping. This includes aggregating lists, extracting entities, and building a dataset you can query.

Common sources

Directories (companies, tools, marketplaces)
Public listings pages (partners, agencies, vendor ecosystems)
Industry reports pages and statistics pages
Public datasets pages and data portals

What to extract

Entities: name, description, category, website, location
Pricing/positioning summaries (when publicly listed)
Metadata: tags, industries, integrations, target audience
Tables and structured lists
Links to "detail pages" for deeper extraction

Implementation notes

Start with list pages → collect detail URLs → scrape detail pages for full schema.
Use dedupe by domain + entity name normalization.
Keep provenance: every extracted entity should retain its source URL and scrape timestamp.

FAQs