Messy scraped URLs → usable company rows

Turn messy scraped business URLs into
usable company rows

Paste URLs from Google Maps, Outscraper, Apify, directories, or CSVs. SiteEnrich cleans and classifies each business URL, checks if the site is usable, and returns usable / review / skip with reasons before rows hit Clay, Apollo, or outreach.

"Common pattern from enrichment workflows: bad scraped inputs often get blamed on Clay, Apollo, or the enrichment step — but the real issue is garbage going in."

$ curl api.siteenrich.io/analyze?url=messy-business-url.com \
-H "X-API-Key: your-key"
✓ 200 OK — 387ms
Or try it live below — no signup needed
Live Demo

Test a scraped business URL

Paste a URL from a scraped lead list and see how SiteEnrich cleans, classifies, and routes it before enrichment or outreach.

Workflow-ready decision

Know which rows should move forward.

Every response returns a cleaned URL, source type, resolver flag, website signals when available, and a usable / review / skip decision with reasons. Bad URLs do not break your workflow — they get routed.

response.json200 OK — 387ms
{
"inputUrl": "https://sample-business.test/%3Futm_source%3Dgoogle%26utm_medium%3Dorganic",
"cleanedUrl": "https://sample-business.test",
"sourceType": "tracking_url",
"needsResolver": false,
"scoringProfile": "local_service",
"domain": "sample-business.test",
"companyName": "Sample Business",
"emails": [
"info@sample-business.test"
],
"socials": [
],
"signals": {
"hasContactPage": true,
"hasAboutPage": true,
"hasPhone": true,
"hasQuoteCta": true,
"hasServiceKeywords": true,
},
"preEnrichment": {
"status": "usable",
"score": 90,
"reasons": [
"Tracking URL cleaned",
"Real business website detected",
"Contact page found",
"Phone number found",
"Quote/contact CTA found"
],
"warnings": [
],
},
"error": null,
"errorMessage": null,
}
sourceType

Classifies the scraped URL as a real domain, tracking URL, hosted subdomain, directory/profile URL, dead site, or other source type.

needsResolver

Flags rows that should not go directly into enrichment because the real business website may need to be resolved first.

preEnrichment.status

A workflow-ready decision: usable, review, or skip before the row hits Clay, Apollo, Prospeo, CRM import, or outreach.

preEnrichment.reasons

Human-readable reasons explaining why the row was marked usable, review, or skip.

preEnrichment.warnings

Warnings for blocked sites, dead sites, thin signals, missing emails, directory URLs, or other issues.

signals

When the site is reachable, SiteEnrich can return contact/about pages, phone/email/social signals, quote CTAs, and other local-business indicators.

Safe for automation workflows
directory_profile
Needs resolver
dead_or_unreachable
Bad site
hosted_subdomain
Review row
tracking_url
Cleanable URL

Expected messy inputs return structured fields, so n8n, Make, Zapier, Google Sheets, or custom workflows can route rows without stopping the run.

How it works

From messy URL to usable row.

01

Paste scraped URLs

Send URLs from Google Maps, Outscraper, Apify, directories, CSVs, or n8n workflows. SiteEnrich is built for messy business URLs, not just perfect domains.

GET /analyze?url=business.localsearch.com
02

We clean and classify

SiteEnrich cleans tracking URLs, detects hosted subdomains, flags directory/profile URLs, checks if the site is live, and looks for basic business signals.

sourceType: directory_profile
03

Route the row

Get usable / review / skip with reasons. Send good rows to Clay, Apollo, Prospeo, CRM import, or outreach. Hold back weak rows before they waste credits.

status: review

Fits into

n8nn8n
ZapierZapier
MakeMake
PipedreamPipedream
REST API
Pricing

Beta pricing for scraped URL QA

Start with a free 20-URL test. No credit card needed.

Built for messy scraped business URLs before enrichment or outreach.

Free Test
$0
20 scraped URLs

For testing SiteEnrich on a real scraped list sample.

  • Usable / review / skip
  • Cleaned URL
  • Source type classification
  • Needs resolver flag
  • Basic website signals
  • Reasons and warnings
Run 20 sample URLs

No credit card needed

Best first paid test
Starter QA Run
$49one-time
Up to 1,000 URLs

For cleaning and checking one scraped CSV or Google Sheet.

  • Everything in Free Test
  • CSV or Google Sheet output
  • Directory/profile URL detection
  • Dead or unreachable site detection
  • Hosted subdomain detection
  • Basic email/social/signal extraction
Run full CSV for $49

One-time CSV run

Operator
$199/month
Up to 10,000 URLs/month

For recurring lead-list QA before enrichment or outreach.

  • Everything in Starter QA Run
  • API access
  • n8n / Google Sheets setup help
  • Workflow-ready JSON
  • Priority beta support
  • Early access to batch processing
Get beta access

Beta access by request

Common questions
What does SiteEnrich do?
SiteEnrich turns messy scraped business URLs into usable company rows before enrichment or outreach. It cleans and classifies each URL, checks whether the site is usable, and returns usable / review / skip with reasons.
Is this just URL cleanup?
No. Basic URL cleanup removes tracking parameters. SiteEnrich also detects directory/profile URLs, hosted subdomains, dead or unreachable sites, weak sites, and useful website signals so your workflow can decide what should continue downstream.
Do I need an API to use it?
Not at first. For beta users, we can return a CSV or Google Sheet. The API is available when you want to plug SiteEnrich into n8n, Google Sheets, Make, Zapier, or your own workflow.
What happens in the free test?
You get access to run 20 scraped business URLs and see the output: cleaned URL, source type, needs resolver flag, usable / review / skip, signals, reasons, and warnings.
Who is this for?
SiteEnrich is for people working with scraped local or SMB lead lists from Google Maps, Outscraper, Apify, Leadswift, directories, CSVs, or n8n workflows before sending rows into Clay, Apollo, Prospeo, CRM import, or outreach.
Does it work on all websites?
No. Some sites are blocked, dead, slow, or protected. Those failures return structured statuses and warnings so your workflow can route them instead of breaking.
Beta Access

Run 20 scraped URLs for free

Send a small sample from a real lead list. SiteEnrich will return usable / review / skip with cleaned URLs, source types, signals, reasons, and warnings before rows hit enrichment or outreach.

No credit card. Includes API access for testing 20 scraped business URLs.