How to Build AI Data Pipelines with Cloudflare's New /crawl API (Step-by-Step Guide)

Build AI-ready data pipelines with Cloudflare's /crawl API. Step-by-step guide with code, pricing breakdown, and comparison to Firecrawl and Crawl4AI.


Web Data Extraction Just Got a $5/Month Upgrade

Every AI application has a data problem. Language models are only as useful as the information you feed them, and most of that information lives on websites in documentation pages, product catalogs, competitor blogs, pricing tables, and knowledge bases that were never designed for machine consumption.

Until now, extracting that data reliably meant choosing between expensive SaaS tools (Firecrawl at $47/month for 100K pages), complex self-hosted setups (Crawl4AI requiring your own infrastructure), or brittle custom scripts that break every time a site changes its layout.

On March 10, 2026, Cloudflare launched /crawl, a single API endpoint that ingests entire websites and returns clean HTML, Markdown, or structured JSON. The announcement from @CloudflareDev generated over 2 million impressions and 8,600 bookmarks in 24 hours. The value proposition is straightforward: one POST request, one job ID, and all the content from a website lands in your pipeline.

This guide walks through building production AI data pipelines with /crawl, from basic extraction to RAG ingestion, competitive monitoring, and structured data workflows.

How Cloudflare /crawl Works: The Two-Step Process

The /crawl endpoint follows an asynchronous pattern that developers will recognize from any job-based API.


Step 1: Start the Job

Send a POST request with a target URL and your parameters:

curl -X POST \
  "https://api.cloudflare.com/client/v4/accounts/{account_id}/browser-rendering/crawl" \
  -H "Authorization: Bearer {api_token}" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://docs.example.com",
    "limit": 100,
    "formats": ["markdown", "html"],
    "render": true
  }'

The API immediately returns a job ID. No waiting, no blocking.

Step 2: Fetch Results

Poll the job ID with GET requests. Results stream in as pages are processed, with cursor-based pagination for large sites:

curl "https://api.cloudflare.com/client/v4/accounts/{account_id}/browser-rendering/crawl/{job_id}" \
  -H "Authorization: Bearer {api_token}"

Each page in the response includes the URL, title, HTTP status, and content in your requested formats.

URL Discovery

The crawler automatically finds pages from three sources:

  1. The starting URL itself

  2. The site's XML sitemap

  3. Links discovered on each page during the job

It respects robots.txt by default and identifies itself as a bot, a point Cloudflare Product Manager Kathy Liao emphasized when addressing community concerns about the tool's ethics.

Key Parameters That Shape Your Pipeline

Understanding /crawl's parameters is the difference between a useful pipeline and an expensive mess.

render: true vs render: false

This single parameter changes everything about cost and capability:

| Parameter | What Happens | Best For | Cost |
| --- | --- | --- | --- |
| render: true | Full headless Chrome execution | SPAs (React, Vue, Angular), JS-rendered content | Browser time billed |
| render: false | Simple HTTP fetch, no JS | Static sites, docs, blogs, server-rendered HTML | Free during beta |

The render: false mode is the standout feature for pipeline builders. During the beta period, it costs nothing and performs a lightweight HTTP fetch. For documentation sites, static blogs, and server-rendered pages, this is all you need.

Filtering with includePatterns and excludePatterns

Control exactly which pages enter your pipeline:

{
  "url": "https://docs.example.com",
  "includePatterns": ["/api/**", "/guides/**"],
  "excludePatterns": ["/api/legacy/**", "/changelog/**"]
}

Wildcards use * (single segment) and ** (all segments). Exclude rules always override include rules.
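The wildcard semantics can be illustrated with a small local matcher. This is a re-implementation for reasoning about your patterns, not Cloudflare's actual matching code, and edge cases (such as whether `/api/**` matches `/api` itself) are assumptions:

```python
import re

def _to_regex(pattern: str) -> re.Pattern:
    """Translate a crawl pattern: ** crosses path segments, * stays within one."""
    out, i = [], 0
    while i < len(pattern):
        if pattern.startswith("**", i):
            out.append(".*")
            i += 2
        elif pattern[i] == "*":
            out.append("[^/]*")
            i += 1
        else:
            out.append(re.escape(pattern[i]))
            i += 1
    return re.compile("^" + "".join(out) + "$")

def allowed(path: str, include=None, exclude=None) -> bool:
    """Decide whether a path enters the pipeline; exclude always overrides include."""
    if exclude and any(_to_regex(p).match(path) for p in exclude):
        return False
    if include:
        return any(_to_regex(p).match(path) for p in include)
    return True
```

Running your pattern lists through a helper like this before launching a job is a cheap way to avoid paying for pages you never wanted.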

maxAge and modifiedSince for Incremental Pipelines

These two parameters enable efficient differential processing:

  • maxAge: Controls how long results cache in Cloudflare R2 storage. Repeat jobs within this window return cached results instantly, consuming no browser time.

  • modifiedSince: Accepts a Unix timestamp. Only pages modified after that date are fetched. Combined with caching, this creates efficient incremental update pipelines.

Structured JSON Extraction with AI

The most powerful feature for AI pipelines is structured extraction. Provide a prompt or JSON schema, and Cloudflare's Workers AI extracts structured data from each page:

{
  "url": "https://shop.example.com",
  "formats": ["json"],
  "jsonOptions": {
    "prompt": "Extract the product name, price, and availability",
    "response_format": {
      "type": "json_schema",
      "json_schema": {
        "name": "product",
        "schema": {
          "type": "object",
          "properties": {
            "name": { "type": "string" },
            "price": { "type": "number" },
            "in_stock": { "type": "boolean" }
          }
        }
      }
    }
  }
}

This eliminates the need to write custom parsers for each website. The AI handles varying page layouts and extracts data into a consistent schema.
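Because LLM-based extraction can occasionally return malformed records, it is worth validating each extracted object downstream before it enters your pipeline. Here is a minimal type check against the schema above; a full validator such as the `jsonschema` package would be more robust:

```python
def _is_type(value, t: str) -> bool:
    # JSON "number" should not accept booleans, which are ints in Python
    if t == "number":
        return isinstance(value, (int, float)) and not isinstance(value, bool)
    return isinstance(value, {"string": str, "boolean": bool}[t])

def conforms(item: dict, schema: dict) -> bool:
    """Check that every schema property is present with the declared type."""
    props = schema.get("properties", {})
    return all(
        key in item and _is_type(item[key], spec["type"])
        for key, spec in props.items()
    )

product_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "price": {"type": "number"},
        "in_stock": {"type": "boolean"},
    },
}
```

Records that fail the check can be routed to a retry queue or logged for manual review instead of silently corrupting your dataset.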

Pricing Breakdown: Why /crawl Changes the Economics

Cloudflare's pricing model is fundamentally different from competitors — and that difference matters at scale.

Free Plan (Workers Free)

  • 5 crawl jobs per day

  • 100 pages per job

  • 2 minutes of browser rendering time per day

  • render: false mode: free during beta

Paid Plan (Workers Paid $5/month)

  • Unlimited crawl jobs

  • 100 pages per job (same limit)

  • 10 hours of browser rendering time included

  • Additional browser time: $2.00 per hour

  • render: false mode: free during beta, then standard Workers pricing

Cost-per-Page Comparison

Here is where the math gets interesting:

| Tool | Pricing Model | Cost for 10,000 pages/month | Notes |
| --- | --- | --- | --- |
| Cloudflare /crawl (render: true) | Time-based ($5/mo + $2/hr overage) | $5–$12 | Depends on page complexity |
| Cloudflare /crawl (render: false) | Free during beta | $0 | Static sites only |
| Firecrawl Standard | Per-page ($47/mo for 100K) | $47 | Fixed monthly commitment |
| Firecrawl Growth | Per-page ($97/mo for 500K) | $97 | Better per-page rate |
| Crawl4AI | Self-hosted (infrastructure costs) | $20–$100+ | Depends on your hosting |
| Jina Reader | Per-page (free tier + paid) | $0–$20 | Single-page only, no multi-page crawling |

For teams already in the Cloudflare ecosystem, /crawl's time-based billing is dramatically cheaper than per-page alternatives. If a 100-page job takes 5 minutes of browser time, the 10 included hours cover about 120 such jobs, roughly 12,000 pages per month on the base $5 plan.
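That estimate is simple arithmetic, shown here under the assumption that browser time scales linearly with page count:

```python
def pages_per_month(minutes_per_100_pages: float, included_hours: float = 10) -> int:
    """Pages covered by the included browser time, assuming uniform page cost."""
    jobs = (included_hours * 60) / minutes_per_100_pages  # 100-page jobs per month
    return int(jobs * 100)
```

Heavier JavaScript pages push `minutes_per_100_pages` up and the monthly capacity down, which is exactly why render: false is worth using wherever the target site allows it.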

Building a RAG Pipeline with Cloudflare /crawl

The most common AI pipeline use case is Retrieval-Augmented Generation (RAG): feeding a knowledge base to an LLM so it answers questions with accurate, grounded information.


Step 1: Ingest Documentation in Markdown

curl -X POST \
  "https://api.cloudflare.com/client/v4/accounts/{account_id}/browser-rendering/crawl" \
  -H "Authorization: Bearer {api_token}" \
  -d '{
    "url": "https://docs.yourproduct.com",
    "limit": 100,
    "formats": ["markdown"],
    "render": false,
    "includePatterns": ["/docs/**"]
  }'

Markdown is the ideal format for RAG because it preserves heading structure (useful for chunking) while stripping navigation, footers, and other boilerplate.

Step 2: Chunk and Vectorize

Once you have the Markdown content, split it into semantic chunks (typically 500–1,000 tokens per chunk, split at heading boundaries) and generate embeddings using your preferred model (OpenAI text-embedding-3-small, Cohere embed-v4, etc.).
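A heading-aware chunker is only a few lines of Python. This sketch tracks the heading path for use as metadata; it is an illustration, not a library API, and splits purely at headings rather than at a token budget:

```python
import re

def chunk_markdown(md: str) -> list[dict]:
    """Split Markdown at heading boundaries, carrying the heading path along."""
    chunks, path, buf = [], [], []

    def flush():
        text = "\n".join(buf).strip()
        if text:
            chunks.append({"heading_path": " > ".join(path), "text": text})
        buf.clear()

    for line in md.splitlines():
        m = re.match(r"^(#{1,6})\s+(.*)", line)
        if m:
            flush()  # close out the previous section
            level = len(m.group(1))
            # truncate the path to the parent level, then append this heading
            path[:] = path[: level - 1] + [m.group(2).strip()]
        buf.append(line)
    flush()
    return chunks
```

In a real pipeline you would further split any chunk that exceeds your embedding model's context budget, keeping the same heading path on each piece.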

Step 3: Store in a Vector Database

Load the chunks and embeddings into Pinecone, Weaviate, Qdrant, or Cloudflare's own Vectorize service. Include the source URL and heading path as metadata for attribution.

Step 4: Query with Context

When a user asks a question, embed the query, retrieve the top-k relevant chunks, and inject them into your LLM prompt as context. The result: accurate answers grounded in your actual documentation, not hallucinated content.
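Retrieval itself reduces to a similarity search over the stored embeddings. A dependency-free sketch of the top-k step (a real vector database replaces this brute-force scan at scale):

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k(query_vec: list[float], index: list[tuple], k: int = 3) -> list:
    """index: (chunk_metadata, embedding) pairs; returns the k closest chunks."""
    scored = sorted(index, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [meta for meta, _ in scored[:k]]
```

The retrieved chunks, with their source URLs and heading paths, are what you interpolate into the LLM prompt as context.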

Keeping the Pipeline Fresh

Use modifiedSince to run differential updates on a schedule:

# Only fetch pages modified since your last run
curl -X POST ... -d '{
  "url": "https://docs.yourproduct.com",
  "modifiedSince": 1742860800,
  "formats": ["markdown"],
  "render": false
}'

This keeps your knowledge base current without re-processing the entire site on every run.

Automated Competitive Intelligence Pipeline

Another high-value pipeline: monitoring competitor websites for changes in pricing, positioning, and product features.

The Workflow

  1. Initial baseline: Full extraction of competitor product and pricing pages using structured JSON extraction

  2. Scheduled differential runs: Daily or weekly jobs using modifiedSince to detect changes

  3. Change detection: Compare new extractions against your stored baseline to identify price changes, new feature announcements, or positioning shifts

  4. Alert and report: Feed detected changes into a notification system (Slack webhook, email alert, dashboard update)

The structured JSON extraction is key here. Instead of comparing raw HTML (which changes constantly due to minor template updates), you compare structured data fields (product name, price, feature list), giving you meaningful signal instead of noise.
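The comparison step can then be a plain dictionary diff over the structured extractions, keyed by page URL. This helper is a sketch of that idea:

```python
def diff_records(old: dict, new: dict) -> dict:
    """Return {url: {field: (old_value, new_value)}} for every changed field."""
    changes = {}
    for url, new_rec in new.items():
        old_rec = old.get(url, {})
        delta = {
            field: (old_rec.get(field), value)
            for field, value in new_rec.items()
            if old_rec.get(field) != value
        }
        if delta:
            changes[url] = delta
    return changes
```

The non-empty entries in the result are exactly what you forward to a Slack webhook or dashboard.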

Cloudflare /crawl vs Firecrawl vs Crawl4AI: When to Use Each

Choose Cloudflare /crawl When:

  • You are already in the Cloudflare ecosystem (Workers, R2, KV, Vectorize)

  • You need high-volume extraction at the lowest possible cost

  • Your targets are primarily static or server-rendered sites (leverage the free render: false mode)

  • You want a managed service with minimal infrastructure overhead

Choose Firecrawl When:

  • Developer experience is your top priority (polished SDKs, better documentation)

  • You need advanced AI features out of the box (LLM extraction, screenshots, structured mapping)

  • You prefer predictable per-page billing over time-based billing

  • You do not want to manage Cloudflare accounts or Workers

Choose Crawl4AI When:

  • You need complete control over your extraction infrastructure

  • Budget is tight but you have engineering capacity (self-hosted, open-source, 61,000+ GitHub stars)

  • You are building training datasets and need to scale without rate limits

  • You operate in a regulated environment that requires all data processing on your own servers

Choose Jina Reader When:

  • You need single-page conversion to LLM-friendly Markdown

  • Simplicity is paramount (prepend https://r.jina.ai/ to any URL)

  • You do not need multi-page crawling or batch processing

The Ethics Debate: Cloudflare Selling the Lock and the Lockpick

The announcement sparked heated debate in the developer community. Cloudflare, the company that built its reputation on bot protection, is now selling a tool for programmatic web content extraction. One site reliability engineer's tweet calling it "the biggest betrayal in tech this year" went viral with 496,000 impressions.

Cloudflare's position is unambiguous: /crawl identifies as a bot, respects robots.txt, and does not bypass any anti-bot protections. If a site owner blocks bots, the extraction fails. Content owners retain full control.

This is an important distinction. The tool is designed for legitimate use cases (extracting your own content, building knowledge bases from public documentation, monitoring publicly available pricing), not for circumventing access controls.

Current Limitations to Know Before Building

No Image Extraction

The /crawl endpoint returns text content only (HTML, Markdown, JSON). For screenshots or visual content capture, you need the separate /screenshot endpoint.

No Bot Protection Bypass

If a target site uses CAPTCHAs, Bot Fight Mode, or Cloudflare's own challenge pages, the job will fail. This is intentional: /crawl is not designed for adversarial extraction.

Open Beta Stability

The API is in open beta. Some developers report "Crawl job not found" errors immediately after job creation. Production pipelines should include retry logic and error handling.
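A simple exponential-backoff wrapper covers the transient "Crawl job not found" case; the attempt count and delays here are arbitrary defaults, not values Cloudflare recommends:

```python
import time

def with_retries(fetch, attempts: int = 5, base_delay: float = 1.0):
    """Call fetch() with exponential backoff; re-raise after the final attempt.

    'Crawl job not found' immediately after job creation usually resolves
    within a few seconds, so a short backoff is typically enough.
    """
    for attempt in range(attempts):
        try:
            return fetch()
        except Exception:
            if attempt == attempts - 1:
                raise
            time.sleep(base_delay * 2 ** attempt)
```

Wrap the polling call (not the job-creation call) in this helper, so a failed lookup never re-launches the crawl and double-bills browser time.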

Page Limits

Both free and paid plans cap jobs at 100 pages. For larger sites, you need multiple jobs with different starting URLs or pattern filters. This is a meaningful constraint for enterprise-scale extraction.

The /crawl API addresses a pain point that every team building AI applications has encountered: getting clean, structured data from websites at scale. Before this API, the typical approach involved setting up headless browsers, managing proxy rotation, handling JavaScript rendering, dealing with rate limiting, and parsing HTML into a usable format. Each of these steps is a potential failure point, and maintaining the infrastructure to do this reliably costs significant engineering time.

For email platforms that need to enrich contact data with company information, the /crawl API opens up new possibilities. Maylee could use it to automatically extract key information from a prospect's website, such as their product offering, team size, technology stack, and recent news, to help users craft more relevant and personalized email responses. The structured output format means this data can be fed directly into templates without manual parsing.

The integration with Cloudflare's broader AI platform, including Workers AI, AI Gateway, and Vectorize, creates a powerful end-to-end pipeline for AI data processing. You can crawl pages, extract text, generate embeddings, store them in a vector database, and serve search results, all within Cloudflare's infrastructure. For teams already using Cloudflare for CDN and DNS, adding AI data pipelines to the same platform simplifies both the architecture and the billing.

Rate limiting and ethical crawling are important considerations that Cloudflare has addressed directly in the API design. The /crawl endpoint respects robots.txt by default, implements automatic rate limiting to avoid overwhelming target servers, and provides clear documentation on responsible use. For organizations building commercial data pipelines, this built-in compliance reduces the legal and ethical risks associated with web scraping.

The technical implementation reveals thoughtful engineering decisions. The API uses Cloudflare's global network of data centers to distribute crawl requests geographically, which means crawling appears to originate from the edge node closest to the target server. This improves performance and reduces the likelihood of IP-based blocking, since requests come from Cloudflare's well-known and generally whitelisted IP ranges.

For AI application developers, the output format is designed to be directly consumable by language models. The API returns cleaned text, extracted metadata, and structured links in a JSON format that can be fed directly into an LLM prompt or a vector embedding pipeline. This eliminates the common preprocessing step where developers spend significant effort cleaning HTML, removing navigation elements, and extracting the actual content from the page structure.

The pricing model follows Cloudflare's typical approach of offering a generous free tier with pay-as-you-go scaling. The free tier includes enough capacity for development and small-scale production use, while the $5/month Workers Paid plan scales with usage. For startups building AI-powered email tools like Maylee that need to enrich contact and company data from web sources, the free tier provides enough capacity to validate the use case before committing to a paid plan.

Security considerations are worth highlighting for enterprise users. All crawled data passes through Cloudflare's infrastructure, which means Cloudflare has access to the crawled content. For organizations with strict data handling requirements, this is an important factor to evaluate. The API does support custom headers and authentication tokens for crawling protected pages, but the data still transits through Cloudflare's network. Teams handling highly sensitive data may prefer self-hosted crawling solutions despite the additional infrastructure overhead.

Getting Started: From Zero to Working Pipeline in 15 Minutes

  1. Create a Cloudflare account at dash.cloudflare.com (free)

  2. Generate an API token with Browser Rendering permissions

  3. Copy your Account ID from the Workers dashboard

  4. Run your first extraction using the curl commands above

  5. Upgrade to Workers Paid ($5/month) when you exceed free-tier limits

The official documentation at developers.cloudflare.com/browser-rendering covers all parameters, output formats, and advanced configuration.

The long-term strategic implications of Cloudflare's /crawl API extend beyond simple web scraping. By making structured web data easily accessible, Cloudflare is positioning itself as the data infrastructure layer for AI applications, complementing its existing role as the network infrastructure layer for web applications. Companies that build their AI data pipelines on Cloudflare's platform benefit from the same reliability, global distribution, and security infrastructure that powers millions of websites. For email platforms like Maylee that need reliable access to web data for features like link preview generation, sender reputation checking, and contact enrichment, building on a platform with Cloudflare's uptime track record provides a level of reliability that self-hosted solutions struggle to match.

For teams already using Cloudflare Workers for serverless compute, the /crawl API integrates seamlessly into existing deployment pipelines. A Worker can trigger a crawl, process the results, store them in R2 or a D1 database, and serve them via an API endpoint, all within Cloudflare's platform and without managing any servers. This serverless-first approach to AI data pipelines represents the direction the industry is heading.

The ability to turn any website into structured, AI-ready data with a single API call is a fundamental building block for the next generation of AI applications. Tools like Maylee, which use AI to understand and classify incoming emails, depend on the same principle: extracting meaning from unstructured content and turning it into actionable intelligence. Cloudflare just made the web-data side of that equation dramatically more accessible.

Cloudflare /crawl API FAQ: Common Questions Answered

How much does Cloudflare /crawl cost?

The free plan allows 5 jobs per day with 100 pages each. The paid plan costs $5/month and includes 10 hours of browser rendering time. The render:false mode (no JavaScript execution) is free during the beta period. Additional browser time costs $2.00 per hour.

Can Cloudflare /crawl handle JavaScript-rendered websites?

Yes. Set render:true to launch a full headless Chrome instance that executes JavaScript before extracting content. This handles React, Vue, Angular, and other SPA frameworks. Set render:false for static sites to save cost.

How does Cloudflare /crawl compare to Firecrawl?

Cloudflare /crawl uses time-based billing (roughly $5-12/month for 10,000 pages) while Firecrawl charges per page ($47/month for 100,000 pages). Cloudflare is cheaper at scale but has less polished SDKs and documentation. Firecrawl offers better developer experience and built-in AI extraction features.

Does Cloudflare /crawl respect robots.txt?

Yes. The tool identifies itself as a bot and respects robots.txt by default. It does not bypass CAPTCHAs, Bot Fight Mode, or other anti-bot protections. If a site blocks bots, the extraction will fail.

What output formats does Cloudflare /crawl support?

The API returns content in HTML, Markdown, and structured JSON. The JSON format supports AI-powered extraction using prompts or JSON schemas, allowing you to extract specific data fields like product names, prices, and descriptions without writing custom parsers.

Can I use Cloudflare /crawl for building RAG pipelines?

Yes, this is one of the primary use cases. Extract documentation or knowledge base content in Markdown format, chunk it by heading structure, vectorize the chunks, and store them in a vector database. Use the modifiedSince parameter for efficient incremental updates.

What are the page limits for Cloudflare /crawl?

Both free and paid plans cap individual jobs at 100 pages. For larger sites, you need to run multiple jobs with different starting URLs or use includePatterns to target specific sections of a site.

Is Cloudflare /crawl stable enough for production use?

The API is currently in open beta. Some developers have reported intermittent "Crawl job not found" errors. For production pipelines, implement retry logic, error handling, and fallback mechanisms. The render:false mode appears more stable than render:true.

© 2026 Maylee. All rights reserved.