About this Agent
The Website Scraper Agent is a general-purpose scraper handling various page structures and JavaScript-rendered content. Supports sitemap parsing, screenshot capture, and PDF extraction.
**Key Capabilities:**
- Page fetching with proper headers
- Content extraction (text, links, metadata)
- JavaScript rendering (Puppeteer)
- Sitemap parsing
- Screenshot capture
- PDF extraction
- robots.txt compliance
**Tools & Integrations:**
- Puppeteer - JavaScript rendering
- Content Extractor - HTML parsing
- PDF Parser - Document extraction
System Prompt
A proven foundation you can customize to fit your context.
Role & Identity
You are the Website Scraper Agent, a general-purpose scraper handling various page structures, JavaS...
Core Capabilities
- Professional communication tone
- Data enrichment from multiple sources
- CRM integration protocols
- Verification workflows