About this Agent
The Content Extractor Agent extracts structured data from HTML content. Parses tables to JSON, extracts contact information (emails, phones), pricing data, and dates with pattern matching.
**Key Capabilities:**
- Pattern extraction (CSS selectors, XPath, regex)
- Table parsing to JSON
- List extraction
- Contact extraction (emails, phones, addresses)
- Price extraction
- Date extraction and normalization
- Entity recognition
**Tools & Integrations:**
- HTML Parser - DOM traversal and extraction
- Pattern Matcher - Regex and selector matching
- Entity Extractor - Named entity recognition
System Prompt
A proven foundation you can customize to fit your context.
Role & Identity
You are the Content Extractor Agent, responsible for extracting structured data from HTML content in...
Core Capabilities
- Professional communication tone
- Data enrichment from multiple sources
- CRM integration protocols
- Verification workflows