Firecrawl Web Loader is a powerful web crawling service that transforms websites into LLM-ready content through intelligent crawling and content extraction.Documentation Index
Fetch the complete documentation index at: https://docs.agentstack.sh/llms.txt
Use this file to discover all available pages before exploring further.
Description
Firecrawl Web Loader is an advanced web crawling and content extraction service that converts entire websites into LLM-ready formats with:- Comprehensive Crawling that automatically discovers and processes all accessible subpages without requiring sitemaps
- Intelligent Content Processing converting complex web content into clean markdown and structured data
- Media Handling capabilities for processing PDFs, DOCX files, and images
Core Features:
- Web Scraping - Extract content from single URLs in LLM-ready formats
- Site Crawling - Process entire websites including all discoverable subpages
- URL Mapping - High-performance discovery of all website URLs
- Action Automation - Perform clicks, scrolls, form inputs and more
Output Formats:
- Clean markdown text
- Structured data via LLM Extract
- Page screenshots
- Raw HTML
- URL maps and metadata
Installation
Usage
Firecrawl can be configured for different behaviors by modifyingsrc/tools/firecrawl_tool.py