Secure Scraper

Protect AI agents from web-based attacks

Firecrawl-compatible web scraping with built-in security scanning. Block indirect prompt injections, cache responses, and control which URLs your agents can access.

6

Threat categories

< 10ms

Cached response

50%+

Cost savings

How It Works

Scan every scraped page before it reaches your agent.

1

Agent scrapes URL

Your agent requests web content via the Firecrawl-compatible endpoint.

2

Bastio scans content

Six threat categories analyzed in real-time before content is returned.

3

Block, sanitize, or warn

Safe content delivered instantly. Threats blocked, redacted, or flagged.

Threat Categories

Six categories of web content attacks.

ThreatExampleAction
Env Exfiltrationprocess.env.*, os.environ[]Block
Malicious Codeexec(), spawn(), system()Block
Suspicious URLsngrok, webhook.site, IP-basedBlock
Fake DocumentationURGENT: Security update...Sanitize
Prompt InjectionsIgnore previous instructionsSanitize
Jailbreak AttemptsDAN prompts, roleplay bypassBlock
Block Behaviors

Configurable responses for detected threats.

ActionBehaviorUse Case
blockReturn error, no content deliveredAutonomous agents, compliance
sanitizeRedact threats, return safe contentResearch assistants (default)
warnReturn full content with threat warningsTesting, monitoring

What's included

Security, caching, and compatibility — built in

Every scraped URL gets automatic threat scanning, intelligent caching, and domain control at no extra configuration.

6 threat categories detected
Indirect prompt injection blocking
Env variable exfiltration prevention
Configurable block behaviors
Per-proxy URL caching (24h TTL)
Firecrawl v2 API compatibility
Allow-list & block-list controls
BYOK Firecrawl key support
Malicious code block detection
Suspicious URL identification
Fake documentation detection
Cost tracking & cache analytics

Drop-in Firecrawl Replacement

Change one URL to add security scanning to your existing Firecrawl integration.

# Before (Firecrawl)
POST api.firecrawl.dev/v2/scrape

# After (Bastio)
POST api.bastio.com/v1/guard/{proxyID}/scrape

Python Example

Full Firecrawl v2 API compatibility with security response.

response = requests.post(
    f"https://api.bastio.com/v1/guard/{PROXY_ID}/scrape",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={"url": "https://example.com", "formats": ["markdown"]}
)

result = response.json()
if result["security"]["action"] == "BLOCK":
    print(f"Threat blocked: {result['security']['threats_found']}")
else:
    content = result["data"]["markdown"]

Intelligent Caching

Per-proxy 24-hour URL cache cuts costs by 50%+ while still scanning for threats on every request.

URL Control

Allow-lists and block-lists give defense-in-depth control over which domains your agents can access.

BYOK Mode

Bring your own Firecrawl API key for just $0.0005/URL security scanning fee.

Start securing your web scraping

100 free secure scrapes per month. Full Firecrawl compatibility.