Secure Scraping
Give your AI agents safe internet access
The only secure gateway designed for autonomous web scraping. Prevent indirect prompt injection, block malicious sites, and control costs.
6
Threat types detected
<Â 10ms
Cached response time
50%+
Cost savings via caching
Three steps to safe web access.
Agent requests URL
Your agent sends a URL to the Bastio API instead of fetching it directly.
Sandboxed security scan
We render the page in an isolated browser, scanning for injections, malware, and PII.
Clean content returned
You receive safe, clean Markdown or JSON, stripped of all threats.
Six types of web content threats.
| Threat | Example | Action |
|---|---|---|
| Prompt Injection | Hidden instructions in page content | Block |
| Credential Theft | API keys, tokens in scraped data | Redact |
| Malicious URLs | C2 servers, phishing domains | Block |
| Code Injection | Malicious script blocks | Sanitize |
| Data Exfiltration | process.env leaks | Block |
| Fake Documentation | Poisoned API docs | Warn |
Three configurable response modes.
| Behavior | Description | Use Case |
|---|---|---|
| block | Return error, no content | Maximum security |
| sanitize | Redact threats, return safe content | Default |
| warn | Return full content with threat warnings | Monitoring |
What's included
Complete protection for web scraping agents
From threat detection to cost control, everything you need to give your agents safe internet access.
Firecrawl SDK (TypeScript)
Drop-in replacement with security scanning
import Firecrawl from '@mendable/firecrawl-js';
const app = new Firecrawl({
apiKey: process.env.BASTIO_API_KEY,
apiUrl: "https://api.bastio.com/v1/firecrawl"
});
// Use exactly as you normally would
const result = await app.scrape(
'https://example.com',
{
formats: ['markdown', 'html'],
onlyMainContent: true
}
);cURL Example
POST /v1/guard/{proxyID}/scrape
curl -X POST \
"https://api.bastio.com/v1/guard/{proxyID}/scrape" \
-H "Authorization: Bearer bastio_sk_..." \
-H "Content-Type: application/json" \
-d '{
"url": "https://example.com",
"formats": ["markdown"]
}'Safe Browsing
Block access to known malicious domains, C2 servers, and phishing sites. Maintain allow-lists for strict control.
Credential Protection
Detect and redact API keys, PII, and sensitive data from scraped content before it reaches your agent.
Cost Control
Intelligent caching reduces scraping costs by up to 50%. Set strict budget limits and rate limits per agent.
Secure your agents today
Don't let your AI agents become a security liability. Start protecting your infrastructure now.