Build with Orsa.dev
Orsa is the Internet's Context API — web data extraction and brand intelligence in a single platform.
What is Orsa?
Orsa provides a unified API for:
- Web Scraping — Extract HTML, Markdown, images, and sitemaps from any URL
- Brand Intelligence — Get logos, colors, fonts, social links, and industry data from any domain
- AI Extraction — Ask natural language questions about any website
- Screenshots — Capture pixel-perfect screenshots with light/dark mode support
- Site Crawling — Crawl entire websites with configurable depth and real-time progress
Quick Example
import Orsa from 'orsa';
const client = new Orsa({ apiKey: process.env.ORSA_API_KEY });
// Get brand data for any company
const brand = await client.brand.retrieve({ domain: 'stripe.com' });
console.log(brand.name); // "Stripe"
console.log(brand.colors); // ["#635BFF", "#0A2540", ...]
console.log(brand.logos); // [{ url: "...", type: "svg" }]
// Scrape a page as clean markdown
const page = await client.web.scrapeMarkdown({
url: 'https://stripe.com/pricing',
});
// AI-powered data extraction
const data = await client.ai.query({
domain: 'stripe.com',
data_to_extract: 'pricing plans and features',
});Key Features
| Feature | Description | Credits |
|---|---|---|
| Brand Retrieve | Full brand data from a domain | 5 |
| Scrape HTML | Raw HTML extraction | 1 |
| Scrape Markdown | Clean Markdown conversion | 2 |
| Screenshot | Pixel-perfect capture | 5 |
| AI Query | Natural language extraction | 10 |
| Site Crawl | Multi-page crawl | 50+ |
Getting Started
- Create an account (opens in a new tab) and get your API key
- Follow the Quickstart Guide
- Explore the API Reference
Open Source
Orsa's core engine is fully open source. Self-host, extend, or contribute on GitHub (opens in a new tab).