Introduction

Build with Orsa.dev

Orsa is the Internet's Context API — web data extraction and brand intelligence in a single platform.

What is Orsa?

Orsa provides a unified API for:

  • Web Scraping — Extract HTML, Markdown, images, and sitemaps from any URL
  • Brand Intelligence — Get logos, colors, fonts, social links, and industry data from any domain
  • AI Extraction — Ask natural language questions about any website
  • Screenshots — Capture pixel-perfect screenshots with light/dark mode support
  • Site Crawling — Crawl entire websites with configurable depth and real-time progress

Quick Example

import Orsa from 'orsa';
 
const client = new Orsa({ apiKey: process.env.ORSA_API_KEY });
 
// Get brand data for any company
const brand = await client.brand.retrieve({ domain: 'stripe.com' });
console.log(brand.name);       // "Stripe"
console.log(brand.colors);     // ["#635BFF", "#0A2540", ...]
console.log(brand.logos);       // [{ url: "...", type: "svg" }]
 
// Scrape a page as clean markdown
const page = await client.web.scrapeMarkdown({
  url: 'https://stripe.com/pricing',
});
 
// AI-powered data extraction
const data = await client.ai.query({
  domain: 'stripe.com',
  data_to_extract: 'pricing plans and features',
});

Key Features

FeatureDescriptionCredits
Brand RetrieveFull brand data from a domain5
Scrape HTMLRaw HTML extraction1
Scrape MarkdownClean Markdown conversion2
ScreenshotPixel-perfect capture5
AI QueryNatural language extraction10
Site CrawlMulti-page crawl50+

Getting Started

  1. Create an account (opens in a new tab) and get your API key
  2. Follow the Quickstart Guide
  3. Explore the API Reference

Open Source

Orsa's core engine is fully open source. Self-host, extend, or contribute on GitHub (opens in a new tab).