Contents Extraction
Pull clean Markdown or HTML from any URL — ideal for LLM ingestion.
Hand the Contents API a list of URLs and get back clean Markdown (or HTML) plus page metadata — no headless browser, no parsing. It’s the fastest way to turn arbitrary web pages into LLM-ready text without standing up your own crawler.
What You’ll Build
A one-call extractor that turns any URL into clean Markdown — title, body, and page metadata in a single response. The formats parameter lets you ask for Markdown, raw HTML, or both, and crawl_timeout caps the wait per page.
Try It Live
Run a real Contents API request right here — no setup. Open the Try It panel below, paste your API key, drop in a URL, and send it against the live endpoint.
Prerequisites
Walkthrough
Python
TypeScript
Example Output
Next Steps
Find the URLs to extract, then feed them to Contents.
Let the Research API search, read, and synthesize for you.
Full docs for formats, crawl timeouts, and metadata.