API to Search, Extract, Structure Web Data

Get clean data for your AI from any website and automate
your web workflows

One unified API for your AI
< Your AI Agent >
  • Start "research YC" workflow
  • Automate brand protection
  • Research donors in NYC
  • Find local businesses
  • Analyze brand visibility
[-- Data Layer --]
· Research agents
· Parsers - structured data
· Data router
· Automation engine
· Click, fill forms
· Distributed infra
· Map/Crawl
· VM sandboxes
· Batches API
Output

{
    "id": "request_56is5c9gyw",
    "created": 1317322740,
    "result": {
        "markdown_content": "# Ex",
        "json_content": {},
        "html_content": "<DOC>"
    }
}

Trusted by the best startups in the world

...and many more

Developer-centric
# Python: ask a question and get the answer back as structured JSON
import json

import requests

API_URL = 'https://api.olostep.com/v1/answers'
API_KEY = '<your_token>'

headers = {
    'Authorization': f'Bearer {API_KEY}',
    'Content-Type': 'application/json'
}

data = {
    "task": "What is the latest book by J.K. Rowling?",
    # Keys describe the JSON shape you want back; values are left empty
    "json": {
        "book_title": "",
        "author": "",
        "release_date": ""
    }
}

response = requests.post(API_URL, headers=headers, json=data)
result = response.json()

print(json.dumps(result, indent=4))

// Using native fetch API (Node.js v18+)
const API_URL = 'https://api.olostep.com/v1/answers';
const API_KEY = '<your_token>';

fetch(API_URL, {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    "task": "What is the latest book by J.K. Rowling?",
    "json": {
      "book_title": "",
      "author": "",
      "release_date": ""
    }
  })
})
  .then(response => response.json())
  .then(result => {
    console.log(JSON.stringify(result, null, 4));
  })
  .catch(error => console.error('Error:', error));

# Python: start a crawl of up to 10 pages under a start URL
import requests

API_URL = 'https://api.olostep.com/v1/crawls'
API_KEY = '<token>'

headers = {'Authorization': f'Bearer {API_KEY}'}
data = {
    "start_url": "https://docs.stripe.com/api",
    "include_urls": ["/**"],
    "max_pages": 10
}

# json= serializes the body and sets the Content-Type header
response = requests.post(API_URL, headers=headers, json=data)
result = response.json()

print(f"Crawl ID: {result['id']}")
print(f"URL: {result['start_url']}")

// Using native fetch API (Node.js v18+)
const API_URL = 'https://api.olostep.com/v1/crawls';
const API_KEY = '<token>';

fetch(API_URL, {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    "start_url": "https://docs.stripe.com/api",
    "include_urls": ["/**"],
    "max_pages": 10
  })
})
  .then(response => response.json())
  .then(result => {
    console.log(`Crawl ID: ${result.id}`);
    console.log(`URL: ${result.start_url}`);
  })
  .catch(error => console.error('Error:', error));

# Python: scrape a single URL
import requests

API_URL = 'https://api.olostep.com/v1/scrapes'
API_KEY = '<your_token>'

headers = {'Authorization': f'Bearer {API_KEY}'}
data = {"url_to_scrape": "https://github.com"}

response = requests.post(API_URL, headers=headers, json=data)
result = response.json()

print(f"Scrape ID: {result['id']}")
print(f"URL: {result['url_to_scrape']}")

// Using native fetch API (Node.js v18+)
const API_URL = 'https://api.olostep.com/v1/scrapes';
const API_KEY = '<your_token>';

fetch(API_URL, {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    "url_to_scrape": "https://github.com"
  })
})
  .then(response => response.json())
  .then(result => {
    console.log(`Scrape ID: ${result.id}`);
    console.log(`URL: ${result.url_to_scrape}`);
  })
  .catch(error => console.error('Error:', error));

# Python: create an agent from a natural-language prompt
import requests

API_URL = 'https://api.olostep.com/v1/agents'  # endpoint available to select customers
API_KEY = '<token>'

headers = {'Authorization': f'Bearer {API_KEY}', 'Content-Type': 'application/json'}
data = {
    "prompt": '''
      Search every portfolio company from every fund from
      (https://www.vcsheet.com/funds) and return the results into a google sheet
      with the following columns (Fund Name, Fund Website
      URL, Fund LinkedIn URL, Portfolio Company Name, Portfolio
      Company URL, Portfolio Company LinkedIn URL). Run every week
      on Monday at 9:00 AM. Send an email to [email protected] when
      new portfolio companies are added to any of these funds.
    ''',
    "model": "gpt-4.1"
}

response = requests.post(API_URL, headers=headers, json=data)
result = response.json()

print(f"Agent ID: {result['id']}")
print(f"Status: {result['status']}")
# You can then schedule this agent

// Using native fetch API (Node.js v18+)
const API_URL = 'https://api.olostep.com/v1/agents'; // endpoint available to select customers
const API_KEY = '<token>';

fetch(API_URL, {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    "prompt": `
      Search every portfolio company from every fund from
      (https://www.vcsheet.com/funds) and return the results into a google sheet
      with the following columns (Fund Name, Fund Website
      URL, Fund LinkedIn URL, Portfolio Company Name, Portfolio
      Company URL, Portfolio Company LinkedIn URL). Run every week
      on Monday at 9:00 AM. Send an email to [email protected] when
      new portfolio companies are added to any of these funds.
    `,
    "model": "gpt-4.1"
  })
})
  .then(response => response.json())
  .then(result => {
    console.log(`Agent ID: ${result.id}`);
    console.log(`Status: ${result.status}`);
    // You can then schedule this agent
  })
  .catch(error => console.error('Error:', error));

Get the data in the format you want

Get Markdown, HTML, PDF or Structured JSON

Pass a URL to the API and retrieve the page's HTML, Markdown, PDF, or plain text. You can also specify a schema to get back only the clean, structured JSON you want.

JS execution + residential IPs

Web pages rendered in a browser

Every request runs with full JS support, premium residential IP addresses, and proxy rotation to avoid bot detection.

Crawl

Get all the data from a single URL

Multi-depth crawling lets you get clean Markdown from all the subpages of a website. It also works without a sitemap, which is useful for documentation websites.
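
To make "multi-depth" concrete, here is a minimal sketch of a breadth-first traversal with a page limit and a depth limit. It is only an illustration of the general idea, not Olostep's implementation: the `SITE` link graph, the `crawl` function, and the `max_depth` parameter are hypothetical (`max_pages` mirrors the field in the crawl request above), and no network calls are made.

```python
from collections import deque

# Hypothetical in-memory "site": each URL maps to the links found on that page.
SITE = {
    "/docs": ["/docs/api", "/docs/guides"],
    "/docs/api": ["/docs/api/auth", "/docs"],
    "/docs/guides": ["/docs/guides/quickstart"],
    "/docs/api/auth": [],
    "/docs/guides/quickstart": ["/docs/api"],
}


def crawl(start_url, max_pages, max_depth):
    """Visit pages breadth-first, once each, until a limit is reached."""
    visited = []
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue and len(visited) < max_pages:
        url, depth = queue.popleft()
        visited.append(url)
        if depth == max_depth:
            continue  # do not follow links beyond the depth limit
        for link in SITE.get(url, []):
            if link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return visited


print(crawl("/docs", max_pages=10, max_depth=2))
print(crawl("/docs", max_pages=3, max_depth=2))
```

Because pages are discovered by following links, this kind of traversal needs no sitemap; the limits simply bound how far it goes.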

Get clean data

We handle the heavy lifting

Browser infra, rate limits, and JS-rendered content

Crawling

Get the data from all subpages of a website. No sitemap required. This is useful if you are building an AI agent that needs specific context from a documentation website.

Batches

You can submit from 100 to 100k URLs in a batch and get the content (Markdown, HTML, raw PDFs, or structured JSON) back in 5-7 minutes. This is useful for deep research agents, social media monitoring, and aggregating data at scale.
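
If you have more URLs than fit in one batch, you would need to split them first. The sketch below shows one way to do that locally, using the 100 to 100k range quoted above. The function name `split_into_batches` and the example URLs are hypothetical, and the sketch does not call the batches endpoint or assume its request format.

```python
MIN_BATCH = 100        # lower bound quoted in the text
MAX_BATCH = 100_000    # upper bound quoted in the text


def split_into_batches(urls, max_size=MAX_BATCH):
    """Split a list of URLs into consecutive chunks of at most max_size items."""
    if not MIN_BATCH <= max_size <= MAX_BATCH:
        raise ValueError(f"max_size must be between {MIN_BATCH} and {MAX_BATCH}")
    return [urls[i:i + max_size] for i in range(0, len(urls), max_size)]


# Hypothetical input: 250,000 placeholder URLs
urls = [f"https://example.com/page/{n}" for n in range(250_000)]
batches = split_into_batches(urls)
print([len(batch) for batch in batches])  # [100000, 100000, 50000]
```

Note that the last chunk can end up smaller than the stated minimum of 100; how to handle that case is left to the caller.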

Reliable

Get the content you want when you want it. All requests go through a premium proxy.

PDF parsing

Olostep can parse and output content from web-hosted PDFs, DOCX files, and more.

Actions

Click, type, fill forms, scroll, wait and more dynamically on websites

Most cost-effective API on the market

Pricing that Makes Sense

We want you to be able to build a business on top of Olostep.
Start for free. Scale with no worries.

Free
$0 (cost per 500 requests: $0)
No credit card required
  • 500 successful requests
  • All requests are JS-rendered and use residential IP addresses
  • Low rate limits

Starter
$9 per month (cost per 1K requests: $1.800)
  • 5000 successful requests/month
  • Everything in Free Plan
  • 150 concurrent requests

Standard
$99 USD per month (cost per 1K requests: $0.495)
  • 200K successful requests/month
  • Everything in Starter Plan
  • 500 concurrent requests

Scale
$399 USD per month (cost per 1K requests: $0.399)
  • 1 Million successful requests/month
  • Everything in Standard Plan
  • AI-powered Browser Automations

Free
$0 per month
  • 3000 successful scrapes
  • All requests are JS-rendered and use residential IP addresses

Starter
$29 per month
  • 20K successful scrapes
  • All requests are JS-rendered and use residential IP addresses

Standard
$99 USD per month
  • 200K successful scrapes
  • All requests are JS-rendered and use residential IP addresses

Scale
$399 USD per month
  • 1 Million successful scrapes
  • All requests are JS-rendered and use residential IP addresses

Top-ups

Need flexibility or have spiky usage? You can buy credit packs, which are valid for 6 months.

Credit packs
  • $20 for 10k credits
  • $200 for 250k credits
  • $1000 for 2M credits
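
To compare the monthly plans with the credit packs, it can help to put them on the same per-1K basis. The sketch below does that arithmetic with the prices listed above. It assumes one credit covers one request, which this page does not state for every endpoint, and the function name `cost_per_thousand` is just for illustration.

```python
def cost_per_thousand(price_usd, volume):
    """Return the effective USD price of 1,000 requests (or credits)."""
    return price_usd / volume * 1000


# (price in USD, included requests or credits), taken from the pricing above
plans = {
    "Starter plan": (9, 5_000),
    "Standard plan": (99, 200_000),
    "Scale plan": (399, 1_000_000),
}
packs = {
    "$20 pack": (20, 10_000),
    "$200 pack": (200, 250_000),
    "$1000 pack": (1000, 2_000_000),
}

for name, (price, volume) in {**plans, **packs}.items():
    print(f"{name}: ${cost_per_thousand(price, volume):.3f} per 1K")
```

Under that assumption the plans work out to $1.800, $0.495, and $0.399 per 1K, matching the listed figures, and the packs to $2.000, $0.800, and $0.500 per 1K.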

Enterprise

Hundreds of millions of credits with enterprise-grade reliability. We offer custom discounts
Contact Sales

Data tailored to your industry

Access the clean, structured data that matters most to you, when it matters most. Power search, deep research, AI agents, and your applications.

Deep Search

Access custom, hyper-specialized B2B indexes for your industry to search and extract comprehensive data beyond what general web indexes cover

Recruiting

Identify, research, and validate candidates faster with intelligence and data aggregated from top-quality profiles and specialist web sources.

Power AI applications

Get clean, structured data from any website as Markdown, HTML, screenshots, and more to power your AI applications and workflows.

Monitor the Web

Monitor any webpage for DOM changes, stock availability, price changes, job openings or fresh content. Run automatically on a schedule and get alerted
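
One common way to detect that a page has changed between two scheduled runs is to compare content fingerprints. The sketch below illustrates that generic technique with SHA-256 hashes of two Markdown snapshots. It is not a description of how Olostep's monitoring works, and the function names and sample content are hypothetical.

```python
import hashlib


def fingerprint(content: str) -> str:
    """Hash page content after collapsing whitespace, so formatting noise is ignored."""
    normalized = " ".join(content.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def has_changed(previous: str, current: str) -> bool:
    """Return True when the two snapshots differ in anything but whitespace."""
    return fingerprint(previous) != fingerprint(current)


# Hypothetical snapshots from two scheduled runs
old = "# Widget\nPrice: $10\nIn stock"
new = "# Widget\nPrice: $12\nIn stock"

print(has_changed(old, new))           # True
print(has_changed(old, old + "  \n"))  # False
```

Storing only the fingerprint from the previous run is enough to decide whether an alert is needed.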

Automate data pipelines

Automate complex data pipelines with the /agents endpoint through natural language prompts. You can also pass your own internal knowledge as context

Deep research agents

Enable your agent to conduct deep research on large Web datasets.

Spreadsheet enrichment

Get real-time web data to enrich your spreadsheets and analyze data.

Lead generation

Research, enrich, validate and analyze leads. Enhance your sales data

Vertical AI search

Build industry-specific search engines to turn data into an actionable resource.

AI Brand visibility

Monitor brands to help improve their AI visibility (Answer Engine Optimization).

Agentic Web automations

Enable AI Agents to automate tasks on the Web: fill forms, click on buttons, etc.

Customers

Trusted by world-class teams

Discover why the best teams in the world choose Olostep.
Read more customer stories

Michelle Julia
Co-founder & CEO Aurium

Olostep is the best!!! We automated entire data pipelines with just a prompt

Richard He
Co-founder & CEO Openmart

Olostep has become the default Web Layer infrastructure for our company

Max Brodeur-Urbas
Co-founder & CEO Gumloop

Olostep works like a charm! And your customer service is exceptional

Rob Hayes
Co-founder Merchkit

Olostep lets us turn any website into an API. Great product, great people

Brandon Cohen
Co-founder & CTO CivilGrid

I highly recommend Olostep, great product!

Co-founder & CEO Gedd.it

We verify coupon codes at scale. Love Olostep. It works on any e-commerce

Trevor West
Co-founder & CEO Podqi

Olostep is the best API to search, extract, and structure data from the Web. Happy to be customers

Rida Naveed
Co-founder Zecento

We use /batches combined with parsers and it's magical how we can get structured data deterministically at large scale

Kieran V.
Growth PlotsEvents

Olostep allowed us to search and structure events data across the Web

Paul Mit
Founder Foundbase

Reliable and cost-effective API for working with data. Congrats on the cool product

Questions?

Frequently asked questions

Have other questions? Get in touch via [email protected]

What is Olostep?

Olostep is the Web Data API for AI and Research Agents.

The Olostep API is the best web search, scraping and crawling API for AI used by some of the leading startups in the world.

The Olostep Agent allows anyone to automate research workflows and build data pipelines without code, using just a natural-language prompt.

What is counted as a request?

One request corresponds to one web page or PDF. We don't charge extra for bandwidth (GB) or proxies; those costs are included in the cost per request.

Does Olostep charge for failed requests?

We don't charge for failed requests. If you use the /answers endpoint or another endpoint that makes LLM calls, we pass those LLM costs on to you, but on our end we only charge for successful requests.

Which websites can Olostep access and interact with?

You can access and interact with any website that is publicly accessible. If you are building AI automations and your agent needs to pass cookies or log in, get in touch at [email protected]

Can Olostep support my high-volume requests?

Yes, the API can scale to billions of requests per month.

How can I pay?

You can pay using Stripe Payment Links.

Why should I use Olostep?

Because it's reliable (99.5%), cost-effective (up to 70% cheaper), scalable, and flexible enough to fit your existing workflows and backend. Olostep is the only platform where you can create your own parsers to return deterministic results at scale in a cost-effective way. You can request the features you need and we will try to build them for you. Plus, you can get free API keys and test it to see if it fits your needs.

Can I switch plans after signing up?

Yes, plans are pro-rated, meaning if you've already paid for a previous plan, the remaining credits will be transferred to your next plan. You won't have to pay again for what you've already covered.

Does Olostep offer a free trial?

Yes, Olostep is free for the first 500 requests. After that, paid plans start at $9/month for 5000 credits per month. Olostep is considered the most reliable and cost-effective API on the market. Try it for free and see for yourself.

Can I ask for a refund if I don't use it?

We’re fully committed to building products that you love. If for whatever reason you’re unsatisfied with the Olostep API, please email us at [email protected] to receive a full refund within a few hours. We'll also refund you if it doesn't turn out to be useful. If you decide to use it only after a certain period of time, we'll refund the time you didn't use it.

How does it return the results?

The API returns the id of the request (for future retrievals), the Markdown, and the HTML of the page. You can also retrieve JSON with specific parsers or structured data with LLM extraction. If you are using the /answers endpoint as the search basis for your AI, it returns an answer, JSON in the schema you defined, and the sources Olostep searched.
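
As a small illustration of reading such a response, the sketch below parses the sample output shown near the top of this page and picks out one format at a time. The helper `get_content` is hypothetical, and the field names come only from that sample, so a real response may contain more or different fields.

```python
import json

# Sample payload based on the output example shown earlier on this page.
raw = '''
{
    "id": "request_56is5c9gyw",
    "created": 1317322740,
    "result": {
        "markdown_content": "# Ex",
        "json_content": {},
        "html_content": "<DOC>"
    }
}
'''


def get_content(response: dict, fmt: str):
    """Return result["<fmt>_content"], or None when that format is absent."""
    return response.get("result", {}).get(f"{fmt}_content")


response = json.loads(raw)
print(response["id"])                     # request_56is5c9gyw
print(get_content(response, "markdown"))  # # Ex
print(get_content(response, "pdf"))       # None
```

Returning `None` for a missing format keeps the caller from failing when a format was not requested.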

Can Olostep automate my data pipelines?

Reach out to us at [email protected] or contact our sales team at https://www.olostep.com/contact-sales with your use case and we can take a look. Our aim with the Olostep Agent is to automate any business data pipeline and research workflow on the Web, so we will do our best to support your use case.

Who should use Olostep?

Olostep is especially useful for AI startups that rely on Web data to power or improve their services. It also suits companies that need to enrich data, monitor website changes, analyze historical web data, or equip their AIs with web search capabilities that ground them in real-world data and facts. Developers, AI engineers, data scientists, and researchers can also use Olostep to gather web data for market research, LLM fine-tuning, and more. Olostep returns clean, structured data through a single API, so it's compatible with your existing backend.

Can I extract data with a prompt?

Yes, Olostep lets you extract data using natural language prompts. If you know the exact URL containing your data, use the /scrapes endpoint with llm_extract and describe what you want to extract. At scale, for deterministic needs, we recommend using Olostep's parsers. For more complex tasks like searching for data, navigating between pages, handling pagination, or validating results, use the /agents endpoint, which automatically finds and extracts data based on your prompt.
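
A request body for prompt-based extraction might be assembled roughly as follows. This is a sketch under stated assumptions: `url_to_scrape` is the field used in the /scrapes examples on this page and `llm_extract` is the parameter named above, but the inner `prompt` key, the helper `build_extract_payload`, and the example URL are assumptions, so check the API reference for the exact shape before sending it.

```python
import json


def build_extract_payload(url: str, instruction: str) -> dict:
    """Assemble a /scrapes request body that asks for prompt-based extraction."""
    if not url.startswith(("http://", "https://")):
        raise ValueError("url must be an absolute http(s) URL")
    return {
        "url_to_scrape": url,       # field name used in the /scrapes examples
        "llm_extract": {            # parameter named in the text
            "prompt": instruction,  # ASSUMPTION: the inner key name is not confirmed here
        },
    }


payload = build_extract_payload(
    "https://example.com/pricing",  # hypothetical target page
    "List each plan name with its monthly price",
)
print(json.dumps(payload, indent=4))
```

The payload would then be sent with the same `requests.post(API_URL, headers=headers, json=payload)` call used in the scrape example.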