API - Resume the Web Crawler

Web Crawler
Web Crawler API

Resume the Web Crawler

Resume a paused web crawler immediately, without waiting for the next cron tick. Continues from where the crawler left off.

Endpoint

GET https://opensolr.com/solr_manager/api/start_crawl
This uses the same start_crawl endpoint with clean=no to continue from where the crawler left off, rather than starting a fresh crawl.

Parameters

ParameterStatusDescription
emailRequiredYour Opensolr registration email address
api_keyRequiredYour Opensolr API key
core_nameRequiredThe name of the index to resume crawling for
cleanRequiredSet to no to resume from where the crawler left off
If the crawler has finished processing all URLs in its queue (todo is empty), resuming will have no effect. Check stats first using the Get LIVE Web Crawler Stats API to see if there are remaining pages to crawl.

Code Examples

cURL

curl -s "https://opensolr.com/solr_manager/api/start_crawl?email=YOUR_EMAIL&api_key=YOUR_API_KEY&core_name=my_solr_core&clean=no"

PHP

$params = http_build_query([
    'email'     => 'YOUR_EMAIL',
    'api_key'   => 'YOUR_API_KEY',
    'core_name' => 'my_solr_core',
    'clean'     => 'no',
]);
$response = file_get_contents("https://opensolr.com/solr_manager/api/start_crawl?{$params}");
$result = json_decode($response, true);
print_r($result);

Python

import requests

response = requests.get("https://opensolr.com/solr_manager/api/start_crawl", params={
    "email": "YOUR_EMAIL",
    "api_key": "YOUR_API_KEY",
    "core_name": "my_solr_core",
    "clean": "no",
})
print(response.json())

Related Documentation

Need help with the Opensolr Web Crawler? We are here to help.

Contact Support