Opensolr Changelog

Recent updates and improvements to the Opensolr platform.

Web Crawler Feb 27, 2026

  • Improved Web Crawler indexing is now faster. Crawled pages are sent to Solr in larger batches instead of one at a time, which reduces round-trip overhead and speeds up the overall indexing process.
  • Improved Clicking Resume when the crawler queue is empty now shows a clear message explaining there are no more pages to process, instead of silently doing nothing. It suggests stopping the cron schedule and starting a fresh crawl.
  • Improved The crawler status badge now distinguishes between Running (green), Paused (blue), and Stopped (red). When the cron schedule is active but no crawler processes are running, the dashboard shows Paused instead of Running, so you always know the actual state of your crawl.
  • New Pause and Resume controls for the Web Crawler. You can now temporarily pause a running crawl without losing your cron schedule: the crawler automatically picks back up on the next scheduled tick, or you can click Resume to restart it immediately. The Stop button has been renamed to Stop Cron Schedule to make it clear that it permanently removes the schedule.
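
For readers who index into Solr from their own scripts, the batching idea behind the indexing speed-up can be sketched as follows. This is a minimal illustration, not the crawler's actual code: the helper names (batched, post_batch, index_in_batches), the default batch size of 100, and the update URL you pass in are all assumptions. The only Solr behavior relied on is that an update handler accepts a JSON array of documents in a single POST.

```python
import json
import urllib.request
from itertools import islice
from typing import Callable, Iterable, Iterator


def batched(docs: Iterable[dict], size: int) -> Iterator[list[dict]]:
    """Yield consecutive lists of at most `size` documents."""
    if size < 1:
        raise ValueError("size must be at least 1")
    it = iter(docs)
    while batch := list(islice(it, size)):
        yield batch


def post_batch(update_url: str, batch: list[dict]) -> None:
    """POST one batch as a JSON array to a Solr update handler.

    `update_url` is a placeholder for your own core's update endpoint.
    """
    request = urllib.request.Request(
        update_url,
        data=json.dumps(batch).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        response.read()


def index_in_batches(
    docs: Iterable[dict],
    send: Callable[[list[dict]], None],
    batch_size: int = 100,  # assumed value, tune for your documents
) -> int:
    """Pass documents to `send` one batch at a time; return the request count."""
    requests_made = 0
    for batch in batched(docs, batch_size):
        send(batch)
        requests_made += 1
    return requests_made
```

With 250 pages and a batch size of 100, index_in_batches calls send three times rather than 250, which is where the round-trip savings come from. To send to a real core, wrap post_batch with functools.partial and your own update URL, then pass the result as send.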

Web Crawler Feb 25, 2026

  • Improved Smarter content extraction. The web crawler now uses a dual-extraction strategy: it runs two independent text extraction engines and keeps whichever result captures more real content. Pages with heavy JavaScript, complex layouts, or framework-rendered content (React, Next.js, Angular, Vue) are now detected and rendered automatically. The result: richer, more complete text in your Opensolr Index, especially for modern web applications.
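
The "run two extractors, keep the richer result" idea can be illustrated with a small sketch. Everything here is an assumption made for illustration: the two toy extractors (paragraph text versus text inside article or main blocks), the word-count score standing in for "real content", and all function names. The engines and scoring the crawler actually uses are not described in this changelog, and JavaScript rendering is not covered.

```python
import re
from html.parser import HTMLParser
from typing import Callable, Sequence


class _TextCollector(HTMLParser):
    """Collect text that appears inside the given tags, ignoring script/style."""

    SKIP = {"script", "style"}

    def __init__(self, only_inside: set[str]):
        super().__init__()
        self.only_inside = only_inside
        self.skip_depth = 0    # > 0 while inside <script> or <style>
        self.inside_depth = 0  # > 0 while inside one of the wanted tags
        self.chunks: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1
        elif tag in self.only_inside:
            self.inside_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth:
            self.skip_depth -= 1
        elif tag in self.only_inside and self.inside_depth:
            self.inside_depth -= 1

    def handle_data(self, data):
        if not self.skip_depth and self.inside_depth:
            self.chunks.append(data)


def _extract(html: str, only_inside: set[str]) -> str:
    parser = _TextCollector(only_inside)
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace into single spaces.
    return " ".join(" ".join(parser.chunks).split())


def extract_paragraphs(html: str) -> str:
    """Toy extractor 1: text inside <p> elements only."""
    return _extract(html, {"p"})


def extract_main_blocks(html: str) -> str:
    """Toy extractor 2: text inside <article> or <main> elements."""
    return _extract(html, {"article", "main"})


def content_score(text: str) -> int:
    """Assumed proxy for 'real content': the number of words."""
    return len(re.findall(r"\w+", text))


def pick_richer(
    html: str,
    extractors: Sequence[Callable[[str], str]] = (extract_paragraphs, extract_main_blocks),
) -> str:
    """Run every extractor and return the output with the highest score."""
    return max((extract(html) for extract in extractors), key=content_score)
```

A page whose body text lives in div elements inside an article is served better by the second extractor, while a plain page built from p elements is served better by the first. Running both and comparing scores avoids having to guess the layout in advance.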