Documentation

Find answers to your questions quickly and easily

Enable Remote Streaming in Tika Extract with Opensolr

Enabling Remote Streaming in OpenSolr for Drupal Integration

First things first:

Make sure you contact us (or email [email protected]) in order to have remoteStreaming enabled for your Opensolr index. Once we've enabled that, you can go ahead and make sure the below stuff is setup correctly in your index. This should work for any index, not just for use with Drupal.

Context

You're integrating Drupal 10 with OpenSolr and encountering this error:

Remote Streaming is disabled.

—even after setting enableRemoteStreaming="true" inside <requestDispatcher>.


✅ Correct Way to Enable Remote Streaming in OpenSolr

OpenSolr does not rely solely on the <requestDispatcher> setting. While setting this is required, it's not sufficient.

Here's the complete step-by-step to get remote PDF indexing via stream.url working:


🔧 Step 1: Ensure Both of These Settings Exist

In your solrconfig.xml:

1.1 Inside <requestDispatcher>, this must exist:

<requestDispatcher handleSelect="true">
  <requestParsers enableRemoteStreaming="true" multipartUploadLimitInKB="2048000" />
</requestDispatcher>

1.2 Also, inside the ExtractingRequestHandler, enable this:

Look for this section (or add it if it doesn’t exist):

<requestHandler name="/update/extract" class="solr.extraction.ExtractingRequestHandler">
  <lst name="initParams">
    <bool name="allowRemoteStreaming">true</bool>
  </lst>
</requestHandler>

✅ This is the part that is usually forgotten. Without this, Solr will silently reject stream.url requests with that Remote Streaming is disabled error.


🔁 Step 2: Upload Config to OpenSolr Dashboard

Make sure to:

  1. Edit your Solr config set (either via the OpenSolr dashboard or local files)
  2. Re-upload the entire config set ZIP to your core/index using the OpenSolr Control Panel

🔄 Step 3: Reload or Restart Core

After updating and re-uploading:

  • Go to OpenSolr Control Panel
  • Click on your index (59a6fb53f5a58b06f113c1badef132cd)
  • Click “Reload Core” under the Tools menu

🧪 Step 4: Test It

You can test remote PDF indexing using:

curl "https://host.opensolr.com/solr/<INDEX_NAME>/update/extract?stream.url=https://example.com/sample.pdf&literal.id=test-doc&commit=true"

You should see a 200 OK and no longer get the Remote Streaming is disabled error.


📚 Official Documentation

There isn't a single perfect guide, but here are the closest:

  • Solr official: https://solr.apache.org/guide/solr/latest/indexing-guide/extracting-request-handler.html

Or reach out to [email protected]


Read Full Answer

Tika Extractor doesn't work with my Opensolr Index

📄 Troubleshooting Tika Extractor Errors with Opensolr

So, your Tika Extractor is throwing a tantrum (a.k.a. errors) in Drupal or another application? Fear not! Here’s what you can do to get your content extraction back on track—without losing your mind (or your data).


1. Try a Different Solr Version

Before you deep-dive into endless logs, try creating a new Opensolr Index in an environment using Solr Version 9, or even 7, 6, or 5.
Avoid Solr 8.x – it has some famous-for-all-the-wrong-reasons Jetty issues that often cause Tika-related problems.

👉 Create a new Opensolr Index


2. Make Sure Tika Is Enabled in Your Region

If you suspect Tika’s not even present (it happens!), Contact Opensolr Support and let us know.
We’ll happily enable Tika for your index and region if it’s not already there.


3. Confirm Your solrconfig.xml Setup

Check that your solrconfig.xml has the proper Tika Extraction RequestHandler defined:

<requestHandler name="/extract/tika" class="org.apache.solr.handler.extraction.ExtractingRequestHandler" startup="lazy">
    <lst name="defaults">
    </lst>
    <!-- This path only extracts - never updates -->
    <lst name="invariants">
        <bool name="extractOnly">true</bool>
    </lst>
</requestHandler>

This makes sure Tika extraction is only extracting—no accidental document updates sneaking in.


4. When All Else Fails… Book Time With Us (For Free!)

Sometimes, the best solution is a human one!
Book your time with an Opensolr expert (Zoom, free!)
We’ll get your Tika integration working, no sweat.


Pro tip: Keep an eye on Solr and Jetty version compatibility whenever you hit a weird issue. Opensolr support can always point you in the right direction—no Tika tantrum too tough for us!

Read Full Answer




Review us on Google Business
ISO-9001 CERTIFIED ISO-27001 CERTIFIED