To index attachments:
The Apache Solr Search configuration page now displays the Attachments tab. Use the settings on this tab to configure the file attachment indexing settings.
The Apache Solr Attachments module is compatible only with Apache Solr Search. If you are using the Search API module, you can use the Search API attachments module instead. Configure it to work with Acquia Search. Be sure to use version 7.x-1.4 or later; earlier versions of the Search API attachments module are not compatible with Acquia Search.
The Attachments tab of the Apache Solr Search configuration page contains the following configuration options for indexing attachments in Apache Solr Search:
| S. No. | Item | Description |
|---|---|---|
| 1 | Excluded file extensions | A space-separated list of file extensions that are excluded from indexing. Modify this list to suit the needs of your site. Extensions are internally mapped to a MIME type, so it is not necessary to include variations that map to the same type. For example, tif is sufficient to exclude both the tif and tiff file extensions. |
| 2 | Extract using | Acquia Search includes Apache Tika for indexing documents. For best performance, select Solr (remote server). |
| 3 | File Size Limit | If a file is larger than this limit, do not index it. Default is 41943040 bytes (40MB). |
| 4 | Tika directory | Leave this blank. |
| 5 | Tika jar file | Leave this set to the default value. |
| 6 | Tika extracting servlet path | update/extract |
| 7 | Tika extracting servlet parameters | extractOnly=true |
File attachments get indexed at the same time as their parent entities. Under Actions, you can:
| Action | Description |
|---|---|
| Clear the attachment text extraction cache | Clears all extracted data. |
| Delete the attachments from index | Deletes all the attached files on your site from the Acquia Search index. You need to do this if you change what types of files should be indexed, if your search index becomes corrupted, or if you install a new schema.xml. |
| Test your tika extraction | Tests if your Tika configuration settings work. |
If this content did not answer your questions, try searching or contacting our support team for further assistance.
To index attachments:
The Apache Solr Search configuration page now displays the Attachments tab. Use the settings on this tab to configure the file attachment indexing settings.
The Apache Solr Attachments module is compatible only with Apache Solr Search. If you are using the Search API module, you can use the Search API attachments module instead. Configure it to work with Acquia Search. Be sure to use version 7.x-1.4 or later; earlier versions of the Search API attachments module are not compatible with Acquia Search.
The Attachments tab of the Apache Solr Search configuration page contains the following configuration options for indexing attachments in Apache Solr Search:
| S. No. | Item | Description |
|---|---|---|
| 1 | Excluded file extensions | A space-separated list of file extensions that are excluded from indexing. Modify this list to suit the needs of your site. Extensions are internally mapped to a MIME type, so it is not necessary to include variations that map to the same type. For example, tif is sufficient to exclude both the tif and tiff file extensions. |
| 2 | Extract using | Acquia Search includes Apache Tika for indexing documents. For best performance, select Solr (remote server). |
| 3 | File Size Limit | If a file is larger than this limit, do not index it. Default is 41943040 bytes (40MB). |
| 4 | Tika directory | Leave this blank. |
| 5 | Tika jar file | Leave this set to the default value. |
| 6 | Tika extracting servlet path | update/extract |
| 7 | Tika extracting servlet parameters | extractOnly=true |
File attachments get indexed at the same time as their parent entities. Under Actions, you can:
| Action | Description |
|---|---|
| Clear the attachment text extraction cache | Clears all extracted data. |
| Delete the attachments from index | Deletes all the attached files on your site from the Acquia Search index. You need to do this if you change what types of files should be indexed, if your search index becomes corrupted, or if you install a new schema.xml. |
| Test your tika extraction | Tests if your Tika configuration settings work. |
If this content did not answer your questions, try searching or contacting our support team for further assistance.