Configure the start address of the content source

Needed permissions: #4

Go to Central Administration → Application Management → Manage service applications → Search Service Application → Crawling → Content Sources. The content source "Confluence Source" should be listed there:

Image: Confluence content source (click to enlarge)

Click Confluence Source to open the Edit Content Source page:

Image: edit content source (click to enlarge)

Under Start Addresses, there is already an address present which serves as a placeholder. Delete the entry confluence://localhost/.

Then enter the start addresses, one per line. Replace <address> with the address of your Confluence installation:

confluence://<address>/?entity=space
confluence://<address>/?entity=page
confluence://<address>/?entity=comment
confluence://<address>/?entity=blogentry
confluence://<address>/?entity=attachment

Important: After the last line there has to be a line break (i.e. a new line)!

If your address begins with "http:" (or "https:"), whether it uses a host name or an IP address, replace that prefix with "confluence:"!

For example, if your Confluence installation is accessible under http://www.contoso.com:8090 the start addresses would look like:

confluence://www.contoso.com:8090/?entity=space
confluence://www.contoso.com:8090/?entity=page
confluence://www.contoso.com:8090/?entity=comment
confluence://www.contoso.com:8090/?entity=blogentry
confluence://www.contoso.com:8090/?entity=attachment
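
The start addresses can also be generated instead of typed by hand. The following is a minimal sketch, not part of the connector: the function name `build_start_addresses` is hypothetical, and it assumes Confluence is served from the root of the host, as in the example above. It only swaps the scheme for "confluence:" and appends the required final line break; any additional HTTPS configuration is not covered.

```python
from urllib.parse import urlsplit

# Entity types taken from the start-address list above.
ENTITIES = ("space", "page", "comment", "blogentry", "attachment")


def build_start_addresses(base_url: str) -> str:
    """Return the text to paste into the Start Addresses box (hypothetical helper)."""
    parts = urlsplit(base_url)
    if parts.scheme not in ("http", "https"):
        raise ValueError(f"expected an http(s) URL, got: {base_url!r}")
    # netloc keeps the host name (or IP address) and the optional port.
    lines = [f"confluence://{parts.netloc}/?entity={entity}" for entity in ENTITIES]
    # The trailing "\n" is the line break required after the last address.
    return "\n".join(lines) + "\n"


print(build_start_addresses("http://www.contoso.com:8090"), end="")
```

Run with the example address, this prints the same five lines shown above.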

If you have multiple Confluence installations you can also add additional start addresses.

Info: HTTPS

If Confluence uses the https scheme, you must configure the start addresses as described in the chapter "HTTPS Configuration" on the Options and Configuration documentation page of Connector for SharePoint Search.

Add crawl rules

Go to Central Administration → Application Management → Manage service applications → Search Service Application → Crawling → Crawl Rules.

Create a new crawl rule by clicking New Crawl Rule. Enter the values on the Add Crawl Rule page as shown in this screenshot:

Image: add first crawl rule (click to enlarge)

Tip
For copying: ://

Click OK to create the crawl rule.

Create another crawl rule by clicking New Crawl Rule again. Enter the values on the Add Crawl Rule page as shown in this screenshot:

Image: add second crawl rule (click to enlarge)

Tip
For copying: https://pages/viewpage.action

Be sure to check "Crawl complex URLs". This ensures that all pages of Confluence can be successfully crawled.

Info: HTTPS

The screenshot uses "http" as the scheme. Replace it with "https" if your Confluence is accessible via a secure connection.

Add file type

Confluence uses URLs ending with .action, which SharePoint detects as the "action" file type. This type must be registered for SharePoint to crawl these pages.
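
To illustrate where the "action" file type comes from, the sketch below extracts the extension from the path of a Confluence page URL. It is only an illustration of path-extension extraction, not SharePoint's actual detection logic; the helper name `url_extension` and the `pageId` value are made up for the example.

```python
import posixpath
from urllib.parse import urlsplit


def url_extension(url: str) -> str:
    """Return the extension of the URL path, without the leading dot."""
    path = urlsplit(url).path  # the query string is not part of the path
    return posixpath.splitext(path)[1].lstrip(".")


# The pageId query parameter is a made-up value for illustration.
print(url_extension("http://www.contoso.com:8090/pages/viewpage.action?pageId=123"))
# prints: action
```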

Go to Central Administration → Application Management → Manage service applications → Search Service Application → Crawling → File Types, which shows a list of file types already known to SharePoint. If action is already present, you can skip to the next step. If not, click New File Type:

Image: add file type (click to enlarge)

In the File extension field, type action and click OK:

Image: add file extension (click to enlarge)

The file extension action should now be listed:

Image: search file extension (click to enlarge)

Configuration of AD Account/Group Mapping

If user groups have different names in SharePoint and Confluence, you have to configure the AD mapping in order to receive search results.

Read how: AD Mapping

IISReset and Search Service Restart

An IISReset and a restart of the search service are required.

Open a command prompt (run it as administrator if applicable), type iisreset and press the Return key to reset IIS.

For SharePoint 2013
1. Type net stop osearch15 and press Return to stop the search service.
2. Type net start osearch15 and press Return to start the search service again.
For SharePoint 2016
1. Type net stop osearch16 and press Return to stop the search service.
2. Type net start osearch16 and press Return to start the search service again.

Warning: Crawl Servers

This step is necessary on all servers that host the Search Crawl role.
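
If the restart has to be repeated on several crawl servers, the command sequence above can be scripted. The following is a minimal sketch under assumptions: the function names are hypothetical, the script must be started from an elevated prompt on the server itself, and it only runs the same commands described above (iisreset, net stop, net start) through Python's `subprocess` module.

```python
import subprocess

# Search service names from the steps above, keyed by SharePoint version.
SEARCH_SERVICE = {"2013": "osearch15", "2016": "osearch16"}


def restart_commands(sharepoint_version: str) -> list:
    """Return the commands for an IIS reset plus a search service restart."""
    service = SEARCH_SERVICE[sharepoint_version]
    return [["iisreset"], ["net", "stop", service], ["net", "start", service]]


def run_restart(sharepoint_version: str) -> None:
    """Run the commands in order (hypothetical helper, Windows only)."""
    for command in restart_commands(sharepoint_version):
        # check=True stops the sequence if a command reports failure.
        subprocess.run(command, check=True)
```

For example, calling `run_restart("2016")` on a SharePoint 2016 crawl server would reset IIS and then stop and start osearch16.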

Crawl content

To test the crawling process you have to start a crawl.

Go to Central Administration → Application Management → Manage service applications → Search Service Application → Crawling → Content Sources. Open the drop-down menu of Confluence Source and click Start Full Crawl.

Depending on how much content you have in Confluence, this can take a while to finish. Refresh the page from time to time (for example by pressing F5) and wait for the Status of the content source to switch to Idle.

Verify that crawling succeeded by clicking View Crawl Log in the drop-down menu of the content source.

The crawl log should show a number of Successes that corresponds to the number of crawlable URLs in your Confluence.

Info
If you see warnings or errors in the crawl log, refer to the Troubleshooting and Frequently Asked Questions articles for help.

Version	Old Version 4	New Version 5
Changes made by	Jona Borishi (External)	Jona Borishi (External)
Saved on	Oct 07, 2019	Oct 07, 2019
