Configure the start address of the content source

Needed permissions: #4

Go to Central Administration → Application Management → Manage service applications → Search Service Application → Crawling → Content Sources. The content source "Confluence Source" should be listed there:


Image: Confluence content source

Click Confluence Source to open the Edit Content Source page:


Image: edit content source

Under Start Addresses, a placeholder address is already present. Delete the entry confluence://localhost/.

Then enter the start addresses, one per line. Replace <address> with the address of your Confluence installation:

  • confluence://<address>/?entity=space
  • confluence://<address>/?entity=page
  • confluence://<address>/?entity=comment
  • confluence://<address>/?entity=blogentry
  • confluence://<address>/?entity=attachment

Important: The last start address must be followed by a line break (i.e. a new line).

Important: Every start address must use the "confluence:" scheme. If the address of your Confluence installation begins with "http:" (or "https:"), replace that prefix with "confluence:".

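For example, if your Confluence installation is accessible under http://www.contoso.com:8090, the start addresses would look like this:

  • confluence://www.contoso.com:8090/?entity=space
  • confluence://www.contoso.com:8090/?entity=page
  • confluence://www.contoso.com:8090/?entity=comment
  • confluence://www.contoso.com:8090/?entity=blogentry
  • confluence://www.contoso.com:8090/?entity=attachment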

If you have multiple Confluence installations, you can add the start addresses of each installation to the same list.
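
The rules for start addresses described above (one line per entity, the "confluence:" scheme instead of "http:" or "https:", and a line break after the last entry) can be sketched in a few lines of Python. This is only an illustration for preparing the text to paste into the Start Addresses field; the function name start_addresses is hypothetical and not part of the connector, and the sketch does not cover the additional steps from the "HTTPS Configuration" chapter.

```python
from urllib.parse import urlsplit

# Entity types taken from the start-address list above.
ENTITIES = ["space", "page", "comment", "blogentry", "attachment"]


def start_addresses(base_urls):
    """Build the Start Addresses text for one or more Confluence base URLs.

    Hypothetical helper for illustration; not part of the connector.
    """
    lines = []
    for base_url in base_urls:
        parts = urlsplit(base_url)
        # Keep host, port and any path; only the http:/https: scheme
        # is swapped for "confluence:".
        address = (parts.netloc + parts.path).rstrip("/")
        for entity in ENTITIES:
            lines.append(f"confluence://{address}/?entity={entity}")
    # One address per line, with a line break after the last entry.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(start_addresses(["http://www.contoso.com:8090"]), end="")
```

Passing several base URLs to start_addresses yields one block of five lines per installation, which matches the case of multiple Confluence installations.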

Info: HTTPS

If Confluence uses the https scheme, you must configure the start addresses as described in the chapter "HTTPS Configuration" on the Options and Configuration documentation page of the Connector for SharePoint Search.

Add crawl rules

Go to Central Administration → Application Management → Manage service applications → Search Service Application → Crawling → Crawl Rules.

Create a new crawl rule by clicking New Crawl Rule. Enter the values on the Add Crawl Rule page as shown in this screenshot:


Image: add first crawl rule

Tip
For copying: *://*

Click OK to create the crawl rule.

Create a second crawl rule by clicking New Crawl Rule again. Enter the values on the Add Crawl Rule page as shown in this screenshot:


Image: add second crawl rule

Tip
For copying: https://*pages/viewpage.action*

Be sure to check "Crawl complex URLs". This ensures that all pages of Confluence can be successfully crawled.

Info: HTTPS

The screenshot uses "http" as the scheme. Replace it with "https" if your Confluence is accessible via a secure connection.
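
To get a feeling for which URLs the two patterns above cover, the following Python sketch approximates the wildcard matching with the standard fnmatch module. This is an assumption for illustration only: SharePoint's own crawl rule matching is not reproduced here and may differ in detail, and the host wiki.example.com and the pageId value are made up.

```python
from fnmatch import fnmatchcase

# Patterns copied from the two crawl rules above.
RULES = ["*://*", "https://*pages/viewpage.action*"]


def matching_rules(url):
    """Return the patterns that match the URL (shell-style approximation)."""
    return [rule for rule in RULES if fnmatchcase(url, rule)]


# Hypothetical page URL; the "?" query string makes it a "complex" URL.
page_url = "https://wiki.example.com/pages/viewpage.action?pageId=123"
# Hypothetical start address; only the catch-all rule applies to it.
start_url = "confluence://wiki.example.com/?entity=page"

if __name__ == "__main__":
    print(matching_rules(page_url))
    print(matching_rules(start_url))
```

The page URL contains a question mark, which is why the "Crawl complex URLs" option matters for the second rule.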

Add file type

Confluence uses URLs ending with .action, which SharePoint detects as the "action" file type. This file type must be registered so that SharePoint crawls these pages.
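
As a rough illustration of why such URLs end up with the "action" file type, the Python sketch below takes the extension from the path part of a URL and ignores the query string. It is only an approximation of the idea, not SharePoint's actual detection logic; the helper name url_extension and the pageId value are hypothetical.

```python
from posixpath import splitext
from urllib.parse import urlsplit


def url_extension(url):
    """Return the file extension of the URL path, without the leading dot."""
    path = urlsplit(url).path  # drops scheme, host and query string
    return splitext(path)[1].lstrip(".")


if __name__ == "__main__":
    # Hypothetical Confluence page URL.
    print(url_extension("http://www.contoso.com:8090/pages/viewpage.action?pageId=123"))
```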

Go to Central Administration → Application Management → Manage service applications → Search Service Application → Crawling → File Types. This page shows the list of file types already known to SharePoint. If action is already present, you can skip to the next step. If not, click New File Type:


Image: add file type

In the File extension field, type action and click OK:


Image: add file extension

The file extension action should now be listed:


Image: search file extension

Configuration of AD Account/Group Mapping

If user groups have different names in SharePoint and Confluence, you have to configure the AD mapping in order to receive search results.

Read how: AD Mapping

IISReset and Search Service Restart

An IIS reset and a restart of the search service are required.

  1. In the command prompt, type iisreset and press the Return key to reset IIS.

    If applicable, run the command prompt as administrator.
  2. For SharePoint 2013
    1. Type net stop osearch15 and press Return to stop the search service.
    2. Type net start osearch15 and press Return to start the search service again.
  3. For SharePoint 2016
    1. Type net stop osearch16 and press Return to stop the search service.
    2. Type net start osearch16 and press Return to start the search service again.
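
If the restart has to be repeated on several servers, the steps above could be scripted. The following Python sketch only wraps the same three commands (iisreset, net stop, net start) and picks the service name by SharePoint version; the function names are hypothetical, and it assumes Python is available on the server and the script is started from an administrator prompt.

```python
import subprocess

# Search service names per SharePoint version, as listed in the steps above.
SEARCH_SERVICES = {"2013": "osearch15", "2016": "osearch16"}


def restart_commands(sharepoint_version):
    """Return the commands for an IIS reset and a search service restart."""
    service = SEARCH_SERVICES[sharepoint_version]
    return [
        ["iisreset"],
        ["net", "stop", service],
        ["net", "start", service],
    ]


def run_restart(sharepoint_version):
    """Run the commands in order; stops at the first command that fails."""
    for command in restart_commands(sharepoint_version):
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    for command in restart_commands("2013"):
        print(" ".join(command))
```

Calling run_restart("2013") or run_restart("2016") executes the commands; the dry run in the main block only prints them so they can be checked first.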


Warning: Crawl Servers

This step is necessary on all servers that host the Search Crawl role.

Crawl content

To test the crawling process, start a crawl.

Go to Central Administration → Application Management → Manage service applications → Search Service Application → Crawling → Content Sources. Open the drop-down menu of Confluence Source and click Start Full Crawl.

Depending on how much content you have in Confluence, this can take a while to finish. Refresh the page from time to time (for example by pressing F5) and wait for the Status of the content source to switch to Idle.
Verify that crawling succeeded by clicking View Crawl Log in the drop-down menu of the content source.

The crawl log should show a number of Successes that corresponds to the number of crawlable URLs in your Confluence.

Info

If you see warnings or errors in the crawl log, refer to the Troubleshooting and Frequently Asked Questions articles for help.
