MS Sharepoint Crawler

Reading time: 5 minutes

Prerequisites

  1. You need to install the MS Sharepoint Crawler (SPC) by following the SPC Installation and Administration guide.
  2. To understand the basics of working with Crawlers, please read the Working with Crawlers guide first.

Creating Crawler

Navigate to Crawlers in the Cogniware Data Collector administration (accessible at https://YOUR_IP_ADDRESS/datacollector).

Click on Add Crawler button.


Fill in the Name of the Crawler, you may also add Description.

GDPR Tip

The Name of the Crawler is used to describe to DPOs, Data Admins and other users where to look for the file in the system.

For the Crawler configuration, you need to provide the following parameters:

  • Connector ClassSelect the Connector Class you have installed in SPC Installation and Administration.
  • ConnectorsSelect the Sharepoint connector you want to use. To configure the Connector please follow SPC Installation and Administration - for each Sharepoint site you need a separate connector.
  • Object Types Select documents
  • sourceName – Specify the relative path to the document library in MS Sharepoint (you can specify only part of the Library in the path /DocumentLibrary/Subfolder).
    • SharePoint 2016:
      • Starting folder with “/” at the beginning (e. g. /Shared Documents)
    • SharePoint Online:
      • Starting folder without “/” at the beginning (e.g. Shared Documents)
  • sourceTypeSelect document

To find the Path to the document library, open the Sharepoint interface in a browser of your choice. Navigate to the site you want to configure and click "Documents" in the left menu navigation. Now click on the URL in your browser.

You will se something similar to: 

https://my.sharepoint.com/sites/TestSite/Shared%20Documents/Forms/AllItems.aspx

The relative path to be put into the sourceName is what follows after the name of the MS Sharepoint Site and before the specification of the query (/Forms... in this case)

Click the Add Crawler button.


Congratulations! Crawler configuration is done. We can start crawling!

Please reffer to Working with Crawlers on how to start, stop or change the Crawler.


Get me there: