Skip to main content
Skip table of contents

Content Landscape Assessment (CLA) Requirements up to 1 TB

A Content Landscape Assessment (CLA) with Shinydocs helps you establish an ongoing program for eliminating redundant, obsolete, and trivial documents, files, and media across your file shares. For more information, visit https://shinydocs.com/audit-your-data-get-started/ on the Shinydocs website or contact Shinydocs Support.

Technical Requirements (up to 1TB)

Hardware

Hardware

Requirement

CPU

Intel i5, i7, or i9 CPU or equivalent

Memory

16 GB RAM

Storage

100 GB free disk space (SSD highly recommended)

Software

Software

Requirement

OS

Either:

  • Windows Server 2019 or newer

  • Windows 10 or newer

Java®/OpenJDK/Amazon Corretto®

Version 11.x (latest of version 11)

If OCR

Visual C++

Visual C++ Runtime (64 bit) for Windows
https://docs.microsoft.com/en-US/cpp/windows/latest-supported-vc-redist?view=msvc-170#visual-studio-2015-2017-2019-and-2022

If Enterprise Search

Microsoft Internet Information Services (IIS)

Version 10.0

  • Included with Windows 2019

  • Available as an installable feature in Windows 10

Connector Requirements

File Share/File System

  • Your user account can access the data you want to crawl

  • You can browse the file share/file system from your machine

OpenText Content Server

  • URL that you use to access Content Server’s web interface

    • This typically ends in .../cs.exe

  • The username you use to access Content Server

    • Typically, these are your Windows credentials

  • The password of the username you use to access the Content Server

Microsoft SharePoint® (Online Office 365)

Authentication to Microsoft SharePoint® Online (Office 365) requires Azure application registration. See Setting up Azure authentication for more information. Additional requirements are as follows:

  • Your computer can access the internet and your account has read access to the site you want to crawl. An easy way to check if this connection can be made is via web browser, though more advanced methods do exist (example: https://ACMECorp.sharepoint.com/site/sitename)

  • Azure app registration permissions for SharePoint® Online (Office 365) are configured as follows:

Crawl

Migration (destination)

Sites.Read.All

Provides read access to Sharepoint® site collections

Sites.ReadWrite.All

Allows the crawling account write access to sites/libraries

User.Read.All

Provides read access to user profiles (required for crawling Sharepoint®)

User.Read.All

Provides read access to user profiles (required for crawling Sharepoint®)

Microsoft SharePoint® (On-Premise)

Authentication to Microsoft SharePoint® on-premise requires an administrator username and password. Additional requirements are as follows:

  • The servers running the Cognitive Toolkit can access the site root and all sub-sites. An easy way to check if this connection can be made is via web browser, though more advanced methods do exist (example: https://ACMECorp.sharepoint.com/site)

  • Service account permissions for SharePoint® (on-premise) are configured as follows:

Crawl

Migration (if destination)

Full Read

Has full read-only access

Provides read access to SharePoint site collections

Full Control

Has full control

Allows the crawling account write access to sites/libraries

Microsoft OneDrive

Authentication to Microsoft OneDrive requires Azure application registration. See Setting up Azure authentication for more information. An internet connection is required to access OneDrive.

Microsoft Exchange® Online (Office 365)

  • Azure® app registration. See Setting up Azure authentication for more information

  • Servers running the Cognitive Toolkit can access EWS (Exchange Web Services). An easy way to check if this connection can be made is via web browser, though more advanced methods do exist (example: https://outlook.office365.com/ews/exchange.asmx)

  • For Microsoft Exchange Online®, either an internet connection is required or a network proxy must be set to allow the Cognitive Toolkit machine to connect to:

    • https://outlook.office365.com

    • https://login.microsoftonline.com

  • Service Account permissions are configured as follows:

Azure® Office 365 Exchange Online API

Username and Password Authentication

full_access_as_app

Provides full access to mailboxes for the purpose of crawling

ApplicationImpersonation

Allows the crawling account to impersonate each of the users and mailboxes 

Grant Admin Consent

Required step in Azure App Registration

 

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.