Monday, September 15, 2008

Preparing to crawl content (Office SharePoint Server 2007)
After new sources of content that you need to crawl have been identified, you must make the necessary preparations before you are ready to crawl that content. Preparing to crawl content includes the following tasks:
·
Add a content source for search (Office SharePoint Server). You use content sources to specify what content to crawl.
·
Install IFilters (Office SharePoint Server 2007). Ensure that the required IFilters are installed on the index server. Microsoft Office SharePoint Server 2007 uses IFilters to open and read the content that is crawled so that it can be indexed.
·
Install protocol handlers (Office SharePoint Server). Ensure that the protocol handlers that are required to access the content specified in the content sources are installed on the index server.
·
Configure proxy server settings for search (Office SharePoint Server 2007). Ensure that you make any proxy server setting changes that are necessary to crawl the new content. For example, you might specify that the proxy server should not be used to access some addresses in your content sources.
·
Configure SSL certificate warning (Office SharePoint Server 2007). Verify whether Office SharePoint Server 2007 should ignore Secure Sockets Layer (SSL) certificate name warnings. If you are using the HTTPS protocol to crawl content and you know that the SSL certificate name does not exactly match the name that is expected, you can choose to turn off these warnings.
·
Change the contact e-mail address (Office SharePoint Server 2007). Ensure that administrators of crawled servers can reach the search administrator by supplying a contact e-mail address that is highly available.
·
Manage crawler impact rules (Office SharePoint Server 2007). Manage the impact that crawling has on the servers being crawled. · Specify timeout settings (Office SharePoint Server 2007). Make any necessary changes to the time-out settings to compensate for the speed at which the servers you want to crawl can serve content requests.

No comments: