Enabling OCR of TIFF images for SharePoint 2013 Search

By | August 27, 2013

SharePoint 2013 Enterprise Search has the built-in ability to OCR and index the content of your scanned tiff images during a crawl (whether they are are stored in SharePoint or not). This is a very powerful feature, yet a bit mysterious to configure as the configuration steps have changed since the 2010 version. I’ll outline the steps below:

  1. Using Server Manager, ensure the Windows TIFF iFilter feature is enabled on each crawl server
    Windows TIFF iFilter Feature
  2. Open the Local Group Policy Editor and locate the OCR folder beneath Computer Configuration > Administrative Templates.
    Group Policy Editor
  3. Edit the policy setting for “Select OCR languages from a code page”. Choose Enabled and select the appropriate languages.
    Select OCR Languages
  4. Open the SharePoint Management Shell (using Run as Administrator) and run the following commands to configure content parsing for TIFF images.
    $ssa = Get-SPEnterpriseSearchServiceApplication
    New-SPEnterpriseSearchFileFormat -SearchApplication $ssa tif "TIFF Image File" "image/tiff"
    New-SPEnterpriseSearchFileFormat -SearchApplication $ssa tiff "TIFF Image File" "image/tiff"
  5. Restart the SharePoint Search Host Controller service.
    Restart Search Host Controller Service
  6. Open the Search Service Application administration. Under the Crawling navigation item, navigate to File Types. Add two new File Types for tif and tiff.Add File Type
  7. Perform a Full Crawl of your content.

Depending on how many TIFF images are crawled, this may be a considerably longer amount of time than your previous crawl time. Additional planning may be necessary, such as potentially scoping a Content Source to only content that should be OCR’d, or adjusting crawl schedules.

 

6 thoughts on “Enabling OCR of TIFF images for SharePoint 2013 Search

  1. Sreeharsha

    Thanks for the post Trent. Saved my day. Do you have any idea if or how could enable the OCR for image file extensions other than TIFF/TIF?

  2. Nikhil Doomra

    Have you tried searching for the content in Tiff images in SP 2013. It doesn’t work for me?

  3. Milind Saraswala

    Same here it doesn’t work for me. Another point how can I enable “Arabic” language.

  4. Mallikarjuna Reddy

    how to search images in enterprises search in sharepoint 2013.

  5. Matt

    Thank you for this information. Do you think this is also possible if the sysadmin has already OCR scanned all the PDF documents inside of a share? Can SP2013 Search be configured to display the results of PDF contents?

  6. Sam

    I tried to search OCR tiff and pdf files but seems not working for me. Am I missing anything? I simply followed your steps as mentioned above.

Leave a Reply

Your email address will not be published. Required fields are marked *