Home Search Member List Faq Register Login  
UltimateSearch
PPTX Indexing Error

Thread Starter: tgaskell   Started: 10-16-2015 12:54 PM   Replies: 3
 Karamasoft Support Forums » General Discussions » UltimateSearch » PPTX Indexing Error
 Printable Version    « Previous Thread   Next Thread »
  16 Oct 2015, 12:54 PM
tgaskell is not online. Last active: 1/25/2016 9:12:32 AM tgaskell

Not Ranked
Joined on 10-16-2015
Posts 4
PPTX Indexing Error
UltimateSearch 3.7 is incorrectly indexing some of the text from PowerPoint files.

Apparently, the indexer is stringing together all PowerPoint text areas without inserting spaces or other breaks between the separate blocks, meaning that some words cannot be found. Is there any way to correct this?

Steps to reproduce:
1. Create a new PowerPoint file (I used PowerPoint 2010).
2. On the first page, set the title to "The Hobbit" and the subtitle to "There and Back Again.
3. Under File -> Info, set the document title to "Book List"
4. Save the file, add it to a directory indexed by UltimateSearch, and run the indexer.
5. Use UltimateSearch to search for "Hobbit". The document will not be found.
6. Use UltimateSearch to search for "There". The document will not be found.
7. Use UltimateSearch to search for "HobbitThere" as one word. The document will be found.

  
  16 Oct 2015, 5:35 PM
Karamasoft is not online. Last active: 5/8/2018 10:36:45 AM Karamasoft

Top 10 Posts
Joined on 09-05-2004
Posts 6,820
Re: PPTX Indexing Error
We followed your steps, but couldn't reproduce the issue. All the words except "there" got indexed properly because "there" is one of the stop words (stopWord) on the UltimateSearch.config file.

Please note that you need to click "Index Full" on your admin page (Admin/UltimateSearch.admin.aspx) to index your file, and then click the "Display Indexed Words" to see the list of indexed words.

We used Office 2013 though. If you send us your PPTX file, we can test with it.

  
  20 Oct 2015, 8:32 AM
tgaskell is not online. Last active: 1/25/2016 9:12:32 AM tgaskell

Not Ranked
Joined on 10-16-2015
Posts 4
Re: PPTX Indexing Error
On further investigation I found that both your solution and mine created the correct word list when run locally on the ASP.NET Development Server but created an incorrect word list when deployed to a test server running IIS 6.0.

On my local environment the following error was logged while generating the index, but the index was generated correctly:

10/19/2015 5:42:38 PM, IFilter cannot initialize for file C:\SourceDir\WebApp\UltimateSearchInclude\Index\tempaf0f7aa6-c7ce-42d7-b348-0f20efbf0752.pptx.

On the test server, that error was NOT present, but the index was generated incorrectly.

(The words "screen" and "show" also appear on the index only when the solution is run on IIS if that gives any clue to what is happening internally.)

  
  20 Oct 2015, 3:57 PM
Karamasoft is not online. Last active: 5/8/2018 10:36:45 AM Karamasoft

Top 10 Posts
Joined on 09-05-2004
Posts 6,820
Re: PPTX Indexing Error
We'll send you a new dll to test with.
  
 Page 1 of 1 (4 items)
Karamasoft Support Forums » General Discussions » UltimateSearch » PPTX Indexing Error

You can add attachments
You can post new topics
You can reply to topics
You can delete your posts
You can edit your posts
You can create polls
You can vote in polls
Forum statistics are enabled
Forum is unmoderated

© 2002-2018 Karamasoft LLC. All rights reserved.