Content Refinement for FAST
Enterprise Crawler

Mongoose has developed a valuable Plug-in tool that refines content being indexed for the FAST Enterprise Crawler. This results in an enhanced, more meaningful search experience for the end-user, as well as considerably increasing the efficiency of the crawling process and reducing the index size. This in turn reduces the administrative effort from an IT resource perspective.

Key benefits include:

  • Stops unwanted elements of web pages from being indexed, greatly improving relevancy and avoiding interference.
  • Significantly improves result summaries, as only required content is indexed.
  • Speeds up the search process for end users.
  • Increased crawling efficiency means reduced administrative effort.
  • Headers, menus, common elements of web pages can be removed.
  • Non-content pages can be omitted.
  • Adds extra functionality for selecting crawled pages to index.
  • Irrelevant pages can be ignored, rather than cluttering your index.
  • Full GUI interface to create content selection rules.
  • Quick integration with the FAST Enterprise Crawler.
  • Cross-platform; support *NIX and Windows versions of ESP.

Differences between standard and refined crawl