IFilter

Last updated

An IFilter is a plugin that allows Microsoft's search engines to index various file formats (as documents, email attachments, database records, audio metadata etc.) so that they become searchable. Without an appropriate IFilter, contents of a file cannot be parsed and indexed by the search engine.

Contents

They can be obtained as standalone packages or bundled with certain software such as Adobe Reader, [Note 1] LibreOffice, Microsoft Office [Note 2] and OpenOffice.

It also refers to the software interface needed to implement such plugins. [1]

How it works

An IFilter acts as a plug-in for extracting full-text and metadata for search engines. A search engine usually works in two steps: [2] [3]

  1. The search engine goes through a designated place, e.g. a file folder or a database, and indexes all documents or newly modified documents, including the various types documents, in the background and creates internal data to store indexing result.
  2. A user specifies some keywords they would like to search for, and the search engine answers the query immediately by looking up the indexing result and responding to the user with all the documents that contains the keywords.

During Step 1, the search engine itself doesn't understand format of a document. Therefore, it looks on Windows registry for an appropriate IFilter to extract the data from the document format, filtering out embedded formatting and any other non-textual data.

Search engines

Windows Indexing Service and the newer Windows Search, Windows Desktop Search, MSN Desktop Search, Internet Information Server, SharePoint Portal Server, Windows SharePoint Services (WSS), Site Server, Exchange Server, SQL Server and all other products based on Microsoft Search technology support indexing technology. Also, IFilters are used by SQL Server as a component of the SQL Server Full Text Search service.

See also

Notes

  1. Adobe provides only the 32-bit IFilter bundled with its Reader software. To install the 64-bit version, there is a standalone package at Acrobat for Windows Downloads Page.
  2. Microsoft provides its Office IFilters bundled and available as standalone packages at Microsoft Office 2010 Filter Packs and 2007 Office System Converter: Microsoft Filter Pack.

Related Research Articles

<span class="mw-page-title-main">Microsoft Office</span> Suite of office software

Microsoft Office, or simply Office, is a family of client software, server software, and services developed by Microsoft. It was first announced by Bill Gates on August 1, 1988, at COMDEX in Las Vegas. Initially a marketing term for an office suite, the first version of Office contained Microsoft Word, Microsoft Excel, and Microsoft PowerPoint. Over the years, Office applications have grown substantially closer with shared features such as a common spell checker, Object Linking and Embedding data integration and Visual Basic for Applications scripting language. Microsoft also positions Office as a development platform for line-of-business software under the Office Business Applications brand.

<span class="mw-page-title-main">PDF</span> Portable Document Format, a digital file format

Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems. Based on the PostScript language, each PDF file encapsulates a complete description of a fixed-layout flat document, including the text, fonts, vector graphics, raster images and other information needed to display it. PDF has its roots in "The Camelot Project" initiated by Adobe co-founder John Warnock in 1991. PDF was standardized as ISO 32000 in 2008. The last edition as ISO 32000-2:2020 was published in December 2020.

<span class="mw-page-title-main">Microsoft Outlook</span> Email and calendaring software

Microsoft Outlook is a personal information manager software system from Microsoft, available as a part of the Microsoft 365 software suites. Though primarily being popular as an email client for businesses, Outlook also includes functions such as calendaring, task managing, contact managing, note-taking, journal logging, web browsing, and RSS news aggregation.

<span class="mw-page-title-main">Adobe Acrobat</span> Set of application software to view, edit and manage files in Portable Document Format (PDF)

Adobe Acrobat is a family of application software and Web services developed by Adobe Inc. to view, create, manipulate, print and manage Portable Document Format (PDF) files.

This is a comparison of both historical and current web browsers based on developer, engine, platform(s), releases, license, and cost.

The following comparison of video players compares general and technical information for notable software media player programs.

<span class="mw-page-title-main">Desktop search</span>

Desktop search tools search within a user's own computer files as opposed to searching the Internet. These tools are designed to find information on the user's PC, including web browser history, e-mail archives, text documents, sound files, images, and video. A variety of desktop search programs are now available; see this list for examples. Most desktop search programs are standalone applications. Desktop search products are software alternatives to the search software included in the operating system, helping users sift through desktop files, emails, attachments, and more.

The following tables compare general and technical information for a number of relational database management systems. Please see the individual products' articles for further information. Unless otherwise specified in footnotes, comparisons are based on the stable versions without any add-ons, extensions or external programs.

Open XML Paper Specification is an open specification for a page description language and a fixed-document format. Microsoft developed it as the XML Paper Specification (XPS). In June 2009, Ecma International adopted it as international standard ECMA-388.

This is an overview of software support for the OpenDocument format, an open document file format for saving and exchanging editable office documents.

Compared with previous versions of Microsoft Windows, features new to Windows Vista are very numerous, covering most aspects of the operating system, including additional management features, new aspects of security and safety, new I/O technologies, new networking features, and new technical features. Windows Vista also removed some others.

<span class="mw-page-title-main">Quick View</span> Windows file viewer software

Quick View is a file viewer in Windows 95, Windows 98 and Windows NT 4.0 operating systems. The viewer can be used to view practically any file.

Oracle XML Publisher (XMLP) is Oracle Corporation's latest reporting technology. It was originally developed to solve the reporting problems faced by Oracle Applications. Osama Elkady from the Applications Technology Group and Tim Dexter from the Financials Group were the main drivers for the product. It was first released with Oracle E-Business Suite 11.5.10 in 2003. Since then it has been integrated into most of Oracle Products including JD Edwards EnterpriseOne application 8.12 and PeopleSoft Enterprise 9, and as a standalone version, XML Publisher Enterprise with no dependency on Oracle Applications. When XML Publisher became part of the Oracle BI Enterprise Edition Suite it was re-branded as Oracle BI Publisher.

Microsoft SQL Server is a proprietary relational database management system developed by Microsoft. As a database server, it is a software product with the primary function of storing and retrieving data as requested by other software applications—which may run either on the same computer or on another computer across a network. Microsoft markets at least a dozen different editions of Microsoft SQL Server, aimed at different audiences and for workloads ranging from small single-machine applications to large Internet-facing applications with many concurrent users.

<span class="mw-page-title-main">Windows Search</span> Desktop search platform by Microsoft

Windows Search is a content index desktop search platform by Microsoft introduced in Windows Vista as a replacement for both the previous Indexing Service of Windows 2000 and the optional MSN Desktop Search for Windows XP and Windows Server 2003, designed to facilitate local and remote queries for files and non-file items in compatible applications including Windows Explorer. It was developed after the postponement of WinFS and introduced to Windows constituents originally touted as benefits of that platform.

OpenSearchServer is an open-source application server allowing development of index-based applications such as search engines. Available since April 2009 on SourceForge for download, OpenSearchServer was developed under the GPL v3 license and offers a series of full text lexical analyzers. It can be installed on different platforms.

The history of Microsoft SQL Server begins with the first Microsoft SQL Server database product – SQL Server v1.0, a 16-bit relational database for the OS/2 operating system, released in 1989.

References