Version: 3.1
Back to
The LEA Document Object reads information from common document formats,
and is implemented in the file DOCCOM.DLL.
See The Lynx Exploration Archive Object for details of how to
add other file formats, or attributes to the information read by default.
The following attributes are read from document files.
The attributes are defined in the INI file DOCOBJ.INI.
The following file formats can be recognised:
HTML is the standard format used to publish documents on the internet. As such it is widely portable, and a good standard interchange format. The Title attribute is read from the BODY of the HTML document. Author, subject and keywords are read from META tags in the document header.
Microsoft Office files are stored as OLE structured files, and meta-information can be read from these files independently of the applications used to write them. Attributes are read from the Document Summary Information. OLE files are created by the following applications: Word, Excel, PowePoint and Binder. Embedded documents, such as a Microsoft Word document containing an embedded Excel spreadsheet, or a Binder file containing a number of Word documents, will be classified as logical files (see LEA Base Object Attributes).
AmiPro documents contain only a description field, which is read as the document title.
WordPro documents by default contain Author, Description and Keywords fields.
Adobe PDF is a platform independent document format used by Adobe Acrobat Exchange and Acrobat Reader.
RTF is an application-independent format. The mechanism for storing document meta-data is not standardised between applcaitions, however, the document attributes will be read from the file if present in the form written by Microsoft or Lotus applications.
Lynx does not supply customised viewers for common document formats, since most PCs will already be equipped with a number of applications and utilities for viewing these types of files. A useful general purpose viewing utility is Quick View Plus from Inso Corporation.
See also
[ The Lynx Exploration
Archive Object ]