next up previous contents index


31.2.1.2 Table 'archivseiten'

Table 'archivseiten' contains the information regarding the text recognition of a page, i.e. we find the extracted text as well as the OCR definition belonging to the page. Note that the archiving process leaves table 'archivseiten' untouched.

Image t_archiv1e

The following fields are relevant for us:

  • Seite: reference to document and page of table 'archive' (document*1000+page)
  • Ausschliessen: do not treat page with OCR
  • Erfasst: page already treated with OCR
  • Text: memo field for various information
  • Indexiert: shows whether an index has been done for this document (obsolete)
  • OCR: definition (0-x) making the desired language strings available
  • ScreenQuality: reduction factor for screen copy: 0-50