Technology & Products

Mimotek's technology offerings are based on the advanced processing of PDF files. In particular, they combine in a single structured PDF file both the description of the document's appearance and the description of its logical structure.

By processing these two alternative views of the document in parallel, the structure can be used to identify and extract portions of a document (for example, articles from a newspaper or chapters and paragraphs from books), while the detailed appearance of the publication is retained, thereby maintaining the original look and feel.
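
The idea of two parallel views can be illustrated with a minimal Python sketch. It is not Mimotek's implementation or file format: the class names, the node kinds ("page", "article", "paragraph") and the sample coordinates are all assumptions made for illustration. Each logical node only points at page regions, so selecting by structure never alters the appearance data.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Region:
    """A rectangle on a page; stands in for the untouched appearance data."""
    page: int
    x0: float
    y0: float
    x1: float
    y1: float


@dataclass
class StructureNode:
    """One logical element. The kinds used here are hypothetical."""
    kind: str
    title: str = ""
    regions: List[Region] = field(default_factory=list)
    children: List["StructureNode"] = field(default_factory=list)


def find_nodes(root, kind):
    """Depth-first search for all nodes of a given logical kind."""
    found = [root] if root.kind == kind else []
    for child in root.children:
        found.extend(find_nodes(child, kind))
    return found


def regions_of(node):
    """Collect every page region covered by a node and its descendants."""
    result = list(node.regions)
    for child in node.children:
        result.extend(regions_of(child))
    return result


# Invented sample data: one newspaper page holding two articles.
newspaper = StructureNode("page", "Front page", children=[
    StructureNode("article", "Election results", children=[
        StructureNode("paragraph", regions=[Region(1, 40, 600, 300, 700)]),
        StructureNode("paragraph", regions=[Region(1, 40, 480, 300, 590)]),
    ]),
    StructureNode("article", "Weather", regions=[Region(1, 320, 600, 560, 700)]),
])

articles = find_nodes(newspaper, "article")
first_article_regions = regions_of(articles[0])
```

In this sketch, the structure tree answers "which parts belong to this article?", while the regions say where the original typography for those parts is found on the page.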

Samples of extracted PDF newspaper articles are available for download (267 kB).

Mimotek's solution consists of two main groups of software components:

  1. Software to encode the document's logical structure in the PDF file and make the content accessible at a fine level of aggregation

    This is a production tool that analyses PDF files and automatically adds structure data to them. The structure information can be written back into the PDF file (as Structured PDF) or exported as XML. Very complex files such as newspaper pages are analysed and structured within seconds.

  2. Tools that allow extraction and reformatting of individual articles from within the structured PDF file, while retaining the original look and feel.

    These tools automate the extraction of content such as newspaper and magazine articles or sections, or chapters and paragraphs from books. The extracted content can be packaged as a new PDF file that keeps all the typographic features of the original publication, for delivery to end users as personalized content. It can also be written to other formats (XML, HTML, Palm/PDA) for electronic delivery.
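
As a rough illustration of exporting logical structure as XML and then picking out a single article, the following Python sketch uses only the standard library. The element names, the `title` attribute and the sample text are assumptions, not Mimotek's actual XML schema, and the sketch covers only the structural side (it does not read or write PDF).

```python
import xml.etree.ElementTree as ET


def to_xml(node):
    """Convert a nested dict describing logical structure into an XML element."""
    elem = ET.Element(node["kind"])
    if node.get("title"):
        elem.set("title", node["title"])
    if node.get("text"):
        elem.text = node["text"]
    for child in node.get("children", []):
        elem.append(to_xml(child))
    return elem


def extract_article(root, title):
    """Return the first <article> element with the given title, or None."""
    for article in root.iter("article"):
        if article.get("title") == title:
            return article
    return None


# Invented sample structure for one newspaper page.
page = {
    "kind": "page",
    "title": "Front page",
    "children": [
        {"kind": "article", "title": "Election results", "children": [
            {"kind": "paragraph", "text": "Votes were counted overnight."},
        ]},
        {"kind": "article", "title": "Weather", "children": [
            {"kind": "paragraph", "text": "Sunny spells."},
        ]},
    ],
}

root = to_xml(page)
article = extract_article(root, "Weather")
xml_text = ET.tostring(article, encoding="unicode")
print(xml_text)
```

The extracted element is a self-contained subtree, so it could be handed to a separate formatter (for HTML, say) without touching the rest of the page.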

The tools are delivered as plug-ins for Adobe Acrobat 6 and 7 and, in the case of server products, as applications built on the Adobe PDF Library, running on Microsoft Windows platforms.

© 2007 Mimotek b.v.