Trillions of documents are digitized every year through a process which in one phase, layout analysis or ZONING, still relies on manual intervention. Zoning, preceding OCR and all content classification in the digitization process, is imperative for the result to be usable at all. No reliable tool exists to automatically zone various document types today, meaning costly human intervention is always required. This consortium proposes a software concept which will resolve this problem.
Project leader: Fredrika Haneborg-Luhr
Institution: LUMEX AS