History
This project builds on From Papyrus to Pixels: Optical Character Recognition Applied to Ancient Egyptian Hieratic by Julius Tabin (2022), with code and data on GitHub. The web application was built by Mark-Jan Nederhof, in collaboration with Julius Tabin and Christian Casey.
Aims
This project aims to digitize facsimiles of different hieratic texts, annotated with shapes of glyphs. Image processing techniques are used for automatic segmentation, and OCR techniques are used for automatic classification.
Given the database of shapes, techniques of dimensionality reduction can used to compare manuscripts from different periods, provenances and genres, and to identify different realizations of the same underlying character.
Hosting
This project is hosted in partnership with the Egyptology department of the University of Liège.
The source code is on GitHub.
Bibliography
J. Tabin, M.-J. Nederhof, and C. Casey. Collaborative Annotation and Computational Analysis of Hieratic. In M. Coustaty and A. Fornés (eds.), Document Analysis and Recognition -- ICDAR 2023 Workshops, Lecture Notes in Computer Science, volume 14193, Part 1, pages 267-283, San José, CA, USA, 2023. Springer-Verlag.