Bibliography#
Gonzalo Navarro. A guided tour to approximate string matching. ACM Comput. Surv., 33(1):31–88, March 2001. doi:10.1145/375360.375365.
Saul B Needleman and Christian D Wunsch. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol., 48(3):443–453, 1970. doi:10.1016/0022-2836(70)90057-4.
Clemens Neudecker, Konstantin Baierer, Mike Gerber, Christian Clausner, Apostolos Antonacopoulos, and Stefan Pletschacher. A survey of OCR evaluation tools and metrics. In Proc. 6th Int. Workshop Hist. Doc. Imaging Process. (HIP '21), 13–18. 2021. doi:10.1145/3476887.3476888.
Christos Papadopoulos, Stefan Pletschacher, Christian Clausner, and Apostolos Antonacopoulos. The IMPACT dataset of historical document images. In Proc. 2nd Int. Workshop Hist. Doc. Imaging Process. (HIP '13), 123–130. ACM, 2013. doi:10.1145/2501115.2501130.
Uwe Springmann, Christian Reul, Stefanie Dipper, and Johannes Baiter. Ground truth for training ocr engines on historical documents in german fraktur and early modern latin. Journal for Language Technology and Computational Linguistics, 33(1):97–114, Jul. 2018. URL: https://jlcl.org/article/view/220, doi:10.21248/jlcl.33.2018.220.
Unicode Consortium. The Unicode® standard version 17.0. Core Specification, Unicode Consortium, September 2025. URL: https://www.unicode.org/versions/Unicode17.0.0/core-spec/.
Unicode Consortium. Unicode emoji. Unicode Technical Report UAX #51, Unicode Consortium, October 2025. URL: https://www.unicode.org/reports/tr51/ (visited on 2025-09-04).
Unicode Consortium. Unicode normalization forms. Unicode Technical Report UAX #15, Unicode Consortium, July 2025. URL: https://unicode.org/reports/tr15/ (visited on 2025-07-30).
Unicode Consortium. Unicode security mechanisms. Unicode Technical Report UAX #39, Unicode Consortium, September 2025. URL: https://unicode.org/reports/tr39/ (visited on 2025-09-04).
Unicode Consortium. Unicode text segmentation. Unicode Technical Report UAX #29, Unicode Consortium, August 2025. URL: https://unicode.org/reports/tr29/ (visited on 2025-08-17).