TY - GEN
T1 - Spanish Historical Handwritten Text Recognition with Deep Learning
AU - Choque Dextre, Gustavo Jorge
AU - Beltrán Castañón, César
AU - Pineda Ancco, Ferdinand
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - The recognition of historical texts presents a significant challenge due to a range of factors, such as the physical deterioration of manuscripts and the diverse, complex writing styles commonly found in these documents. These factors complicate the accurate interpretation and processing of historical documents. In recent years, numerous handwritten text recognition (HTR) models have been developed, targeting a variety of languages including English, Chinese, Arabic and Japanese, among others. Despite of this progress, there has been a notable lack of HTR initiatives specially focused on the Spanish language, mainly due to the scarcity of publicly available datasets that could support the development of solution for this specific language. This publication presents the application of Deep Learning techniques based on an Encoder-Decoder Neural Network architecture and Gated Convolutional Neural Networks (Gated-CNN), which in recent years have demonstrated outstanding results in addressing this problem. Additionally, the application of Transfer Learning is employed to improve the accuracy of recognition of historical texts in Spanish. The experiments show that the application of these methods can provide outstanding results, in addition the application of other techniques such as Data Augmentation and N-gram Language Models lead to significant improvements in the results. The use of a new dataset of historical texts in Spanish is also proposed, made up of 1000 elements taken from Peruvian historical texts referring to the 18th century.
AB - The recognition of historical texts presents a significant challenge due to a range of factors, such as the physical deterioration of manuscripts and the diverse, complex writing styles commonly found in these documents. These factors complicate the accurate interpretation and processing of historical documents. In recent years, numerous handwritten text recognition (HTR) models have been developed, targeting a variety of languages including English, Chinese, Arabic and Japanese, among others. Despite of this progress, there has been a notable lack of HTR initiatives specially focused on the Spanish language, mainly due to the scarcity of publicly available datasets that could support the development of solution for this specific language. This publication presents the application of Deep Learning techniques based on an Encoder-Decoder Neural Network architecture and Gated Convolutional Neural Networks (Gated-CNN), which in recent years have demonstrated outstanding results in addressing this problem. Additionally, the application of Transfer Learning is employed to improve the accuracy of recognition of historical texts in Spanish. The experiments show that the application of these methods can provide outstanding results, in addition the application of other techniques such as Data Augmentation and N-gram Language Models lead to significant improvements in the results. The use of a new dataset of historical texts in Spanish is also proposed, made up of 1000 elements taken from Peruvian historical texts referring to the 18th century.
KW - Convolutional Neural Network
KW - Deep Learning
KW - Handwritten Text Recognition
KW - Recurrent Neural Network
KW - Transfer Learning
UR - http://www.scopus.com/inward/record.url?scp=105006818440&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-91428-7_23
DO - 10.1007/978-3-031-91428-7_23
M3 - Conference contribution
AN - SCOPUS:105006818440
SN - 9783031914270
T3 - Communications in Computer and Information Science
SP - 329
EP - 341
BT - Information Management and Big Data - 11th Annual International Conference, SIMBig 2024, Proceedings
A2 - Lossio-Ventura, Juan Antonio
A2 - Ceh-Varela, Eduardo
A2 - Díaz, Eduardo
A2 - Paz Espinoza, Freddy
A2 - Tadonki, Claude
A2 - Alatrista-Salas, Hugo
PB - Springer Science and Business Media Deutschland GmbH
T2 - 11th Annual International Conference on Information Management and Big Data, SIMBig 2024
Y2 - 20 November 2024 through 22 November 2024
ER -