Learning from Imperfections: Building Datasets with Probabilistic Alignment of Handwritten Text: DE