Multivariate calibration of classifier scores into probability space: Comparison of uni- and multivariate calibration techniques for classification and introduction of the Dirichlet Calibration