Advanced optimization techniques for MT simulation on GPUs: Using massively parallel devices for scientific computing