Software agent data preprocessing classification data mining