Strings of Natural Languages: Unsupervised Analysis and Segmentation on the Expression Level