Language Corpora Annotation and Processing