Natural Language Processing Using Very Large Corpora (Text, Speech and Language Technology)