Information Extraction from Semi-structured and Unstructured Sources: A Proposed General Entity Recognition System