Real-time Speech and Music Classification by Large Audio Feature Space Extraction (Springer Theses)