Multimodal Learning toward Recommendation