Cognitively Inspired Video Text Processing