Simple Processors for Executing Scalar/Vector/Matrix Instructions: Design, Implementation, and Performance Evaluation