As a general rule of thumb running the code on a GPU device will be better, but this is not always the case. If the size of the vectors/matrices get smaller, sometimes the CPU will become faster, and conversely, usually when the matrices dimensions get larger you will get more rapid speedups using a GPU.
…l things and the business objectives are the very high level goals defining product value delivery. Understanding of your software as a system of connected components to deploy your model lies right in the middle. You must understand the architecture of your system, the pipeline of connected blocks. You can then view your model as just another component, block, or module to plug in to the system.