Accelerating The Derivation Of Optimal Powertrain Control Strategies Using Reinforcement Learning And Virtual Prototypes