Figure 10 shows the training curve of the proposed DQN-based UAV detouring algorithm when there are 30 sensors and 6 obstacles. We can see that the reward gradually increases from the beginning as the number of training episodes grows. The total reward increases significantly after the 4000th episode out of the 7000 in who