基于云控系统高精度地图驱动的深度强化学习型混合动力汽车集成控制

Deep Reinforcement Learning-based Integrated Control of Hybrid Electric Vehicles Driven by High Definition Map in Cloud Control System

  • 摘要: 在智能化、网联化与新能源化的发展背景下,汽车工业将联合计算机、信息通信、人工智能等领域实现融合性发展。基于新一代信息与通信技术——智能网联汽车云控系统,通过网联数据驱动的形式实现新能源汽车的云控级自动驾驶,将为车辆行驶与动力系统提供革新的规划与控制思路。首先,基于云控系统的资源平台获取目标路段的经纬度、海拔、气象信息,建立包含坡度、曲率、转角等数据在内的高精度模型。其次,提出了一种基于高精地图驱动的深度强化学习型混合动力汽车集成控制方法,通过利用两种深度强化学习算法对整车层的速度与转向以及动力系统层的发动机与变速器进行控制,实现了四种控制策略的同步学习。最后,采用高性能边缘计算设备NVIDIA Jetson AGX Xavier进行了处理器在环测试。结果表明,当变量空间涉及14种状态与4种动作时,深度强化学习型集成控制策略在全程172 km的高速工况下实现了在整车层对速度与转向的精准控制,同时取得了5.53 L/100 km的燃油经济性,并且在嵌入式处理器在环测试中仅消耗104.14 s的计算时间,有效验证了学习型多目标集成控制策略的优化性与实时性。

     

    Abstract: In the context of the development of intelligence, connectivity, and new energy, the automotive industry combines computer, information communication, artificial intelligence(AI) to achieve integrated development. Based on the new generation of information and communication technology--cloud control system(CCS) of intelligent and connected vehicles(ICVs), the cloud-level automatic driving of new energy vehicles is realized driven by connected data, which provides innovative planning and control ideas for vehicle driving and power systems. Firstly, based on the resource platform of CCS, the latitude, longitude, altitude, and weather of the target road are obtained, and a high definition(HD) path model including slope, curvature, and steering angle is established. Secondly, a deep reinforcement learning(DRL)-based integrated control method for hybrid electric vehicle(HEV) drive by the HD model is proposed. By adopting two DRL algorithms, the speed and steering of the vehicle and the engine and transmission in the powertrain are controlled, and the synchronous learning of four control strategies is realized. Finally, processor-in-the-loop(PIL) tests are performed by using the high-performance edge computing device NVIDIA Jetson AGX Xavier. The results show that under a variable space including 14 states and 4 actions, the DRL -based integrated control strategy realizes the precise control of the speed and steering of the vehicle layer under the high-speed driving cycle of 172 km, and achieves a fuel consumption of 5.53L/100km. Meanwhile, it only consumes 104.14s in the PIL test, which verifies the optimization and real-time performance of the learning-based multi-objective integrated control strategy.

     

/

返回文章
返回