[1] 左思翔.基于深度强化学习的无人驾驶智能决策控制研究[D].哈尔滨:哈尔滨工业大学,2018.[2] TOURAN A,BRACKSTONE M A,MCDONALD M.A collision model for safety evaluation of autonomous intelligent cruise control[J].Accident analysis &prevention,1999,31(5):567-578.
[3] PADEN B,CAP M,YONG S Z,et al.A survey of motion planning and control techniques for self-driving urban vehicles[J].IEEE transactions on intelligent vehicles,2016,1(1):33-55.
[4] 夏伟,李慧云.基于深度强化学习的自动驾驶策略学习方法[J].集成技术,2017,6(3):29-34.
[5] 翁岳暄,多尼米克·希伦布兰德.汽车智能化的道路:智能汽车,自动驾驶汽车安全监管研究[J].科技与法律,2014 (4):632-655.
[6] GONZALEZ D,PEREZ J,MILANES V,et al.A review of motion planning techniques for automated vehicles[J].IEEE transactions.intelligent transportation systems,2016,17(4):1135-1145.
[7] HINTON G E,OSINDDRO S,TEH Y W.A fast learning algorithm for deep belief nets[J].Neural computation,2006,18(7):1527-1554.
[8] KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenet classification with deep convolutional neural networks[J].Advances in neural information processing systems,2012,25(2):1097-1105.
[9] BOJARSKI M,DEL TESTA D,DWORAKOWSKI D,et al.End to end learning for self-driving cars[EB/OL].(2016-3-25)[2019-09-31].https://arxiv.org/abs/1604.07316.
[10] CHEN C,SEFF A,KORNHAUSER A,et al.Deepdriving:Learning affordance for direct perception in autonomous driving[C]//Proceedings of the IEEE International Conference on Computer Vision.Santiago:IEEE,2015:2722-2730.
[11] MNIH V,KAVUKCUOGLU K,SILVER D,et al.Playing atari with deep reinforcement learning[EB/OL].(2013-10-19)[2019-10-22].https://arxiv.org/abs/1312.5602.
[12] KONDA V R,TSITSIKLIS J N.Actor-critic algorithms[C]//Advances in Neural Information Processing Systems.[S.l.]:The MIT Press,2000:1008-1014.
[13] LILLICRAP T P,HUNT J J,PRITZEL A,et al.Continuous control with deep reinforcement learning[EB/OL].(2015-03-19)[2019-12-20].https://arxiv.org/abs/1509.02971.
[14] 刘赫.动物行为训练的理论基础[J].中国动物保健,2014,16(2):23-25.
[15] SILVER D,LEVER G,HEESS N,et al.Deterministic policy gradient algorithms[C]// Proceedings of the 31st International Conference on International Conference on Machine Learning.[S.l.]:JMLR,2014:387-395.
[16] GERS F A,SCHMIDHUBER J ,CUMMINS F.Learning to forget:continual prediction with LSTM[J].Neural computation,2000,12(10):2451-2471.
[17] WYMANN B,ESPIE E,GUIONNEAU C,et al.TORCS:the open racing car simulator[EB/OL].(2013-12-15)[2019-11-12].http://www.cse.chalmers.se/~chrdimi/papers/torcs.pdf.