This content is not included in
your SAE MOBILUS subscription, or you are not logged in.
Implementation of Reinforcement Learning on Air Source Heat Pump Defrost Control for Full Electric Vehicles
Technical Paper
2018-01-1193
ISSN: 0148-7191, e-ISSN: 2688-3627
This content contains downloadable datasets
Annotation ability available
Sector:
Language:
English
Abstract
Air source heat pumps as the heating system for full electric vehicles are drawing more and more attention in recent years. Despite the high energy efficiency, frost accumulation on the heat pump evaporator is one of the major challenges associated with air source heat pumps. The evaporator needs to be actively defrosted periodically and heat pump heating will be interrupted during defrosting process. Proper defrost control is needed to obtain high average heat pump energy efficiency. In this paper, a new method for generating air source heat pump defrost control policy using reinforcement learning is introduced. This model-free method has several advantages. It can automatically generate optimal defrost control policy instead of requiring manually determination of the control policy parameters and logics. More measurement results can be incorporated into the defrost control policy without too many changes in the reinforcement learning algorithm so that the control policy can be better optimized under wider range of working conditions. The learning features also enable the controller to adapt to the system differences and changes which are impossible to predict a priori when designing defrost control policy. The algorithm was validated using experimentally obtained heating capacity and COP data in frost growth cycle of a heat pump under different conditions. The results showed that reinforcement learning can be used to generate defrost control policy that optimizes energy consumption for various working conditions.
Recommended Content
Topic
Citation
Zhu, J. and Elbel, S., "Implementation of Reinforcement Learning on Air Source Heat Pump Defrost Control for Full Electric Vehicles," SAE Technical Paper 2018-01-1193, 2018, https://doi.org/10.4271/2018-01-1193.Data Sets - Support Documents
Title | Description | Download |
---|---|---|
Unnamed Dataset 1 | ||
Unnamed Dataset 2 |
Also In
References
- Seker , D. , Karatas , H. , and Egrican , N. Frost Formation on Fin-and-tube Heat Exchangers. Part I-Modeling of Frost Formation on Fin-and-tube Heat Exchangers Int. J. Refrig. 27 4 367 374 2004
- Seker , D. , Karatas , H. , and Egrican , N. Frost Formation on Fin-and-tube Heat Exchangers. Part II-Experimental Investigation of Frost Formation on Fin-and-tube Heat Exchangers Int. J. Refrig. 27 4 375 377 2004
- Guo , X.M. , Chen , Y.G. , Wang , W.H. , and Chen , C.Z. Experimental Study on Frost Growth and Dynamic Performance of Air Source Heat Pump System Appl. Therm. Eng. 28 17 2267 2278 2008
- Ding , Y. , Ma , G. , Chai , Q. , and Jiang , Y. Experiment Investigation of Reverse Cycle Defrosting Methods on Air Source Heat Pump with TXV as the Throttle Regulator Int. J. Refrig. 27 6 671 678 2004
- Fahlen , P.O. Frosting and Defrost of Air Coils-Results from Laboratory Testing Boras, Sweden Swedish National Testing and Research Institute 1996
- Ciricillo , S.F. 1985 Heat Pump De-icing Controlling for Energy Conservation and Costs In Proceedings of the Clima 2000 Congress on Heating, Ventilation and Air Conditioning Copenhagen
- Heinzen , R.A. How Adaptive Defrost Maintains Refrigeration System Efficiency Aust. Refrig., Air Conditioning Heat 42 4 12 16 1988
- Muller , E.D. A New Concept for Defrosting Refrigeration Plants Kalte 28 2 52 54 1975
- Watkins , C.J.C.H. and Dayan , P. Q-learning Machine Learning 8 3 279 292 1992
- Jaakkola , T. , Jordan , M. , Singh , S. 1993 On the Convergence of Stochastic Iterative Dynamic Programming Algorithms Massachusetts Institute of Technology Cambridge, MA
- Sutton , R.S. , Barto , A.G. , and Williams , R.J. Reinforcement Learning is Direct Adaptive Optimal Control IEEE Control Systems 12 2 19 22 1992
- Anderson , C.W. Learning to Control an Inverted Pendulum using Neural Networks IEEE Control Syst. Mag. 31 37 1989
- Moore , A.W. 1990 Efficient Memory-based Learning for Robot Control Cambridge University, Computer Science Department
- Sutton , R.S. 1990 First Results with Dyna: An Integrated Architecture for Learning, Planning, and Reacting
- Crites , R.H. , Barto , A.G. 1996 Improving Elevator Performance using Reinforcement Learning
- Ng , A. , Coates , A. , Diel , M. , Ganapathi , V. , Schulte , J. , Tse , B. , Berger , E. , Liang , E. 2006 Autonomous Inverted Helicopter Flight via Reinforcement Learning 363 372