Pouyan, M., Golzari, S., Mousavi, A., Hatam, Ahmad (2016) ‘Improving Q-Learning Using Simultaneous Updating and Adaptive Policy Based on Opposite Action’, Nashriyyah -i Muhandisi -i Barq va Muhandisi -i Kampyutar -i Iran, 14(2), pp. 137-146. doi: