USING CONVOLUTION NEURAL NETWORKS TO DEVELOP ROBUST COMBAT BEHAVIORS THROUGH REINFORCEMENT LEARNING
Loading...
Authors
Cannon, Christopher T.
Goericke, Stefan
Subjects
artificial intelligence
neural network
machine learning
constructive simulation
combat behaviors
reinforcement learning
RL
convolutional neural networks
CNN
artificial intelligence
AI
neural network
machine learning
constructive simulation
combat behaviors
reinforcement learning
RL
convolutional neural networks
CNN
artificial intelligence
AI
Advisors
Darken, Christian J.
Date of Issue
2021-06
Date
Publisher
Monterey, CA; Naval Postgraduate School
Language
Abstract
The success of reinforcement learning (RL), as shown with video games such as StarCraft and DOTA 2 achieving above-human performance levels, begs questions about the future role of the technology in military constructive simulations. The objective of this study was to use convolutional neural networks (CNN) to develop artificial intelligence (AI) agents capable of learning optimal behaviors in simple scenarios featuring multiple unit and terrain types. This thesis sought to incorporate a multi-agent training regimen that could be employed in the domain of military constructive simulations. Eight different scenarios, all with varying levels of complexity, were used to train agents capable of exhibiting multiple types of combat behaviors. Overall, the results demonstrate that the AI agents can learn robust tactical behaviors required to achieve optimal or near-optimal performance in each scenario. The findings additionally indicate that a better understanding of multi-agent training was attained. Ultimately, CNN combined with RL techniques prove to be an efficient and viable method to train intelligent agents in military constructive simulations, and their application can potentially save human resources in the execution of live exercises and missions. It is recommended that future work should investigate how to best incorporate similar deep-RL methods into an existing military program of record constructive simulation.
Type
Thesis
Description
Series/Report No
Department
Computer Science (CS)
Computer Science (CS)
Organization
Identifiers
NPS Report Number
Sponsors
Funder
Format
Citation
Distribution Statement
Approved for public release. Distribution is unlimited.
Rights
This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States. Copyright is reserved by the copyright owner.