hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Global-and-Local Attention-Based Reinforcement Learning for Cooperative Behaviour Control of Multiple UAVs
The School of Computer Science, Northwestern Polytechnical University, Xi’an, China.ORCID iD: 0000-0001-6234-1001
The School of Computer Science, Northwestern Polytechnical University, Xi’an, China.ORCID iD: 0009-0006-2590-928X
The School of Computer Science, Northwestern Polytechnical University, Xi’an, China.ORCID iD: 0000-0002-7557-2965
The School of Computer Science, Northwestern Polytechnical University, Xi’an, China.ORCID iD: 0000-0003-0023-7617
Show others and affiliations
2024 (English)In: IEEE Transactions on Vehicular Technology, ISSN 0018-9545, E-ISSN 1939-9359, Vol. 73, no 3, p. 4194-4206Article in journal (Refereed) Published
Abstract [en]

Due to the strong adaptability and high flexibility, unmanned aerial vehicles (UAVs) have been extensively studied and widely applied in both civil and military applications. Although UAVs can achieve significant cost reduction and performance enhancement in large-scale systems by taking full advantage of their cooperation and coordination, they result in a serious cooperative behaviour control problem. Especially in dynamic environments, the cooperative behaviour control problem which has to quickly produce a safe and effective behaviour decision for each UAV to achieve group missions, is NP-hard and difficult to settle. In this work, we design a global-and-local attention-based reinforcement learning algorithm for the cooperative behaviour control problem of UAVs. First, with the motion and coordination models, we analyze the collision avoidance, motion state update, and task execution constraints of multiple UAVs, and abstract the cooperative behaviour control problem as a multi-constraint decision-making one. Then, inspired from the human-learning process where more attention is devoted to the important parts of data, we design a multi-agent reinforcement learning algorithm with a global-and-local attention mechanism to cooperatively control the behaviours of UAVs and achieve the coordination. Simulation experiments in a multi-agent particle environment provided by OpenAI are conducted to verify the effectiveness and efficiency of the proposed approach. Compared with baselines, our approach shows significant advantages in mean reward, training time, and coordination effect. © 2023 IEEE.

Place, publisher, year, edition, pages
Piscataway: Institute of Electrical and Electronics Engineers (IEEE), 2024. Vol. 73, no 3, p. 4194-4206
Keywords [en]
Global-and-local attention mechanism, reinforcement learning, cooperative behaviour control, multiple UAVs, multi-constraint decision-making
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-54283DOI: 10.1109/TVT.2023.3327571ISI: 001184969900117Scopus ID: 2-s2.0-85179001590OAI: oai:DiVA.org:hh-54283DiVA, id: diva2:1883865
Available from: 2024-07-12 Created: 2024-07-12 Last updated: 2024-07-12Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Tiwari, Prayag

Search in DiVA

By author/editor
Chen, JinchaoLi, TingyangZhang, YingYou, TaoLu, YantaoTiwari, PrayagKumar, Neeraj
By organisation
School of Information Technology
In the same journal
IEEE Transactions on Vehicular Technology
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 100 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf