The paper introduces a new class of problems called spatio-temporal task allocation with quality optimization (STAQ), which requires maximizing the quality of task allocations while ensuring the makespan of the associated schedule is less than a user-specified time budget.
To solve STAQ problems, the authors develop the Quality-Optimized Incremental Task Allocation Graph Search (Q-ITAGS) algorithm. Q-ITAGS uses an interleaved architecture to simultaneously solve task allocation, scheduling, and motion planning. It models both agents and tasks in terms of capabilities, and uses continuous trait-quality maps to capture the relationship between collective capabilities and task performance.
Q-ITAGS employs an active learning module to efficiently learn the unknown trait-quality maps from data. It uses two novel heuristics - Normalized Allocation Quality (NAQ) and Time Budget Overrun (TBO) - to guide the search, balancing the objectives of maximizing allocation quality and respecting the time budget.
The authors provide theoretical analysis, deriving suboptimality bounds on the allocation quality that vary as a function of a single hyperparameter controlling the relative importance of the two heuristics. Extensive experiments on a simulated emergency response domain and a real-world video game dataset demonstrate that Q-ITAGS outperforms a state-of-the-art method in terms of allocation quality while respecting time budget constraints, and that its active learning approach is more sample-efficient than passive learning.
Para Outro Idioma
do conteúdo original
arxiv.org
Principais Insights Extraídos De
by Glen Neville... às arxiv.org 04-12-2024
https://arxiv.org/pdf/2404.07902.pdfPerguntas Mais Profundas