Abstract
In the era of data-intensive scientific discovery, visualization serves as a crucial cognitive tool for researchers, while captions that align with visuals are essential for accurately conveying scientific intent. However, current scientific visualization workflows face significant challenges, including high technical barriers and semantic misalignment among user intent, visual output, and textual descriptions. To address these issues, this paper proposes SciVis-AGE, a visual analytics system based on multi-agent collaboration. Its core methodologies comprise an agent-based task decomposition and operator encapsulation approach for automatic visualization generation, and a multi-agent triangular debate mechanism for semantic alignment and caption optimization. The system effectively reduces the technical burden on domain experts and, through iterative debate among Intent Guardian, Visual Verifier, and Annotation Checker agents, ensures precise alignment of generated images and captions with user intent, visual content, and highlighted features, thereby enhancing the rigor and efficiency of scientific communication.
Get full access to this article
View all access options for this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
