posted on 2025-08-22, 11:46authored byR. Taormina, G. Kyritsakas, N. Sourlos, J. A. van der Werf, A. Soodan
<p dir="ltr">We present DRACO (Drinking water and wAstewater COgnitive assistant), an initiative to support trustworthy Large Language Model (LLMs) applications for urban water management. Our first contribution is the release of a dataset of domain-specific texts, reasoning, and coding tasks for water networks and treatment systems. The dataset enables benchmarking and comparison of open- and closed-source models. Results from our 239-question evaluation show that state-of-the-art open-weight models can match—and in some cases outperform—proprietary systems. This finding is promising for utilities interested in on-premise deployment or in fine-tuning open-source models for secure, task-specific use. Moving forward, the DRACO initiative will focus on refining the dataset with more realistic, utility-driven case studies, and embedding this work into a broader agentic framework for developing trustworthy, LLM-enabled workflows for water utilities and authorities. This will be carried out through an open, collaborative process with the broader community. In parallel, we will also assess whether fine-tuning a dedicated model offers a meaningful advantage.</p><p dir="ltr">This paper was presented at the 21st Computing and Control in the Water Industry Conference (CCWI 2025) at the University of Sheffield (1st - 3rd September 2025).</p>
History
Methodology, headings and units
Headings and units are explained in the files
Policy
The data complies with the institution and funders' policies on access and sharing
Sharing and access restrictions
The uploaded data can be shared openly
Data description
The file formats are open or commonly used
Responsibility
The depositor is responsible for the content and sharing of the attached files
Ethics
There is no personal data or any that requires ethical approval