What Questions Should Robots Be Able to Answer? A Dataset of User Questions for Explainable Robotics
Abstract
A dataset of user questions for household robots is introduced, providing insights into the types of questions users ask and the information robots need to answer, supporting the development of conversational interfaces and explanation strategies.
With the growing use of large language models and conversational interfaces in human-robot interaction, robots' ability to answer user questions is more important than ever. We therefore introduce a dataset of 1,893 user questions for household robots, collected from 100 participants and organized into 12 categories and 70 subcategories. Most work in explainable robotics focuses on why-questions. In contrast, our dataset provides a wide variety of questions, from questions about simple execution details to questions about how the robot would act in hypothetical scenarios -- thus giving roboticists valuable insights into what questions their robot needs to be able to answer. To collect the dataset, we created 15 video stimuli and 7 text stimuli, depicting robots performing varied household tasks. We then asked participants on Prolific what questions they would want to ask the robot in each portrayed situation. In the final dataset, the most frequent categories are questions about task execution details (22.5%), the robot's capabilities (12.7%), and performance assessments (11.3%). Although questions about how robots would handle potentially difficult scenarios and ensure correct behavior are less frequent, users rank them as the most important for robots to be able to answer. Moreover, we find that users who identify as novices in robotics ask different questions than more experienced users. Novices are more likely to inquire about simple facts, such as what the robot did or the current state of the environment. As robots enter environments shared with humans and language becomes central to giving instructions and interaction, this dataset provides a valuable foundation for (i) identifying the information robots need to log and expose to conversational interfaces, (ii) benchmarking question-answering modules, and (iii) designing explanation strategies that align with user expectations.
Community
Our dataset is also available to explore on HuggingFace: https://huggingface.co/datasets/lwachowiak/xai-questions-dataset
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation (2025)
- Multi-Hop Question Answering: When Can Humans Help, and Where do They Struggle? (2025)
- Asking Clarifying Questions for Preference Elicitation With Large Language Models (2025)
- $How^{2}$: How to learn from procedural How-to questions (2025)
- CARIS: A Context-Adaptable Robot Interface System for Personalized and Scalable Human-Robot Interaction (2025)
- How Good are Foundation Models in Step-by-Step Embodied Reasoning? (2025)
- Presenting Large Language Models as Companions Affects What Mental Capacities People Attribute to Them (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
 You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: 
@librarian-bot
	 recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper
