- Browse by Author
Patricia Snell Herzog
Permanent URI for this collection
Browse
Browsing Patricia Snell Herzog by Author "Bolchini, Davide"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
Item Data-To-Question Generation Using Deep Learning(IEEE, 2023) Koshy, Nicole; Dixit, Anshuman; Jadhav, Siddhi Shrikant; Penmatsa, Arun V.; Samanthapudi, Sagar V.; Kumar, Mothi Gowtham Asok; Anuyah, Sydney Oghenetega; Vemula, Gourav; Herzog, Patricia Snell; Bolchini, DavideMany publicly available datasets exist that can provide factual answers to a wide range of questions that benefit the public. Indeed, datasets created by governmental and non- governmental organizations often have a mandate to share data with the public. However, these datasets are often underutilized by knowledge workers due to the cumbersome amount of expertise and embedded implicit information needed for everyday users to access, analyze, and utilize their information. To seek solutions to this problem, this paper discusses the design of an automated process for generating questions that provide insight into a dataset. Given a relational dataset, our prototype system architecture follows a five-step process from data extraction, cleaning, pre-processing, entity recognition using deep learning, and questions formulation. Through examples of our results, we show that the questions generated by our approach are similar and, in some cases, more accurate than the ones generated by an AI engine like ChatGPT, whose question outputs while more fluent, are often not true to the facts represented in the original data. We discuss key limitations of our approach and the work to be done to bring to life a fully generalized pipeline that can take any data set and automatically provide the user with factual questions that the data can answer.Item Question-Generating Datasets: Facilitating Data Transformation of Official Statistics for Broad Citizenry Decision-Making(Universitat Politècnica de València, 2020-05) Yadav, Rahul; Herzog, Patricia Snell; Bolchini, Davide; Lilly Family School of PhilanthropyCitizenry decision-making relies on data for informed actions, and official statistics provide many of the relevant data needed for these decisions. However, the wide, distributed, and diverse datasets available from official statistics remain hard to access, scrutinise and manipulate, especially for non-experts. As a result, the complexities involved in official statistical databases create barriers to broader access to these data, often rendering the data non-actionable or irrelevant for the speed at which decisions are made in social and public life. To address this problem, this paper proposes an approach to automatically generating basic, factual questions from an existing dataset of official statistics. The question generating process, now specifically instantiated for geospatial data, starts from a raw dataset and gradually builds toward formulating and presenting users with examples of questions that the dataset can answer, and for which geographic units. This approach exemplifies a novel paradigm of question-first data rendering, where questions, rather than data tables, are used as a human-centred and relevant access points to explore, manipulate, navigate and cross-link data to support decision making. This approach can automate time-consuming aspects of data transformation and facilitate broader access to data.