Introducing MANtIS: a novel Multi-Domain Information Seeking Dialogues Dataset release_ywrcnweumzhdxh46de64r4vs74

by Gustavo Penha, Alexandru Balan, Claudia Hauff

Released as a article .

2019  

Abstract

Conversational search is an approach to information retrieval (IR), where users engage in a dialogue with an agent in order to satisfy their information needs. Previous conceptual work described properties and actions a good agent should exhibit. Unlike them, we present a novel conceptual model defined in terms of conversational goals, which enables us to reason about current research practices in conversational search. Based on the literature, we elicit how existing tasks and test collections from the fields of IR, natural language processing (NLP) and dialogue systems (DS) fit into this model. We describe a set of characteristics that an ideal conversational search dataset should have. Lastly, we introduce MANtIS (the code and dataset are available at https://guzpenha.github.io/MANtIS/), a large-scale dataset containing multi-domain and grounded information seeking dialogues that fulfill all of our dataset desiderata. We provide baseline results for the conversation response ranking and user intent prediction tasks.
In text/plain format

Archived Files and Locations

application/pdf  642.6 kB
file_42w2hr7h3jcs7hpprtsl7x2ghe
arxiv.org (repository)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article
Stage   submitted
Date   2019-12-10
Version   v1
Language   en ?
arXiv  1912.04639v1
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: da5ef193-be22-40bf-ab11-3b13a13bd213
API URL: JSON