A methodology for using crowdsourced data to measure uncertainty in natural speech release_k7gbsrq2vrdutlxda2d2eu43py

by Lara Martin, Matthew Stone, Florian Metze, Jack Mostow

Entity Metadata (schema)

abstracts[] {'sha1': '58d140e08a58cec487355fb60211fd165d187be8', 'content': 'People sometimes express uncertainty unconsciously in order to add layers of meaning on top of their speech, conveying doubts about the accuracy of the information they are trying to communicate. In this paper, we propose a methodology for annotating uncertainty, which is usually a subjective and expensive process, by using crowdsourcing. In our experiment, we used an online database which consists of colors that more than 200,000 users have named. Based on the amount of unique names that users have given each color, an entropy value was calculated to represent the uncertainty level of the color. A model, which performed better than chance, was created to predict whether or not the color that the participant was describing was ambiguous or borderline, given certain prosodic cues of their speech when asked to name the color verbally. Using crowdsourced data can greatly streamline the process of annotating uncertainty, but our methods have yet to be tested in other domains besides color. By using methods such as ours to measure prosodic attributes of uncertainty, it should be possible to increase the accuracy of voice search.', 'mimetype': 'text/plain', 'lang': 'en'}
container
container_id
contribs[] {'index': 0, 'creator_id': None, 'creator': None, 'raw_name': 'Lara Martin', 'given_name': 'Lara', 'surname': 'Martin', 'role': 'author', 'raw_affiliation': None, 'extra': None}
{'index': 1, 'creator_id': None, 'creator': None, 'raw_name': 'Matthew Stone', 'given_name': 'Matthew', 'surname': 'Stone', 'role': 'author', 'raw_affiliation': None, 'extra': None}
{'index': 2, 'creator_id': None, 'creator': None, 'raw_name': 'Florian Metze', 'given_name': 'Florian', 'surname': 'Metze', 'role': 'author', 'raw_affiliation': None, 'extra': None}
{'index': 3, 'creator_id': None, 'creator': None, 'raw_name': 'Jack Mostow', 'given_name': 'Jack', 'surname': 'Mostow', 'role': 'author', 'raw_affiliation': None, 'extra': None}
ext_ids {'doi': '10.1184/r1/6472973', 'wikidata_qid': None, 'isbn13': None, 'pmid': None, 'pmcid': None, 'core': None, 'arxiv': None, 'jstor': None, 'ark': None, 'mag': None, 'doaj': None, 'dblp': None, 'oai': None, 'hdl': None}
files[] {'state': 'active', 'ident': 'sk6smrbdfrhu7dpibtawsi7uju', 'revision': '9f60814d-3d04-4e53-aa9a-ac463bb21941', 'redirect': None, 'extra': None, 'edit_extra': None, 'size': 299246, 'md5': '83b69a9141416ef3ae58fb331892701f', 'sha1': '0b642c56e892d2ee77f59af2dfd2a3a3db02d123', 'sha256': 'beeb3edfe715bfc0bbf7be873303c8fb001c84ada20adffa8cc154edfa6af7e7', 'urls': [{'url': 'https://s3-eu-west-1.amazonaws.com/pstorage-cmu-348901238291901/11902493/file.pdf', 'rel': 'publisher'}, {'url': 'https://web.archive.org/web/20200225073853/https://s3-eu-west-1.amazonaws.com/pstorage-cmu-348901238291901/11902493/file.pdf', 'rel': 'webarchive'}], 'mimetype': 'application/pdf', 'content_scope': None, 'release_ids': ['k7gbsrq2vrdutlxda2d2eu43py'], 'releases': None}
filesets []
issue
language
license_slug RS-INC
number
original_title
pages
publisher Figshare
refs []
release_date 2018-06-10
release_stage published
release_type article-journal
release_year 2018
subtitle
title A methodology for using crowdsourced data to measure uncertainty in natural speech
version
volume
webcaptures []
withdrawn_date
withdrawn_status
withdrawn_year
work_id rap5z3epgff4hfk23heazczgye
As JSON via API

Extra Metadata (raw JSON)

datacite.license [{'rights': 'In Copyright', 'rightsUri': 'http://rightsstatements.org/page/InC/1.0?language=en'}]
datacite.resourceType Paper
datacite.resourceTypeGeneral Text
datacite.subjects [{'schemeUri': 'http://www.abs.gov.au/ausstats/abs@.nsf/0/6BB427AB9696C225CA2574180004463E', 'subject': '89999 Information and Computing Sciences not elsewhere classified', 'subjectScheme': 'FOR'}]
release_month 6