Natural Language Processing for Virtual Reference Analysis release_dfxo7ilbgveg3h2lzviujin2wa

by Ansh Sharma, Kathryn Barrett, Kirsta Stapelfeldt

Published in Evidence Based Library and Information Practice by University of Alberta Libraries.

2022   Volume 17, p78-93

Abstract

Objective – Chat transcript analysis can illuminate user needs by identifying common question topics, but traditional hand coding methods for topic analysis are time-consuming and poorly suited to large datasets. The research team explored the viability of automatic and natural language processing (NLP) strategies to perform rapid topic analysis on a large dataset of transcripts from a consortial chat service. Methods – The research team developed a toolchain for data processing and analysis, which incorporated targeted searching for query terms using regular expressions and natural language processing using the Python spaCy library for automatic topic analysis. Processed data was exported to Tableau for visualization. Results were compared to hand-coded data to test the accuracy of conclusions. Results – The processed data provided insights about the volume of chats originating from each participating library, the proportion of chats answered by operator groups for each library, and the percentage of chats answered by different staff types. The data also captured the top referring URLs for the service, course codes and file extensions mentioned, and query hits. Natural language processing revealed that the most common topics were related to citation, subscription databases, and finding full-text articles, which aligns with common question types identified in hand-coded transcripts. Conclusion – Compared to hand coding, automatic and NLP processing approaches have benefits in terms of the volume of data that can be analyzed and the time frame required for analysis, but they come with a trade-off in accuracy, such as false hits. Therefore, computational approaches should be used to supplement traditional hand coding methods. As NLP becomes more accurate, approaches such as these may widen avenues of insight into virtual reference and patron needs.
In application/xml+jats format

Archived Files and Locations

application/pdf  580.9 kB
file_s3xcbujyqbdwvbhpa5ahbyiak4
journals.library.ualberta.ca (publisher)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article-journal
Stage   published
Date   2022-03-15
Container Metadata
Open Access Publication
In DOAJ
In ISSN ROAD
In Keepers Registry
ISSN-L:  1715-720X
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 101cfa1b-e7bc-443b-8f74-17d46a9b17d5
API URL: JSON