NeurST: Neural Speech Translation Toolkit release_ucitfz7j3bdcpdxsrwnsynbqnu

by Chengqi Zhao and Mingxuan Wang and Lei Li

Released as a article .

2021  

Abstract

NeurST is an open-source toolkit for neural speech translation. The toolkit mainly focuses on end-to-end speech translation, which is easy to use, modify, and extend to advanced speech translation research and products. NeurST aims at facilitating the speech translation research for NLP researchers and building reliable benchmarks for this field. It provides step-by-step recipes for feature extraction, data preprocessing, distributed training, and evaluation. In this paper, we will introduce the framework design of NeurST and show experimental results for different benchmark datasets, which can be regarded as reliable baselines for future research. The toolkit is publicly available at https://github.com/bytedance/neurst/ and we will continuously update the performance of NeurST with other counterparts and studies at https://st-benchmark.github.io/.
In text/plain format

Archived Files and Locations

application/pdf  150.7 kB
file_knzq53k7yjcqdp5ucrvpnnggym
arxiv.org (repository)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article
Stage   submitted
Date   2021-06-10
Version   v2
Language   en ?
arXiv  2012.10018v2
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 66763aee-65fc-4b70-9800-c408857924bf
API URL: JSON