NeurST: Neural Speech Translation Toolkit
release_ucitfz7j3bdcpdxsrwnsynbqnu
by
Chengqi Zhao and Mingxuan Wang and Lei Li
2021
Abstract
NeurST is an open-source toolkit for neural speech translation. The toolkit
mainly focuses on end-to-end speech translation, which is easy to use, modify,
and extend to advanced speech translation research and products. NeurST aims at
facilitating the speech translation research for NLP researchers and building
reliable benchmarks for this field. It provides step-by-step recipes for
feature extraction, data preprocessing, distributed training, and evaluation.
In this paper, we will introduce the framework design of NeurST and show
experimental results for different benchmark datasets, which can be regarded as
reliable baselines for future research. The toolkit is publicly available at
https://github.com/bytedance/neurst/ and we will continuously update the
performance of NeurST with other counterparts and studies at
https://st-benchmark.github.io/.
In text/plain
format
Archived Files and Locations
application/pdf 150.7 kB
file_knzq53k7yjcqdp5ucrvpnnggym
|
arxiv.org (repository) web.archive.org (webarchive) |
2012.10018v2
access all versions, variants, and formats of this works (eg, pre-prints)