speech2text
Syntax
Description
transcript = speech2text(clientObj,audioIn,fs) transcribes speech in the input audio signal to text using a wav2vec 2.0 pretrained deep learning model or a third-party speech service.
Note
To use speech2text with the third-party speech services, you must download the extended Audio Toolbox™ functionality from File Exchange. The File Exchange submission includes a tutorial to get started with the third-party services.
Using wav2vec 2.0 requires Deep Learning Toolbox™ and installing the pretrained model.
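The basic syntax can be sketched as follows. This is a minimal illustration, not an official example. It assumes that Deep Learning Toolbox and the pretrained wav2vec 2.0 model are installed, that the client object is created with speechClient("wav2vec2.0"), and that speech.wav is a hypothetical recording on the MATLAB path.

```matlab
% Read a speech recording. The file name is a placeholder; substitute your own.
[audioIn,fs] = audioread("speech.wav");

% Average the channels so that the input is a single-channel column vector
% (a precaution in case the recording is stereo).
audioIn = mean(audioIn,2);

% Create a client object for the pretrained wav2vec 2.0 model.
% Assumes the model has already been downloaded and installed.
clientObj = speechClient("wav2vec2.0");

% Transcribe the speech to text.
transcript = speech2text(clientObj,audioIn,fs)
```

Because the wav2vec 2.0 model runs locally, this sketch does not need the File Exchange download or any service credentials.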
transcript = speech2text(___,HTTPTimeout=timeout) specifies the time in seconds to wait for the initial server connection to the third-party speech service.
[transcript,rawOutput] = speech2text(___) also returns the unprocessed server output from the third-party speech service.
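The two third-party syntaxes can be combined in one call, as in the sketch below. Treat it as an illustration under assumptions: it presumes the extended Audio Toolbox functionality from File Exchange is installed, that credentials for the chosen service are already configured as the File Exchange tutorial describes, and that "Google" is the service selected through speechClient. The file name speech.wav and the 20-second timeout are placeholder values.

```matlab
% Read a speech recording. The file name is a placeholder; substitute your own.
[audioIn,fs] = audioread("speech.wav");
audioIn = mean(audioIn,2);   % reduce to a single channel if the file is stereo

% Create a client object for a third-party speech service.
% Assumes the File Exchange add-on is installed and credentials are set up.
clientObj = speechClient("Google");

% Wait up to 20 seconds (illustrative value) for the initial server
% connection, and also request the unprocessed server output.
[transcript,rawOutput] = speech2text(clientObj,audioIn,fs,HTTPTimeout=20);
```

The contents of rawOutput depend on the service, so inspect it interactively before writing code that relies on particular fields.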
Examples
Input Arguments
Output Arguments
References
[1] Baevski, Alexei, Henry Zhou, Abdelrahman Mohamed, and Michael Auli. “Wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations,” 2020. https://doi.org/10.48550/ARXIV.2006.11477.
Version History
Introduced in R2022b