Skip to content

Commit

Permalink
Add MMS_FA bundle
Browse files Browse the repository at this point in the history
The new bundle Wav2Vec2FABundle and its instance MMS_FA are added.
They are specialized for forced alignment, and the usage is explained in
the multilingual FA tutorial.
  • Loading branch information
mthrok committed Aug 3, 2023
1 parent 09aabcc commit 8b72231
Show file tree
Hide file tree
Showing 6 changed files with 306 additions and 138 deletions.
30 changes: 30 additions & 0 deletions docs/source/pipelines.rst
Original file line number Diff line number Diff line change
Expand Up @@ -142,6 +142,36 @@ Pretrained Models
HUBERT_ASR_LARGE
HUBERT_ASR_XLARGE

wav2vec 2.0 / HuBERT - Forced Alignment
---------------------------------------

Interface
~~~~~~~~~

``Wav2Vec2FABundle`` bundles pre-trained model and its associated dictionary. Additionally, it supports appending ``star`` token dimension.

.. image:: https://download.pytorch.org/torchaudio/doc-assets/pipelines-wav2vec2asrbundle.png

.. autosummary::
:toctree: generated
:nosignatures:
:template: autosummary/bundle_class.rst

Wav2Vec2FABundle

.. rubric:: Tutorials using ``Wav2Vec2FABundle``

.. minigallery:: torchaudio.pipelines.Wav2Vec2FABundle

Pertrained Models
~~~~~~~~~~~~~~~~~

.. autosummary::
:toctree: generated
:nosignatures:
:template: autosummary/bundle_data.rst

MMS_FA

.. _Tacotron2:

Expand Down
9 changes: 9 additions & 0 deletions docs/source/refs.bib
Original file line number Diff line number Diff line change
Expand Up @@ -570,3 +570,12 @@ @incollection{45611
URL = {https://arxiv.org/abs/1609.09430},
booktitle = {International Conference on Acoustics, Speech and Signal Processing (ICASSP)}
}

@misc{pratap2023scaling,
title={Scaling Speech Technology to 1,000+ Languages},
author={Vineel Pratap and Andros Tjandra and Bowen Shi and Paden Tomasello and Arun Babu and Sayani Kundu and Ali Elkahky and Zhaoheng Ni and Apoorv Vyas and Maryam Fazel-Zarandi and Alexei Baevski and Yossi Adi and Xiaohui Zhang and Wei-Ning Hsu and Alexis Conneau and Michael Auli},
year={2023},
eprint={2305.13516},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Loading

0 comments on commit 8b72231

Please sign in to comment.