Add `run_speech_recognition_seq2seq.py` #428

callumm-graphcore · 2023-06-20T15:54:02Z

What does this PR do?

Adds run_speech_recognition_seq2seq.py for training/fine-tuning Seq2Seq speech recognition models, such as Whisper, on the IPU.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

callumm-graphcore · 2023-06-20T15:54:27Z

This should be considered WIP; I still need to test it with e.g. whisper-tiny.

callumm-graphcore · 2023-07-11T10:58:23Z

Sorry, should have clarified: this is no longer WIP

HuggingFaceDocBuilderDev · 2023-07-11T11:11:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

katalinic-gc · 2023-07-17T12:27:18Z

examples/speech-recognition/run_speech_recognition_seq2seq.py

+
+        # if bos token is appended in previous tokenization step,
+        # cut bos token here as it's append later anyways
+        if (labels[:, 0] == self.decoder_start_token_id).all().cpu().item():


Wouldn't this step lead to labels of non-static shape?
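To make the concern concrete, here is a minimal plain-Python sketch, not the PR's code. It assumes, as the question implies, that the compiled IPU graph expects labels of one fixed length. The helper names `strip_bos_dynamic` and `strip_bos_static` and the sample batches are hypothetical. `-100` is the label value that Hugging Face loss functions conventionally ignore. The first helper mirrors the conditional slice in the diff, the second shows one possible way to keep the length constant by re-padding on the right.

```python
IGNORE_INDEX = -100  # label value conventionally ignored by the loss


def strip_bos_dynamic(labels, bos_id):
    """Drop the first column when every row starts with bos_id.

    The returned rows are one element shorter only when the check passes,
    so the output length depends on the data.
    """
    if all(row[0] == bos_id for row in labels):
        return [row[1:] for row in labels]
    return labels


def strip_bos_static(labels, bos_id):
    """Same check, but re-pad on the right so the length never changes."""
    if all(row[0] == bos_id for row in labels):
        return [row[1:] + [IGNORE_INDEX] for row in labels]
    return labels


# Two hypothetical batches, both padded to length 4.
with_bos = [[1, 7, 8, -100], [1, 9, -100, -100]]
without_bos = [[7, 8, -100, -100], [9, 5, 6, -100]]

# Collect the label lengths each helper produces across the two batches.
dynamic_lengths = {len(strip_bos_dynamic(b, 1)[0]) for b in (with_bos, without_bos)}
static_lengths = {len(strip_bos_static(b, 1)[0]) for b in (with_bos, without_bos)}

print("dynamic:", sorted(dynamic_lengths))
print("static:", sorted(static_lengths))
```

With the conditional slice, the two batches come out with different lengths. The re-padded variant keeps a single length regardless of whether the bos token was present.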

katalinic-gc · 2023-07-17T12:57:22Z

The failing test is the one that compares the diff file. Is the current one up to date?

callumm-graphcore · 2023-07-17T14:34:39Z

> The failing test is the one that compares the diff file. Is the current one up to date?

I thought it was, but with the new changes, I'll redo it

Add run_speech_recognition_seq2seq.py

e4967a5

Script now working with interleaved training + validation

6e01df2

make style + add diff .txt to tests

5ac00ca

katalinic-gc reviewed Jul 17, 2023

callumm-graphcore added 2 commits July 17, 2023 14:51

eval_parallelize_kwargs -> inference_parallelize_kwargs

b42ddf8

Remove pad_to_multiple_of

c7c3f9e

callumm-graphcore added 3 commits July 17, 2023 15:36

Redo diff file

53742af

Set inference_parallelize_kwargs in IPUConfig

ee5282c

Try manually removing transformers version bit from diff file

fe466ef
