You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Echogarden v1.5.0
Start stage 1: Align speech to transcript
Transcode with command-line ffmpeg.. 1102.4ms
Convert wave buffer to raw audio.. 384.1ms
Resample audio to 16kHz mono.. 962.1ms
Crop using voice activity detection.. 1263.1ms
Normalize and trim audio.. 181.2ms
No language specified. Detect language using reference text.. 84.4ms
Language detected: Japanese (ja)
Load alignment module.. 0.2ms
Synthesize alignment reference with eSpeak.. 5911.2ms
Starting alignment pass 1/1: granularity: low, max window duration: 189s
Compute reference MFCC features.. 1069.2ms
Compute source MFCC features.. 721.3ms
DTW cost matrix memory size: 685.4MB
Align reference and source MFCC features using DTW.. 2345.1ms
Convert path to timeline.. 20.7ms
Postprocess timeline.. 54.9ms
Total alignment time: 14195.5ms
Start stage 2: Align timeline to translated transcript
No source language specified. Detect source language.. 0.9ms
Source language detected: Japanese (ja)
No target language specified. Detect target language.. 0.6ms
Target language detected: Chinese (zh)
Load e5 module
Prepare text for semantic alignment.. 331.4ms
Initialize E5 embedding model.. 1184.6ms
Extract embeddings from source 1.. Error: Token '4' not found in text
The text was updated successfully, but these errors were encountered:
The align-transcript-and-translation is a complex operation that combines alignment engines and a special word embedding model.
Due to how the text is tokenized when passed to the embedding model, it's possible that there are various edge cases where the tokenization and de-tokenization fails to match the original text.
I'll need the exact inputs used so I can reproduce the error and determine how to fix it.
when I use
I got
The text was updated successfully, but these errors were encountered: