Translatotron-V

Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024).

Install

cd src
pip install -e ./

Dataset

The dataset can be downloaded here.

Run data-build/create_lmdb.sh to process the in-image machine translation (IIMT) data.

Training

Stage 1

Run script/train_mgpu_tiny.sh to train the image tokenizer.

Stage 2

Run script/vit-vqgan/run_t2i_layout.sh to train the teacher model.
Run script/iimt/run_translatotron_v.sh to train Translatotron-V.

Inference

Run script/test_translatotron_v.sh to test Translatotron-V.
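
The steps from Dataset through Inference can be chained in the order this README lists them. The following is a minimal sketch, not part of the repository. It assumes that you run it from the repository root and that each script works without extra arguments, which may not hold if a script needs its paths or hyperparameters edited first. The DRY_RUN switch and the run_pipeline function are hypothetical names introduced only for this illustration.

```shell
#!/bin/sh
# Sketch: run the README's steps in their documented order.
# Assumption: every script can be invoked without arguments from the repo root.
set -eu

# Script paths exactly as listed in this README:
#   1. process IIMT data
#   2. Stage 1: train the image tokenizer
#   3. Stage 2: train the teacher model
#   4. Stage 2: train Translatotron-V
#   5. test Translatotron-V
STEPS="data-build/create_lmdb.sh
script/train_mgpu_tiny.sh
script/vit-vqgan/run_t2i_layout.sh
script/iimt/run_translatotron_v.sh
script/test_translatotron_v.sh"

# DRY_RUN is a hypothetical switch for this sketch. It defaults to 1,
# so the steps are only printed unless DRY_RUN=0 is set.
run_pipeline() {
  for step in $STEPS; do
    if [ "${DRY_RUN:-1}" = "1" ]; then
      echo "would run: sh ${step}"
    else
      # Stop early if a script is missing instead of failing mid-pipeline.
      [ -f "${step}" ] || { echo "missing: ${step}" >&2; return 1; }
      sh "${step}"
    fi
  done
}

run_pipeline
```

With the default setting the sketch only prints the planned commands. Setting DRY_RUN=0 executes them one after another and stops at the first missing or failing script.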

Acknowledgement

parti-pytorch: the codebase we built upon. It is an implementation of Parti, Google's pure attention-based text-to-image neural network, in PyTorch.
