The goal of this project is to generate long and coherent sequences of data using Transformer architectures based on the following papers:
- Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
- Stabilizing Transformers for Reinforcement Learning
- Music Transformer
- Character-Level Language Modeling with Deeper Self-Attention
The neural networks are tested on two separate tasks: music generation and text generation. All models are implemented from scratch in TensorFlow 2.
*Architecture diagrams: Music Model (left) | Text Model (right).*
The structure of the GTrXL (Gated Transformer-XL) block is illustrated in detail below:
The architecture used for text generation is the one proposed in the paper Stabilizing Transformers for Reinforcement Learning. Music generation requires a modified model in which the input features are split into MIDI events (note_on, note_off and control_change) and MIDI deltas (the time intervals between consecutive MIDI events).
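The distinguishing feature of the GTrXL block is that the usual residual connections around the attention and feed-forward sublayers are replaced by GRU-style gates, with the gate bias initialised so that each block starts out close to an identity map. Below is a minimal sketch of such a gating layer in TensorFlow 2; the class name `GRUGate` and the default bias value are illustrative and not taken verbatim from this repository.

```python
import tensorflow as tf


class GRUGate(tf.keras.layers.Layer):
    """GRU-style gate used in place of a residual connection, as described in
    "Stabilizing Transformers for Reinforcement Learning".

    Both inputs have shape (batch, seq_len, d_model): `x` is the stream
    entering the sublayer, `y` is the sublayer's output.
    """

    def __init__(self, d_model, gate_bias=2.0, **kwargs):
        super().__init__(**kwargs)
        # Linear projections for the reset gate r, update gate z and candidate h.
        self.w_r = tf.keras.layers.Dense(d_model, use_bias=False)
        self.u_r = tf.keras.layers.Dense(d_model, use_bias=False)
        self.w_z = tf.keras.layers.Dense(d_model, use_bias=False)
        self.u_z = tf.keras.layers.Dense(d_model, use_bias=False)
        self.w_h = tf.keras.layers.Dense(d_model, use_bias=False)
        self.u_h = tf.keras.layers.Dense(d_model, use_bias=False)
        # A positive bias pushes z towards 0 at initialisation, so the gate
        # initially passes x through almost unchanged (identity map), which is
        # what the paper credits for stable training.
        self.gate_bias = tf.constant(gate_bias, dtype=tf.float32)

    def call(self, x, y):
        r = tf.sigmoid(self.w_r(y) + self.u_r(x))
        z = tf.sigmoid(self.w_z(y) + self.u_z(x) - self.gate_bias)
        h = tf.tanh(self.w_h(y) + self.u_h(r * x))
        return (1.0 - z) * x + z * h
```

Inside a block this gate is applied twice: once after the multi-head attention sublayer and once after the position-wise feed-forward sublayer, each preceded by layer normalisation.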
For the task of music generation the union of the following datasets is used:
- The MAESTRO Dataset
- SMD MIDI-Audio Piano Music
- Stanford University Piano Roll Archive
- Classical Music ML Format
All of the above contain classical piano music in MIDI format. The MIDI files are preprocessed with the mido library.
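As an illustration of how the event/delta split described above can be extracted with mido, here is a minimal sketch; the function name `midi_to_events` is hypothetical, and the actual preprocessing lives in `preprocess_music.py`.

```python
import mido


def midi_to_events(path):
    """Read a MIDI file and return two parallel lists: the note_on / note_off /
    control_change messages, and the time (in seconds) elapsed before each one."""
    events, deltas = [], []
    elapsed = 0.0
    # Iterating over a MidiFile merges all tracks and reports msg.time as
    # seconds since the previous message.
    for msg in mido.MidiFile(path):
        elapsed += msg.time
        if msg.type in ("note_on", "note_off", "control_change"):
            events.append(msg)
            deltas.append(elapsed)
            elapsed = 0.0  # the next delta is measured from this event
    return events, deltas
```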
For text generation, the CLAIR collection of "Nigerian" fraud emails is used.
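The bs4 and re dependencies listed below suggest that the raw corpus is stripped of markup and normalised before training. The sketch below shows one way this could be done; the helper name `clean_corpus` is illustrative and the exact steps in `preprocess_text.py` may differ.

```python
import re
from bs4 import BeautifulSoup


def clean_corpus(raw_text):
    """Strip HTML/email markup and normalise whitespace in the raw corpus."""
    # Remove any HTML tags left over in the scraped emails.
    text = BeautifulSoup(raw_text, "html.parser").get_text()
    # Drop non-printable characters and collapse runs of spaces/tabs.
    text = re.sub(r"[^\x20-\x7E\n]", "", text)
    text = re.sub(r"[ \t]+", " ", text)
    return text
```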
Generated samples for both tasks can be found here.
The following Python packages are required:
- NumPy
- TensorFlow
- argparse
- pathlib
- tqdm
- pickle
- re
- joblib
- mido
- glob
- bs4
- dload
Music generation:
- `python preprocess_music.py -d`
- `python train_music.py`
- `python generate_music.py <n_songs> <checkpoint path>`

Text generation:
- `python preprocess_text.py <corpus path>`
- `python train_text.py`
- `python generate_text.py <n_samples> <checkpoint path>`