RAST

This repository contains the code for the paper "Diversify Question Generation with Retrieval-Augmented Style Transfer".

We provide our processed data in data_link.
We also provide our model checkpoint in checkpoint_link.
If you find this code useful in your research, please consider citing:

@misc{gou2023diversify,
      title={Diversify Question Generation with Retrieval-Augmented Style Transfer}, 
      author={Qi Gou and Zehua Xia and Bowen Yu and Haiyang Yu and Fei Huang and Yongbin Li and Nguyen Cam-Tu},
      year={2023},
      eprint={2310.14503},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

To reproduce

1. download data

SQuAD 1.1, Zhou split
- This split follows "Neural Question Generation from Text: A Preliminary Study".
- The train/dev/test sets contain 86,635/8,965/8,964 examples, respectively.
SQuAD 1.1, Du split
- This split follows "Learning to Ask: Neural Question Generation for Reading Comprehension".
- The train/dev/test sets contain 70,484/10,570/11,877 examples, respectively.
NewsQA
- This dataset is from "NewsQA: A Machine Comprehension Dataset".
- The train/dev/test sets contain 92,549/5,166/5,126 examples, respectively.
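As a quick sanity check after downloading, the split sizes listed above can be compared against the number of examples actually loaded. The following is only a minimal sketch: the dictionary keys and the `check_split_sizes` helper are hypothetical names chosen for illustration and are not part of this repository.

```python
# Expected (train, dev, test) example counts, copied from this README.
# The dataset keys are hypothetical labels, not names used by the repository.
EXPECTED_SPLIT_SIZES = {
    "squad1.1_zhou": (86635, 8965, 8964),
    "squad1.1_du": (70484, 10570, 11877),
    "newsqa": (92549, 5166, 5126),
}


def check_split_sizes(name, train, dev, test):
    """Return a list of mismatch messages (empty if every split size matches)."""
    expected = EXPECTED_SPLIT_SIZES[name]
    actual = (len(train), len(dev), len(test))
    problems = []
    for split, want, got in zip(("train", "dev", "test"), expected, actual):
        if want != got:
            problems.append(f"{name}/{split}: expected {want}, got {got}")
    return problems
```

Any sized container works as input, so the helper can be called with whatever list-like object your loader produces for each split.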

2. process data

Process the original data:

python data/process_data.py
For details, refer to data/readme.md.

Convert the corpus data into vectors and store them with FAISS:

python rast/rag/prepare_dataset.py
For details, refer to rast/rag/prepare_dataset.py.
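To give a sense of what the stored vectors are used for, the sketch below performs exact inner-product retrieval over a handful of toy vectors. It is written in plain Python as a stand-in and does not use FAISS or reproduce prepare_dataset.py; the `search` helper and the toy vectors are hypothetical, and the encoder and index type used by this repository are not assumed here.

```python
import heapq


def inner_product(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def search(corpus_vectors, query, k):
    """Return the k (score, index) pairs with the highest inner product.

    This is a brute-force scan over every stored vector; a vector index
    library answers the same kind of query far more efficiently.
    """
    scored = (
        (inner_product(vec, query), idx)
        for idx, vec in enumerate(corpus_vectors)
    )
    return heapq.nlargest(k, scored)


# Toy 2-dimensional "corpus embeddings" (hypothetical values).
corpus_vectors = [
    [1.0, 0.0],
    [0.0, 1.0],
    [0.7, 0.7],
]

# Retrieve the two corpus entries most similar to the query vector.
top = search(corpus_vectors, [1.0, 0.2], k=2)
top_indices = [idx for _, idx in top]
```

In a real pipeline the vectors would come from an encoder and the scan would be replaced by an index lookup; the sketch only shows the retrieval operation itself.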

3. train generator with skeleton

Refer to rast/qg/readme.md.

4. train vanilla generator

Refer to rast/qg/readme.md.

5. train QA model

Refer to rast/reward_model/T5_QA/readme.md.

6. train rag

Refer to rast/rag/readme_v100.md.
