This is the repository for our paper DRS: Deep Question Reformulation With Structured Output
In this paper, our proposed DRS method leverages LLMs and a DFS-based algorithm to iteratively search for possible entity combinations and constrain the output to contain certain entities, effectively improving the question-reformulation capabilities of LLMs.
Our approach consists of three main steps:
- Entity Extraction and Filtering, where an LLM extracts core entities and classifies them so that only certain types of entities are preserved.
- DFS Entity Combination Search and Structured Output Generation, which employs a DFS-based algorithm to identify entity combinations that form answerable questions under entity constraints.
- DFS Pruning and Candidate Selection, where we limit the number of candidate questions and the search depth through DFS pruning, then select the most reliable question from the candidate table according to its content and the number of entities it preserves.
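The DFS search and pruning described above can be sketched as follows. This is an illustrative sketch, not the repository's actual implementation: `ask_llm` is a placeholder for the LLM call that generates an entity-constrained question and judges its answerability, and the depth and candidate caps stand in for the pruning parameters.

```python
def dfs_search(entities, ask_llm, max_depth=3, max_candidates=5):
    """Depth-first search over entity combinations (illustrative sketch).

    ask_llm(combo) is a placeholder: it should return (question, answerable),
    where `question` is constrained to contain every entity in `combo`.
    """
    candidates = []

    def dfs(start, combo):
        # Pruning: cap the number of candidates and the search depth.
        if len(candidates) >= max_candidates or len(combo) > max_depth:
            return
        if combo:
            question, answerable = ask_llm(combo)
            if answerable:
                candidates.append((question, len(combo)))
        for i in range(start, len(entities)):
            dfs(i + 1, combo + [entities[i]])

    dfs(0, [])
    # Candidate selection heuristic: prefer questions preserving more entities.
    candidates.sort(key=lambda qc: -qc[1])
    return candidates
```

With a stub `ask_llm`, the search enumerates entity subsets depth-first and keeps only the combinations judged answerable, ranked by how many entities they retain.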
```shell
pip install -r requirements.txt
```
- Use the `config.yaml` file to set your OpenAI API key.

```yaml
openai:
  api_key: "YOUR_API_KEY"
  organization: "YOUR_ORGANIZATION"
```
- Use the environment variable to set your HuggingFace token.

```shell
export HF_TOKEN="your_hf_token"
```
- Change the `parameters.yaml` file.
  - Switch `subset_name` to run on six different datasets. (Possible names are listed in `parameters.yaml`.)
  - Try different baseline methods through the `type` parameter. (Possible types are listed in `parameters.yaml`.)
  - Modify `run_model` to run with different LLMs. (If you are using OpenAI models, set `run_model_platform` to `openai`; otherwise, set it to `huggingface` for open-source models.)
```shell
python baseline_openai.py        # OpenAI models
python baseline_huggingface.py   # open-source models
```
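The platform switch above might be wired up roughly as follows. This is a hypothetical sketch of the dispatch logic, not the repository's code; only the parameter names (`run_model_platform`) and script names mirror the README.

```python
def select_runner(params: dict) -> str:
    """Pick the entry-point script from a parsed parameters.yaml mapping.

    Illustrative only: the real repository may structure this differently.
    """
    platform = params["run_model_platform"]
    if platform == "openai":
        return "baseline_openai.py"       # closed-source OpenAI models
    if platform == "huggingface":
        return "baseline_huggingface.py"  # open-source models via HF
    raise ValueError(f"unknown run_model_platform: {platform!r}")
```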
- Change the `parameters.yaml` file.
  - Switch `subset_name` to run on six different datasets. (Possible names are listed in `parameters.yaml`.)
  - Modify `run_model` to run with different LLMs. (If you are using OpenAI models, set `run_model_platform` to `openai`; otherwise, set it to `huggingface` for open-source models.)
```shell
python drs_openai.py        # OpenAI models
python drs_huggingface.py   # open-source models
```
You can download the CouldAsk dataset from HuggingFace.
To demonstrate the effectiveness of our proposed zero-shot DRS method, we test it on six datasets, totaling over 1,700 data points, using four different LLMs. We conduct a fair and thorough comparison between our method and four baseline methods from previous research, with all accuracy scores presented in the table below.
To explore the impact of the number of candidate questions on the performance of our zero-shot DRS method, we conduct experiments with varying numbers of candidate questions and observe how accuracy changes. We set the number of candidate questions from 1 to 5 for each LLM and test on all datasets, with the results shown in the figure below.
If you have any questions related to the repo or the paper, or you encounter any problems when using the code, feel free to email Zhecheng Li ([email protected]).