Hi,
Thanks for sharing the code. I ran a small-scale experiment to make sure the code works on my end, following the instructions under "Label Embedding" in the readme.txt, with the following settings:
DATASETS=(arc fnc ibmcs)
TARGET=arc
In the generated test_predictions.csv, I see lines like the following:
Since I set the target dataset to arc, I expected the predicted labels to also come from the arc dataset, but here the prediction is fnc1_agree. Can you kindly explain this? Also, when I checked the confusion matrix, there seem to be rows for labels from the fnc and ibmcs datasets, but none from the arc dataset. Can you kindly explain this as well?
In the generated test_metric.json, I see a very low accuracy score. I expected that whenever an "agree" label is predicted (and likewise for the other labels), it would count as a correct prediction regardless of whether it is fnc1_agree or arc_agree, since both belong to POSITIVE_LABELS. However, that does not seem to be how the accuracy score is computed. Can you kindly clarify this part too?
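To make my expectation concrete, here is a minimal sketch of how I imagined the accuracy being computed. The group names and label sets below are my own assumptions about how POSITIVE_LABELS (and a corresponding negative group) might be populated, not the repository's actual code:

# Sketch of the accuracy I expected: collapse dataset-specific labels into
# shared polarity groups before comparing gold and predicted labels.
# The group memberships below are illustrative guesses, not the repo's constants.

POSITIVE_LABELS = {"arc_agree", "fnc1_agree", "ibmcs_pro"}
NEGATIVE_LABELS = {"arc_disagree", "fnc1_disagree", "ibmcs_con"}

def to_group(label: str) -> str:
    """Map a dataset-specific label to its shared polarity group."""
    if label in POSITIVE_LABELS:
        return "positive"
    if label in NEGATIVE_LABELS:
        return "negative"
    return "other"

def group_accuracy(gold_labels, predicted_labels):
    """Accuracy after mapping both sides to groups, so e.g. fnc1_agree
    predicted for a gold arc_agree counts as correct."""
    correct = sum(to_group(g) == to_group(p)
                  for g, p in zip(gold_labels, predicted_labels))
    return correct / len(gold_labels)

# Example: a cross-dataset "agree" prediction still counts as correct here.
print(group_accuracy(["arc_agree", "arc_disagree"],
                     ["fnc1_agree", "ibmcs_pro"]))  # 0.5

Under this view, the prediction fnc1_agree for a gold arc_agree would not hurt accuracy, which is why the low score surprised me.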