-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Questions on the training dataset of the attack model #2
Comments
Hi Toby, thank you for your interest in the implementation. Purpose of providing top-k probabilities onlyMost of the available black box model's APIs only provide top-k probabilities with its corresponding labels, for example Google Vision AI outputs top-10 probabilities, not the entire prediction vector. The paper also uses top-k filter mentioning that
I think using top-k probabilities is hard scenario for the attacker and not deviating from the MIA's principle. Purpose behind Removing Label of the DataI removed the 'class label' column from the Attack Training Set because Thank you! |
Hi, However, I am a bit confused that how remaining topk probs only & removing class labels from the attack training dataset would still provide enough info about shadow models' outputs. For example, The topk includes the major part of distribution of outputs, but it missed the info about which class would highly likely to occur. Can you provide some insight about this? Thank you! |
@taehyeok-jang The point of the MIA, at least for this implementation, is to discern whether individual datapoint has been included in the training set or not. I don't think class info is going to contribute much to such task. |
Hi! I have a question about the way that training dataset of the attack model is formulated.
In the original paper, the dataset consists of three parts: label of the data, prediction vector, and whether the data is in the original training dataset.
However, in you implementation, the dataset consists of two parts: top k probabilities, and whether the data is in the original training dataset.
I wonder if this modification would lead to difference in the way that MIA works. I'm new to MIA, so I would appreciate it if you can help.
The text was updated successfully, but these errors were encountered: