Help needed for FDP calculation #4

Francis-B · 2024-10-30T15:46:38Z

Hi,

In order to get familiar with your tool, I tried to reproduce the Tide+Percolator-RESET plot in Figure 3 of your article. However, all 3 methods gave me an FDP far below the FDR threshold (see figure below). I also tried FDRbench with one of my datasets and a Comet+PeptideProphet pipeline and, once again, the FDPs I got were far below the FDR threshold. Moreover, the FDP was a bit lower with a proteogenomic database than with SwissProt, which is unexpected.

Since I got similar results with different search engines and post-processing tools, I guess that my problem arises from the arguments I use to run FDRbench.

Here are my command lines,

to generate entrapment database:

java -jar fdrbench-0.0.1.jar  -enzyme 1 -fix_nc c  -level peptide  -db <path/to/database.fasta>  -o <database_entrapment.tx -uniprot -minLength 7  -maxLength 35

to compute FDP:

java -jar fdrbench-0.0.1.jar -fold 1 -pep <database_entrapment.txt> -i <percolator-RESET_output> -o <output_path> -score "TailorScore:1"
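For readers following along, here is a minimal Python sketch of the kind of estimate such a plot is built from. It assumes the lower-bound and combined entrapment estimators as they are usually described for entrapment experiments, and it assumes that `-fold 1` corresponds to an entrapment-to-target ratio r = 1. The function name and the example counts are hypothetical, and this is not FDRbench's own code.

```python
def fdp_estimates(n_target, n_entrapment, r=1.0):
    """Return (lower_bound, combined) FDP estimates for one score threshold.

    n_target / n_entrapment: accepted discoveries from the original target
    database and from the entrapment part (decoys already removed).
    r: assumed entrapment-to-target database size ratio.
    """
    total = n_target + n_entrapment
    if total == 0:
        return 0.0, 0.0  # nothing accepted, so nothing to estimate
    # Lower bound: counts only the false discoveries that hit entrapment entries.
    lower_bound = n_entrapment / total
    # Combined: also accounts for the false discoveries expected among targets.
    combined = n_entrapment * (1.0 + 1.0 / r) / total
    return lower_bound, combined


# Made-up counts, for illustration only.
lower, combined = fdp_estimates(n_target=980, n_entrapment=20, r=1.0)
print(f"lower bound = {lower:.3f}, combined = {combined:.3f}")
```

If the combined estimate sits well below the FDR threshold across the whole range, as in the figure described here, the discoveries contain far fewer entrapment hits than the threshold would allow, which points at the upstream search or post-processing setup rather than at the arithmetic.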

I used the same version of SwissProt as you did (UP000005640) and all the parameters you mentioned in the methods section for Tide (crux 4.2.Linux) and Percolator-RESET (v. 0.0.6).

I did not aggregate the FDPs of all runs of the PXD001468 dataset as you did in the article, but each run gives me a figure similar to the following:

Do you have any idea what I could have done wrong? If not, could you provide the arguments you used to run FDRbench for Figure 3?

If you would like to have more details, I will be happy to provide them!

Thanks a lot!


wenbostar · 2024-10-30T21:11:48Z

java -jar fdrbench-0.0.1.jar -fold 1 -pep <database_entrapment.txt> -i <percolator-RESET_output> -o <output_path> -score "TailorScore:1"

Hi @Francis-B, could you please share with me the input files you used with this command line to produce the plot you showed?

Bo

Francis-B · 2024-10-31T16:26:20Z

Sure, here are OneDrive links to these files:

database_entrapment.txt
percolator-RESET_output

The percolator-RESET_output is in fact a version of the Percolator output filtered to keep only the columns relevant to FDRbench!

wenbostar · 2024-10-31T16:46:52Z

Thanks for sharing the data. I got the same result when I ran FDRBench on your input data. I think the issue is that you did a combined search (multiple DDA MS/MS files from different MS runs) but evaluated the FDR control in each individual run.

Could you just run Tide (crux 4.2.Linux) with Percolator-RESET (v. 0.0.6) on an individual run to see what the plot looks like?

Francis-B · 2024-10-31T20:57:48Z

Thanks for your quick answer!

The input files I attached above were obtained from a search I did only on the b1906_293T_proteinID_01A_QE3_122212.mzML file. In fact, I wrote a snakemake pipeline to repeat the analysis for each file of the dataset individually.

To make sure there were no problems in my pipeline that could cause a combined search or mess up any other step, I reran all steps on the b1906(...) file alone. The new plot I got was not exactly the same, but it was very similar. I suppose this small difference is due to the stochasticity of the entrapment database creation (I did not set a random seed).
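As an illustration of why an unseeded run can differ slightly, here is a small Python sketch of seeded peptide shuffling. It is an assumption-laden toy, not FDRbench's implementation: the helper `shuffle_peptide` is hypothetical, the C-terminal residue is held fixed only to mirror the spirit of the `-fix_nc c` argument, and whether FDRbench exposes a seed option is not stated in this thread.

```python
import random


def shuffle_peptide(seq, rng, fix_c_term=True):
    """Shuffle a peptide sequence, optionally keeping its last residue in place."""
    if len(seq) < 2:
        return seq
    core = list(seq[:-1]) if fix_c_term else list(seq)
    rng.shuffle(core)  # in-place shuffle driven by the supplied generator
    tail = seq[-1] if fix_c_term else ""
    return "".join(core) + tail


peptide = "LVNELTEFAK"  # arbitrary example sequence

# Same seed: the shuffled sequence is identical from run to run.
first = shuffle_peptide(peptide, random.Random(2024))
second = shuffle_peptide(peptide, random.Random(2024))
print(first == second)

# No seed: the generator is seeded from the OS or clock, so runs may differ.
print(shuffle_peptide(peptide, random.Random()))
```

With a fixed seed the entrapment sequences, and therefore the downstream FDP curve, should be reproducible; without one, small run-to-run differences like those described here are expected.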

If it helps, I just uploaded my pipeline here so you can see each step I did. I can also send you OneDrive links to any intermediate files you would like, so that you do not have to run the whole pipeline.

wenbostar · 2024-10-31T21:28:11Z

I did a quick search using Tide+Percolator on b1906_293T_proteinID_01A_QE3_122212.mgf. Below is what I got:

I will look into your workflow later.

Bo

Francis-B · 2024-10-31T21:40:49Z

Cool, thanks again!

In the meantime, I'll triple-check the params of my steps to see if I can get a plot similar to yours!

wenbostar · 2024-11-01T05:57:07Z

Hi @Francis-B, could you please share with me the inputs and outputs of each step in your workflow from the run on b1906_293T_proteinID_01A_QE3_122212.mzML?

Francis-B · 2024-11-01T14:47:56Z

Sure, here they are!

All files are organised in the same subfolders as mentioned in the snakefile.

wenbostar · 2024-11-01T21:58:04Z

The issue is that you need to do a concatenated search in Tide (--concat T) when using percolator-reset.
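To make the distinction concrete, the following Python sketch shows what a concatenated search amounts to conceptually: targets and decoys compete for each spectrum, and only the single best match per spectrum is kept. This is a toy model under stated assumptions, not Tide's implementation; the function name, the data layout, the scores, and the tie rule (targets win ties) are all hypothetical.

```python
def concatenated_best_psms(target_psms, decoy_psms):
    """Keep one best-scoring match per spectrum across targets and decoys.

    Each argument maps spectrum id -> (peptide, score); higher score is better.
    Returns spectrum id -> (peptide, score, label).
    """
    best = {}
    # Targets are processed first, so with a strict '>' they win ties (assumption).
    for label, psms in (("target", target_psms), ("decoy", decoy_psms)):
        for spectrum_id, (peptide, score) in psms.items():
            current = best.get(spectrum_id)
            if current is None or score > current[1]:
                best[spectrum_id] = (peptide, score, label)
    return best


# Made-up matches, for illustration only.
targets = {1: ("PEPTIDEK", 3.0), 2: ("AAAGK", 1.0)}
decoys = {1: ("EDITPEPK", 2.0), 2: ("GAAAK", 1.5)}

for spectrum_id, match in sorted(concatenated_best_psms(targets, decoys).items()):
    print(spectrum_id, match)
```

In a separate search, every spectrum would instead contribute both its best target and its best decoy match, which changes the target and decoy counts that a downstream tool sees.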

Francis-B · 2024-11-02T15:00:30Z

I confirm that this solved my problem! Thank you so much!

Francis

wenbostar · 2024-11-02T22:53:06Z

Hi Francis,

Great. Thanks for the confirmation.

What does it look like in the proteogenomics search? I haven't tried FDRBench on this type of search.

Bo

Francis-B · 2024-11-11T16:30:31Z

Hi Bo,

Sorry for the delayed answer. I'm in a rush right now with upcoming deadlines, so I had to put this side project on hold. I'll be more than happy to share the FDP estimates from the proteogenomics search with you when I get them!
