Prefer upper bounds when resolving/backtracking #13017

notatallshaw · 2024-10-14T05:41:20Z

Fixes: #12993
Fixes: #12990
Fixes: #12430
Fixes: #13030

This PR is built on top of #12982 so that the unit tests can be expanded, either that PR can be reviewed first, or this PR can supplant that PR.

I have developed some benchmark scripts to ensure that changes to pip's resolution algorithm don't regress common real world requirements: https://github.com/notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks.

I plan to keep building out more scenarios, you can see the current ones so far here: https://github.com/notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks/tree/main/scenarios

Upon testing this PR compared to pip 24.2 I see one small regressions and two big improvements:

Difference for scenario scenarios/problematic.toml - autogluon:
    	Success: False -> True.
    	Failure Reason: Build Failure -> None.

Difference for scenario scenarios/problematic.toml - boto3-urllib3-transient:
    	Number of packages processed: 869 -> 871

Difference for scenario scenarios/big-packages.toml - apache-airflow-all:
    	Number of requirements processed: 593 -> 592
    	Number of packages processed: 681 -> 661

The fact that autogluon can resolve is a big improvement, apache-airflow[all] gets a noticeable improvement in how many packages it has to process (and this has real time improvement, as the number of packages processed can have O(n^2) complexity) , and a scenario involving boto3 and urllib3 as transient requirements gets a small regression in having to process 2 more packages.

I am hoping to find more real world scenarios where this has a noticeable difference, but I think these results are sufficient to show this approach is a net positive.

notatallshaw · 2024-10-14T06:12:14Z

Very tentatively adding this to the 24.3 milestone on the basis of:

If a maintainer with resolver experience can look at Simplify, fix, and add unit tests for PipProvider.get_preference #12982 then this PR only adds a small amount of functional code on top: 70f4d92
This expands the unit tests in that PR to the functional code in this PR
This is backed up as not regressing against a number of scenarios
It has a real world issue it fixes

But I understand if no maintainer is available to review.

notatallshaw · 2024-10-15T00:53:51Z

Added more problematic scenarios in: https://github.com/notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks/blob/main/scenarios/problematic.toml

And found this also fixes #12430 (which was merged into another issue, but the specific resolution the user had is now solved by this).

potiuk · 2024-10-15T01:32:56Z

I do not know pip resiolution internals - but the rules explained make sense and might improve a number of cases indeed.

notatallshaw · 2024-10-18T14:34:09Z

I took a look to see whether it made any difference to put upper bound preference above or below backtracking cause preference, and at least in the scenarios I currently have in https://github.com/notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks/blob/main/scenarios it didn't make any significant difference (there was a very slight regression of apache-airflow-beam putting it below, as it visited 1 extra package).

So I consider this good in its current position, and if I find a scenario in the future, or a user reports one, where it does make a significant difference, then it can be changed.

notatallshaw · 2024-10-20T15:46:41Z

Found a minor improvement, in acryl-datahub[all] which has over 300 total dependencies, it visited 1 less requirement, 6 less packages, and produced a slightly better solution: notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks#2 (comment)

sbidoul · 2024-10-26T08:13:30Z

While this looks very reasonable I'd prefer to have another resolver expert (which I am not, unfortunately) to look into this. So postponing.

notatallshaw · 2024-10-26T16:49:52Z

While this looks very reasonable I'd prefer to have another resolver expert (which I am not, unfortunately) to look into this. So postponing.

I knew this one was pretty unlikely but I thought I'd give it a shot since the recent real world issues raised that this solves.

notatallshaw · 2024-11-10T20:19:12Z

Going to make a single follow up PR once #13001 lands, I'll comment here once done.

notatallshaw added 7 commits October 13, 2024 21:52

Simplify, fix, and add unit tests for PipProvider.get_preference

75ca682

Linting fix

cfeb2c9

Simplify test setup

15d3833

Prefer upper bounded requirements

70f4d92

Update tests for get_preference

2f0a369

Update docs

f1b86b2

NEWS ENTRY

2fe0b77

psf-chronographer bot added the bot:chronographer:provided label Oct 14, 2024

notatallshaw added this to the 24.3 milestone Oct 14, 2024

notatallshaw mentioned this pull request Oct 14, 2024

Request: New Release sarugaku/resolvelib#159

Closed

notatallshaw mentioned this pull request Oct 18, 2024

Vendor resolvelib 1.1.0 #13001

Open

notatallshaw mentioned this pull request Oct 18, 2024

Bug in pip's pinned preference on packages that have a requirement ==N.* #13030

Open

1 task

Add test for "==1.*"

a8926df

notatallshaw mentioned this pull request Oct 20, 2024

add acryl-datahub[all] to big packages list notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks#2

Merged

notatallshaw added 2 commits October 21, 2024 20:16

Update docstring for get_preference

35ed6c9

Update docs for resolution

7cb360b

cburroughs mentioned this pull request Oct 25, 2024

Performance Tracking: generate-lockfiles pantsbuild/pants#21223

Open

sbidoul modified the milestones: 24.3, 25.0 Oct 26, 2024

notatallshaw closed this Nov 10, 2024

This was referenced Nov 10, 2024

Simplify, fix, and add unit tests for PipProvider.get_preference #12982

Closed

PoC: Very verbose resolution #13039

Closed

github-actions bot locked as resolved and limited conversation to collaborators Nov 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prefer upper bounds when resolving/backtracking #13017

Prefer upper bounds when resolving/backtracking #13017

notatallshaw commented Oct 14, 2024 •

edited

Loading

notatallshaw commented Oct 14, 2024 •

edited

Loading

notatallshaw commented Oct 15, 2024 •

edited

Loading

potiuk commented Oct 15, 2024

notatallshaw commented Oct 18, 2024 •

edited

Loading

notatallshaw commented Oct 20, 2024

sbidoul commented Oct 26, 2024

notatallshaw commented Oct 26, 2024

notatallshaw commented Nov 10, 2024 •

edited

Loading

Prefer upper bounds when resolving/backtracking #13017

Prefer upper bounds when resolving/backtracking #13017

Conversation

notatallshaw commented Oct 14, 2024 • edited Loading

notatallshaw commented Oct 14, 2024 • edited Loading

notatallshaw commented Oct 15, 2024 • edited Loading

potiuk commented Oct 15, 2024

notatallshaw commented Oct 18, 2024 • edited Loading

notatallshaw commented Oct 20, 2024

sbidoul commented Oct 26, 2024

notatallshaw commented Oct 26, 2024

notatallshaw commented Nov 10, 2024 • edited Loading

notatallshaw commented Oct 14, 2024 •

edited

Loading

notatallshaw commented Oct 14, 2024 •

edited

Loading

notatallshaw commented Oct 15, 2024 •

edited

Loading

notatallshaw commented Oct 18, 2024 •

edited

Loading

notatallshaw commented Nov 10, 2024 •

edited

Loading