Merge with LargeList arrow type not working #2851
Labels
binding/python
Issues for the Python package
bug
Something isn't working
on-hold
Issues and Pull Requests that are on hold for some reason
Environment
--------Version info---------
Polars: 1.6.0
Index type: UInt32
Platform: macOS-14.5-arm64-arm-64bit
Python: 3.12.3 (main, Jun 6 2024, 07:44:06) [Clang 15.0.0 (clang-1500.3.9.4)]
----Optional dependencies----
deltalake 0.19.2
numpy 2.1.1
pyarrow 17.0.0
Bug
What happened:
In version 0.19 large_dtypes flags were deprecated causing my workflow with merging deltatables including list types to break.
What you expected to happen:
Continue to be able to merge on tables including list (arrow largelist) type columns. Or if that is intentionally unsupported, at least some documentation/migration guide to let me know what I have to change.
How to reproduce it:
works in 0.18.x, fails on 0.19.x
More details:
Using polars implies large_dtypes=True which is included in the underlying call to write_deltalake. If I bypass polars function and call DeltaTable.create with a schema including only non-large types things work as expected. Is it intended that the TableMerger does not support large_dtypes? Broader question (between both polars and delta): what is the implications of forcing non-large types and interacting with polars? I know the polars maintainers have strong opinions that they should be using the large types, but for my workflows, I don't need anywhere near the # of elements supported by the standard list, so largelist is just using extra space.
test output
The text was updated successfully, but these errors were encountered: