Problem with "Invalid JSON in log record" when reading a DLT pipeline generated delta table through S3 #1604
-
I have a DLT pipeline that generates SCD table (both 1 and 2 types). Generated tables are stored in S3. I am trying to query those tables using deltalake-python package, using the approach explained in this blog post - though I am using the latest version of deltalake package (0.10.1). Currently I am facing an error that states:
I've tried to go through related issues and found these
From which 531 was commented to resolve the issue in 524 and fix the invalid JSON error. However, the error still seems to arise with DLT generated tables, or at least I am stuck with the issue and it is unclear to me what is the actual root cause. I have checked the corresponding
Additionally, to make this more tangled, the SCD type 1 table (one that doesn't contain history) can be read and it doesn't generate the same error even though the
Any ideas on what I am missing here? Thanks, Edit. Additionally there seems to be an issue #1302 and a draft PR #1305 to fix this. Related specifically to DLT generated tables, when metadata contains no schema information. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
This has probably now been fixed here: #1647 |
Beta Was this translation helpful? Give feedback.
This has probably now been fixed here: #1647