I'm attempting to extract data from a paged table, but after using $click() the LiveHTML object becomes "corrupted". This may be an edge case with this specific website, as I was able to use $click() successfully from some other code examples in the Issues for this repo without this problem.
library(rvest)
sess <- read_html_live("https://www.cicic.ca/869/results.canada?search=&sect=2&int=3")
sess$click(".rgPageNext", n_clicks=1)
sess
#> Error in onRejected(reason) : code: -32000
#> message: Could not find node with given id
If you run this interactively with sess$view() you can see that the page loads successfully and the click works successfully, but after that sess seems to lose the information in html_elements (judging by the error).
Additional information
I asked on Mastodon and others were able to reproduce this error. I'm highlighting this because my initial thought was that this issue was caused by my browser being out of date due to OS restrictions, but that doesn't seem to be the case.
Software:
rvest: v1.0.4
R: v4.2.2
Browser: Chromium legacy (latest)
Oh, I bet this is because the click loads a new page and I have not updated the ID of the root node. I think this will be a reasonably simple fix when I’m next working on rvest.
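Until the root node ID is refreshed, one possible workaround is to bypass the cached node and ask the browser for the current DOM directly. The sketch below assumes that `sess$session` exposes the underlying chromote session and that its `Runtime$evaluate()` method returns the evaluated value under `result$value`. `current_html()` is a hypothetical helper name, not part of rvest.

```r
library(rvest)

# Hypothetical helper (not part of rvest): re-read the page as it currently
# exists in the browser, instead of going through the cached root node.
# Assumes `sess$session` is the underlying chromote session.
current_html <- function(sess) {
  res <- sess$session$Runtime$evaluate("document.documentElement.outerHTML")
  # Parse the serialized DOM into an ordinary static xml_document
  read_html(res$result$value)
}

# Usage sketch (needs a live browser, so it is left commented out):
# sess$click(".rgPageNext", n_clicks = 1)
# page <- current_html(sess)
# html_elements(page, "table")
```

The result is a static snapshot, so it has to be re-read after each further click.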
Closes tidyverse#405.
I tried to make this automatic, and got it working some of the time, but I couldn't find a way to detect that something might change, and thus wait in that situation. I tried to make it as clear as possible for users to be able to fix this. Once a strategy is agreed on, the same strategy should probably be applied to other methods.
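Since the change cannot be detected automatically, the waiting has to be done by the user. A minimal sketch of one way to do that is a polling helper that retries a user-supplied check until it passes or a timeout expires. `wait_until()` and its arguments are hypothetical names for illustration, and this is not necessarily the strategy the PR implements.

```r
# Hypothetical helper: poll `condition` (a zero-argument function) until it
# returns TRUE, treating errors as "not ready yet". Stops after `timeout`
# seconds.
wait_until <- function(condition, timeout = 10, interval = 0.25) {
  deadline <- Sys.time() + timeout
  repeat {
    ok <- tryCatch(isTRUE(condition()), error = function(e) FALSE)
    if (ok) return(invisible(TRUE))
    if (Sys.time() >= deadline) stop("Timed out waiting for condition")
    Sys.sleep(interval)
  }
}

# Usage sketch (needs a live session, so it is left commented out):
# sess$click(".rgPageNext", n_clicks = 1)
# wait_until(function() length(html_elements(sess, ".rgPageNext")) > 0)
```

Catching errors inside the loop matters here, because a query made while the new page is still loading can fail with the same "Could not find node with given id" error.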