Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optional Selector to limit link extraction to be within it #114

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

monagjr
Copy link

@monagjr monagjr commented Dec 16, 2023

Utilize the underlaying Crawlee's feature of limiting link extraction to specific part of the page by exposing it to config file

@@ -72,6 +72,8 @@ type Config = {
url: string;
/** Pattern to match against for links on a page to subsequently crawl */
match: string;
/** Optional Selector to limit grabbing the links from */
matchSelector?: string;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the term matchSelector is a bit broad, maybe we can call this linkSelector to be more clear whats different between this and selector

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks @steve8708 for feedback
my reason for selecting "matchSelector" was to make it tied to the "match" property.
on the other hand, you have a point too.
what do you think?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants