boo283 / Facebook_comment_crawler Public


The Facebook Comments Crawler is an unofficial tool for extracting comments from Facebook posts using Selenium in Python. It's designed to aid in academic and personal research. #Facebook comments scraper #Facebook comments crawler

FACEBOOK COMMENTS CRAWLER using Selenium in Python

This is an unofficial FACEBOOK COMMENTS CRAWLER in Python. Its main purpose is to support my studies.

Main Function

Scrape comments with related information such as:
- Comment - text
- User
- Nametag (like @name_tag)
- Spam flag (whether the comment counts as spam, based on user-defined criteria)
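
The fields listed above could be represented roughly as follows. This is a minimal sketch, not the repository's actual code: the names `CommentRecord`, `build_record`, `NAMETAG_PATTERN`, and `contains_link` are hypothetical, and the link-based spam rule is only one example of a user-defined criterion.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List

# Assumption: a nametag is "@" followed by word characters, and is not
# preceded by a word character (so e-mail addresses are not matched).
NAMETAG_PATTERN = re.compile(r"(?<!\w)@\w+")


@dataclass
class CommentRecord:
    """One scraped comment; field names are illustrative only."""
    user: str
    text: str
    nametags: List[str] = field(default_factory=list)
    is_spam: bool = False


def build_record(user: str, text: str, spam_rule: Callable[[str], bool]) -> CommentRecord:
    """Extract nametags from the text and apply the caller's spam rule."""
    nametags = NAMETAG_PATTERN.findall(text)
    return CommentRecord(user=user, text=text, nametags=nametags, is_spam=spam_rule(text))


def contains_link(text: str) -> bool:
    """Example user-defined rule: treat any comment containing a link as spam."""
    lowered = text.lower()
    return "http://" in lowered or "https://" in lowered
```

Passing the spam rule in as a function keeps the "user-defined" part swappable: a different study can supply its own rule without touching the extraction code.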

Installation

Clone the repository: git clone https://github.com/boo283/Facebook_comment_crawler.git
Install dependencies: pip install -r requirements.txt

Usage

Clone this repository
Open the repository and fill in the following:

In folder "configuration":
- config.py:
  - In the function configure_driver(), replace the driver path with the path to your ChromeDriver, which can be downloaded from: https://googlechromelabs.github.io/chrome-for-testing/#stable
In main folder:
- crawl.py:
  - Enter your Facebook account credentials in the Login info part of the main function.
  - Choose the destination where the crawled data will be saved.
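
For orientation, a driver setup along the lines described for configure_driver() might look like the sketch below. It assumes Selenium 4 and is not the repository's actual function: the `chromedriver_path` parameter, the `resolve_driver_path` helper, and the `--disable-notifications` option are illustrative choices.

```python
from pathlib import Path


def resolve_driver_path(raw_path: str) -> Path:
    """Fail early with a clear message if the ChromeDriver path is wrong."""
    path = Path(raw_path).expanduser()
    if not path.is_file():
        raise FileNotFoundError(f"ChromeDriver not found at: {path}")
    return path


def configure_driver(chromedriver_path: str):
    """Start Chrome through the ChromeDriver binary at the given path."""
    # Imported here so the path check above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service

    options = webdriver.ChromeOptions()
    # Assumption: suppressing browser notification prompts is desirable here.
    options.add_argument("--disable-notifications")
    service = Service(executable_path=str(resolve_driver_path(chromedriver_path)))
    return webdriver.Chrome(service=service, options=options)
```

Checking the path before launching the browser turns a wrong path into an immediate, readable error instead of a driver exception later on.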

Change into the repository folder and run the script: python crawl.py
Enter the URL of the Facebook post and press Enter
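
Since the script takes the post URL as typed input, a small check before starting the browser can catch mistyped links. The sketch below is an assumption about how such a check could be done, not part of crawl.py; `is_facebook_post_url` and `ALLOWED_HOSTS` are hypothetical names, and the host list may need extending.

```python
from urllib.parse import urlparse

# Assumption: these hosts cover the Facebook URLs you intend to crawl.
ALLOWED_HOSTS = {"facebook.com", "www.facebook.com", "m.facebook.com"}


def is_facebook_post_url(url: str) -> bool:
    """Return True for an http(s) URL on a Facebook host with a non-empty path."""
    parsed = urlparse(url.strip())
    host = (parsed.hostname or "").lower()
    has_path = parsed.path not in ("", "/")
    return parsed.scheme in ("http", "https") and host in ALLOWED_HOSTS and has_path
```

This only checks the shape of the URL; it does not confirm that the post exists or is visible to the logged-in account.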

Note

You need a stable internet connection for this code to run correctly.
Try rerunning the code if you hit any web or driver exceptions.
Updating selectors: Facebook changes some of its HTML tags roughly every month (some tags, not all of them). When a function stops working correctly, open any post, switch to developer mode (F12), and look for the parent HTML tag of the element that failed to be clicked. Then replace the outdated XPath or CSS selector with the new one. All selectors are located in config.py and action.py.
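
One way to make such updates less painful is to keep every selector in a single table with fallbacks, so a changed tag means editing one entry. The following is a sketch under assumptions, not the layout of config.py or action.py: `SELECTORS`, `find_first`, and both selector strings are hypothetical. The strategy strings "xpath" and "css selector" are the values Selenium uses for By.XPATH and By.CSS_SELECTOR.

```python
from typing import Dict, List, Tuple

# (strategy, value) pairs as accepted by Selenium's driver.find_elements().
Locator = Tuple[str, str]

# Hypothetical selectors; real values must be read from the page with F12.
SELECTORS: Dict[str, List[Locator]] = {
    "view_more_comments": [
        ("xpath", "//span[contains(text(), 'View more comments')]"),
        ("css selector", "div[role='button'].view-more"),
    ],
}


def find_first(driver, name: str):
    """Return the first element matched by any locator registered under `name`."""
    for by, value in SELECTORS[name]:
        elements = driver.find_elements(by, value)
        if elements:
            return elements[0]
    raise LookupError(f"No selector for {name!r} matched; update SELECTORS")
```

When Facebook changes a tag, the new XPath or CSS selector is added at the top of the relevant list, and the older one can stay as a fallback until it is confirmed dead.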

Contact:

LinkedIn: https://www.linkedin.com/in/phutrungnguyen283/
Facebook: https://www.facebook.com/ngphtrungboo
Email: [email protected]

Feel free to adjust my code, practice makes perfect ❤️❤️❤️
