testomato/php-minicrawler


PHP Minicrawler

PHP Minicrawler executes HTTP requests while handling cookies, network connection management, and SSL/TLS. By default it follows redirects and returns the full response, the final URL, parsed cookies, and more. It is designed to handle many requests in parallel in a single thread by opening a socket for each connection.

Build

Tested platforms: Debian Linux, OS X

Build and install minicrawler first.

Then run:

phpize
./configure
make
sudo make install

Add the following to minicrawler.ini, or to your php.ini file, then restart PHP:

[minicrawler]
extension="/usr/local/opt/php-minicrawler/modules/minicrawler.so"
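
After restarting PHP, it can be useful to confirm that the extension was actually picked up. The check below is a minimal sketch, not part of the project's documented steps. It assumes the `php` CLI is on your `PATH`, that the CLI reads the same ini file you edited, and that the module is listed under the name `minicrawler` (inferred from the `.so` file name above). The `status` variable is just an illustrative name.

```shell
# Sketch: check whether the PHP CLI reports a module named "minicrawler".
# Assumption: the module name matches the minicrawler.so file name.
if command -v php >/dev/null 2>&1 && php -m 2>/dev/null | grep -qi '^minicrawler$'; then
  status="loaded"
else
  status="not loaded"
fi
echo "minicrawler extension: $status"
```

If this reports "not loaded", check that the `extension=` path points to where `make install` placed `minicrawler.so` on your system. PHP running under a web server or FPM may read a different ini file than the CLI.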