Game Of Thrones full dataset from fandom wiki.

Entity graph created by scraping GOTWiki- https://gameofthrones.fandom.com/wiki

Nodes- ['Organization', 'Person', 'Event', 'Episode', 'Animal', 'Location', 'HistoriesNLore', 'Weapon', 'House', 'PersonType', 'Religion', 'Season']

Relationships- ['SeenOrMentioned', 'Membership', 'Religion', 'Center', 'Location', 'Clergy', 'Allegiance', 'Leader', 'Founder', 'Predecessor', 'Death', 'Culture', 'Conflict', 'Place', 'Outcome', 'AssociatedLocation', 'Father', 'Mother', 'Spouse', 'Siblings', 'Battles', 'Rulers', 'Narratedby', 'Lovers', 'Successor', 'Children', 'Maker', 'Owner', 'Lord', 'Capital', 'Cities', 'Towns', 'Castles', 'Species', 'Range', 'Ruler', 'Population', 'Heir', 'Ancestralweapon', 'PlacesofNote', 'Formerly', 'Placesofnote', 'Military', 'Institutions', 'Villages', 'Placeoforigin', 'Formedfrom', 'Cadetbranches', 'Militarystrength', 'Premiere', 'Finale']

I have written a introductory blog about web scraping - https://codefringo.wordpress.com/2018/10/22/webcralwer-in-python/

Important files-

spiders/GotTGraphSpider.py is the main spider used to scrape fandom wiki
DataProcessor/ScrapyOutputProcessing.ipynb is the jupyter notebook that processes the scrapedOutput and generates tabular data for entities and creates graph in neo4j instance.

Run the whole project-

Run command- "scrapy crawl GotGraphSpider -o Data/ScrapedData.json"
Execute the jupyter notebook - DataProcessor/ScrapyOutputProcessing.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.vscode		.vscode
Data		Data
DataProcessor		DataProcessor
GoTCrawler		GoTCrawler
Outputs		Outputs
.gitignore		.gitignore
README.md		README.md
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Game Of Thrones full dataset from fandom wiki.

About

Releases

Packages

Languages

VaibhavKankane/GameOfThrones

Folders and files

Latest commit

History

Repository files navigation

Game Of Thrones full dataset from fandom wiki.

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages