Metadata-Version: 1.1
Name: mindfactory_crawling
Version: 1.0.3
Summary: A crawler for mindfactory.de
Home-page: https://github.com/RobMcH/mindfactory_crawling
Author: Robert McHardy
Author-email: robert@robertmchardy.de
License: UNKNOWN
Description: # Mindfactory.de Crawler
        This repository contains a crawler for [Mindfactory](https://www.mindfactory.de), a German ecommerce shop (for computer hardware). The crawler extracts the data contained on every single product page and stores the scraped products and reviews in a SQLite database consisting of two tables.  
        
        Each product has the following properties:  
        * ID (SQLite identifier)
        * URL
        * Product name
        * Brand name
        * Category (i.e. CPU)
        * EAN
        * SKU
        * Items sold (Count)
        * People watching (Count)
        * RMA quote (in percent)
        * Average rating (from 1.0 to 5.0)
        * Shipping (information on availability)
        * Price (in Euro)  
        
        Additionally, for every product all reviews are collected and stored in a separate SQLite table. An entry in this table has the following properties:
        * Product ID (Reference to the corresponding ID in the product table)
        * Stars (Rating, from 1 to 5)
        * Text
        * Author
        * Date (YYYY-MM-DD)
        * Verified (actually bought the product at Mindfactory)
        
        # Prerequisites  
        * Python3
        * scrapy
        * SQLite3
        
        # Run the scraper  
            scrapy crawl mindfactory_products
        
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
