Metadata-Version: 1.1
Name: extract-html-diff
Version: 0.1.0
Summary: Extract difference between two html pages
Home-page: https://github.com/TeamHG-Memex/extract-html-diff
Author: Konstantin Lopuhin
Author-email: kostia.lopuhin@gmail.com
License: MIT
Description: extract-html-diff: extract difference between two html pages
        ============================================================
        
        .. image:: https://img.shields.io/pypi/v/extract-html-diff.svg
           :target: https://pypi.python.org/pypi/extract-html-diff
           :alt: PyPI Version
        
        .. image:: https://img.shields.io/travis/TeamHG-Memex/extract-html-diff/master.svg
           :target: http://travis-ci.org/TeamHG-Memex/extract-html-diff
           :alt: Build Status
        
        .. image:: http://codecov.io/github/TeamHG-Memex/extract-html-diff/coverage.svg?branch=master
           :target: http://codecov.io/github/TeamHG-Memex/extract-html-diff?branch=master
           :alt: Code Coverage
        
        This package allows you to extract a difference between two html pages:
        given pages A and B, it will try to extract parts of A that are changed in B.
        It uses ``lxml.html.diff`` under the hood. but provides only changed parts as HTML.
        
        It requires Python 3 currently.
        
        License is MIT.
        
        Installaton
        -----------
        
        You can install the package from PyPI::
        
            pip install extract-html-diff
        
        
        Usage
        -----
        
        You can extract diff as text::
        
            import extract_html_diff
        
            html = '<div> <h1>My site</h1> <div>My content</div> </div>'
            other_html = '<div> <h1>My site</h1> <div>Other content</div> </div>'
        
            extract_html_diff.as_string(html, other_html)
        
        this will give you::
        
            '<div><div>My content</div>  </div>'
        
        You can also get diff as a tree (an ``lxml.html.HtmlElement``) if
        you plan to do additional transformations or change serialization::
        
            extract_html_diff.as_tree(html, other_html)
        
        You can pass input html as ``str`` or ``bytes``
        (it will be parsed with ``lxml.html.fromstring`` in this case), or as an already parsed
        ``lxml.html.HtmlElement``.
        
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Topic :: Internet :: WWW/HTTP :: Indexing/Search
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
