Metadata-Version: 2.1
Name: latexml_html_cleaner
Version: 0.0.4
Summary: Utility to clean up an html file generated by LaTeXML
Home-page: https://github.com/eelcovv/latexml-html-cleaner
Author: EVLT
Author-email: evlt@cbs.com
License: MIT
Project-URL: Documentation, https://latexml-html-cleaner.readthedocs.io/en/latest/
Platform: any
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python
Description-Content-Type: text/x-rst; charset=UTF-8
License-File: LICENSE.txt
License-File: AUTHORS.rst
Requires-Dist: importlib-metadata; python_version < "3.9"
Requires-Dist: beautifulsoup4
Provides-Extra: dev
Requires-Dist: black; extra == "dev"
Requires-Dist: pre-commit; extra == "dev"
Requires-Dist: setuptools; extra == "dev"
Requires-Dist: setuptools_scm; extra == "dev"
Requires-Dist: wheel; extra == "dev"
Requires-Dist: tox; extra == "dev"
Requires-Dist: pytest; extra == "dev"
Requires-Dist: pytest-pep8; extra == "dev"
Requires-Dist: pytest-cov; extra == "dev"
Provides-Extra: testing
Requires-Dist: setuptools; extra == "testing"
Requires-Dist: pytest; extra == "testing"
Requires-Dist: pytest-pep8; extra == "testing"
Requires-Dist: pytest-cov; extra == "testing"
Requires-Dist: tox; extra == "testing"

====================
latexml_html_cleaner
====================

Clear html files that are generated by latexml

Description
===========

This tool can be used to postprocess html files generated by LaTeXml.

The purpose it to remove all lxt tags and hyperrefs such that the html file can be imported into external software.

Installation
============

Install the tool by running::

    pip install latex_html_cleaner


Usage
=====

This tool can be used to clean html file generated by the tex to html converter LaTeXML_. You can run it by:

.. code-block:: text

    htmlcleaner.exe filename.html

A new file with the suffix *_clean.html* will be created with the cleaned html.

Details
=======

For all the details, do::

    htmlcleaner.exe --help

giving the output:

.. code-block:: text

    usage: htmlcleaner [-h] [--version] [--output_filename STR] [-v] [-vv] [-w]
                       [-f [PATH ...]] [--clear_find_and_replace_defaults]
                       STR [STR ...]

    Cleans html files and removes hyperrefs

    positional arguments:
      STR                   File name of html input

    options:
      -h, --help            show this help message and exit
      --version             show program's version number and exit
      --output_filename STR
                            File name of output html file
      -v, --verbose         set loglevel to INFO
      -vv, --very-verbose, --debug
                            set loglevel to DEBUG
      -w, --overwrite       Overwrite the input html. Default = False, which means
                            a new html is created withthe suffix _clean
      -f [PATH ...], --find_and_replace [PATH ...]
                        Define a list of key=value pairs to define string
                        patterns you want to replace
      --clear_find_and_replace_defaults
                            Clear the predefined find and replace patterns


Note
====

This project has been set up using PyScaffold 4.0.2. For details and usage
information on PyScaffold see https://pyscaffold.org/.

.. _LaTeXML: https://math.nist.gov/~BMiller/LaTeXML
