Metadata-Version: 2.1
Name: chemdataextractor2
Version: 2.1.2
Summary: A toolkit for extracting chemical information from the scientific literature.
Home-page: https://github.com/CambridgeMolecularEngineering/ChemDataExtractor2
Author: Matt Swain, Callum Court, Juraj Mavracic, Taketomo Isazawa, and contributors
Author-email: m.swain@me.com, cc889@cam.ac.uk, jm2111@cam.ac.uk, ti250@cam.ac.uk
License: MIT
Keywords: text-mining mining chemistry cheminformatics nlp html xml science scientific
Platform: UNKNOWN
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Topic :: Internet :: WWW/HTTP :: Indexing/Search
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: Topic :: Scientific/Engineering :: Chemistry
Classifier: Topic :: Text Processing
Classifier: Topic :: Text Processing :: Linguistic
Classifier: Topic :: Text Processing :: Markup :: HTML
Description-Content-Type: text/markdown
License-File: LICENSE

ChemDataExtractor
==================================

ChemDataExtractor v2 is a toolkit for extracting chemical information from the scientific literature. Python 3.5 to Python 3.9 supported.


Installation
------------

`pip install chemdataextractor2`


Features
--------

- HTML, XML and PDF document readers
- Chemistry-aware natural language processing pipeline
- Chemical named entity recognition
- Rule-based parsing grammars for property and spectra extraction
- Table parser for extracting tabulated data
- Document processing to resolve data interdependencies

Documentation & Development
-----------------------------

Please read the documentation for instructions on contributing to the project.

https://cambridgemolecularengineering-chemdataextractor-development.readthedocs-hosted.com/en/latest/

License
-------

ChemDataExtractor v2 is licensed under the `MIT license`_, a permissive, business-friendly license for open source
software.

MIT license: https://github.com/CambridgeMolecularEngineering/ChemDataExtractor/blob/master/LICENSE


