Metadata-Version: 2.1
Name: taxopy
Version: 0.1.0
Summary: A Python package for obtaining complete lineages and the lowest common ancestor (LCA) from a set of taxonomic identifiers.
Home-page: https://apcamargo.github.io/taxopy/
Author: Antonio Pedro Camargo
Author-email: antoniop.camargo@gmail.com
License: GNU General Public License v3.0
Description: # taxopy
        
        A Python package for obtaining complete lineages and the lowest common ancestor (LCA) from a set of taxonomic identifiers.
        
        ## Installation
        
        There are two ways to install taxopy:
        
          - Using pip:
        
        ```
        pip install taxopy
        ```
        
          - Using conda:
        
        ```
        conda install -c conda-forge -c bioconda taxopy
        ```
        
        ## Usage
        
        ```python
        import taxopy
        ```
        
        First you need to download taxonomic information from NCBI's servers and put this data into a `TaxDb` object:
        
        
        ```python
        taxdb = taxopy.TaxDb()
        # You can also use your own set of taxonomy files:
        taxdb = taxopy.TaxDb(nodes_dmp="taxdb/nodes.dmp", names_dmp="taxdb/names.dmp", keep_files=True)
        ```
        
        The `TaxDb` object stores the name, rank and parent-child relationships of each taxonomic identifier:
        
        
        ```python
        print(taxdb.taxid2name['2'])
        print(taxdb.taxid2parent['2'])
        print(taxdb.taxid2rank['2'])
        ```
        
            Bacteria
            131567
            superkingdom
        
        
        To get information of a given taxon you can create a `Taxon` object using its taxonomic identifier:
        
        
        ```python
        human = taxopy.Taxon('9606', taxdb)
        gorilla = taxopy.Taxon('9593', taxdb)
        lagomorpha = taxopy.Taxon('9975', taxdb)
        ```
        
        Each `Taxon` object stores a variety of information, such as the rank, identifier and name of the input taxon, and the identifiers and names of all the parent taxons:
        
        
        ```python
        print(lagomorpha.rank)
        print(lagomorpha.name)
        print(lagomorpha.name_lineage)
        ```
        
            order
            Lagomorpha
            ['Lagomorpha', 'Glires', 'Euarchontoglires', 'Boreoeutheria', 'Eutheria', 'Theria', 'Mammalia', 'Amniota', 'Tetrapoda', 'Dipnotetrapodomorpha', 'Sarcopterygii', 'Euteleostomi', 'Teleostomi', 'Gnathostomata', 'Vertebrata', 'Craniata', 'Chordata', 'Deuterostomia', 'Bilateria', 'Eumetazoa', 'Metazoa', 'Opisthokonta', 'Eukaryota', 'cellular organisms', 'root']
        
        
        You can get the lowest common ancestor of a list of taxons using the `find_lca` function:
        
        
        ```python
        human_gorilla_lca = taxopy.find_lca([human, gorilla], taxdb)
        print(human_gorilla_lca.name)
        ```
        
            Homininae
        
        ## Acknowledgements
        
        Some of the code used in taxopy was taken from the [CAT/BAT tool for taxonomic classification of contigs and metagenome-assembled genomes](https://github.com/dutilh/CAT).
Keywords: bioinformatics,taxonomy
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Science/Research
Classifier: Natural Language :: English
Classifier: Topic :: Software Development :: Libraries
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: License :: OSI Approved :: GNU General Public License (GPL)
Classifier: Programming Language :: Python :: 3
Requires-Python: >=3.5
Description-Content-Type: text/markdown
