Metadata-Version: 2.1
Name: pyrodigal
Version: 0.4.3
Summary: Python interface to Prodigal, an ORF finder for genomes, progenomes and metagenomes.
Home-page: https://github.com/althonos/pyrodigal
Author: Martin Larralde
Author-email: martin.larralde@embl.de
License: GPLv3
Project-URL: Bug Tracker, https://github.com/althonos/pyrodigal/issues
Project-URL: Changelog, https://github.com/althonos/pyrodigal/blob/master/CHANGELOG.md
Project-URL: Zenodo, https://doi.org/10.5281/zenodo.4015169
Description: # 🔥 Pyrodigal [![Stars](https://img.shields.io/github/stars/althonos/pyrodigal.svg?style=social&maxAge=3600&label=Star)](https://github.com/althonos/pyrodigal/stargazers)
        
        *Python interface to [Prodigal](https://github.com/hyattpd/Prodigal/), an ORF
        finder for genomes, progenomes and metagenomes.*
        
        [![GitLabCI](https://img.shields.io/gitlab/pipeline/larralde/pyrodigal/master?gitlab_url=https%3A%2F%2Fgit.embl.de&logo=gitlab&style=flat-square&maxAge=600)](https://git.embl.de/larralde/pyrodigal/-/pipelines)
        [![AppVeyor](https://img.shields.io/appveyor/build/althonos/pyrodigal/master.svg?logo=appveyor&maxAge=600&style=flat-square)](https://ci.appveyor.com/project/althonos/pyrodigal/history)
        [![Coverage](https://img.shields.io/codecov/c/gh/althonos/pyrodigal?style=flat-square&maxAge=3600)](https://codecov.io/gh/althonos/pyrodigal/)
        [![License](https://img.shields.io/badge/license-GPLv3-blue.svg?style=flat-square&maxAge=2678400)](https://choosealicense.com/licenses/gpl-3.0/)
        [![PyPI](https://img.shields.io/pypi/v/pyrodigal.svg?style=flat-square&maxAge=3600)](https://pypi.org/project/pyrodigal)
        [![Bioconda](https://img.shields.io/conda/vn/bioconda/pyrodigal?style=flat-square&maxAge=3600)](https://anaconda.org/bioconda/pyrodigal)
        [![Wheel](https://img.shields.io/pypi/wheel/pyrodigal.svg?style=flat-square&maxAge=3600)](https://pypi.org/project/pyrodigal/#files)
        [![Python Versions](https://img.shields.io/pypi/pyversions/pyrodigal.svg?style=flat-square&maxAge=600)](https://pypi.org/project/pyrodigal/#files)
        [![Python Implementations](https://img.shields.io/pypi/implementation/pyrodigal.svg?style=flat-square&maxAge=600&label=impl)](https://pypi.org/project/pyrodigal/#files)
        [![Source](https://img.shields.io/badge/source-GitHub-303030.svg?maxAge=2678400&style=flat-square)](https://github.com/althonos/pyrodigal/)
        [![GitHub issues](https://img.shields.io/github/issues/althonos/pyrodigal.svg?style=flat-square&maxAge=600)](https://github.com/althonos/pyrodigal/issues)
        [![Changelog](https://img.shields.io/badge/keep%20a-changelog-8A0707.svg?maxAge=2678400&style=flat-square)](https://github.com/althonos/pyrodigal.py/blob/master/CHANGELOG.md)
        [![Downloads](https://img.shields.io/badge/dynamic/json?style=flat-square&color=303f9f&maxAge=86400&label=downloads&query=%24.total_downloads&url=https%3A%2F%2Fapi.pepy.tech%2Fapi%2Fprojects%2Fpyrodigal)](https://pepy.tech/project/pyrodigal)
        [![DOI](https://img.shields.io/badge/doi-10.5281%2Fzenodo.4015169-purple?style=flat-square&maxAge=86400)](https://doi.org/10.5281/zenodo.4015169)
        
        
        ## 🗺️ Overview
        
        Pyrodigal is a Python module that provides bindings to Prodigal using
        [Cython](https://cython.org/). It directly interacts with the Prodigal
        internals, which has the following advantages:
        
        - **single dependency**: Pyrodigal is distributed as a Python package, so you
          can add it as a dependency to your project, and stop worrying about the
          Prodigal binary being present on the end-user machine.
        - **no intermediate files**: everything happens in memory, in a Python object
          you fully control, so you don't have to invoke the Prodigal CLI using a
          sub-process and temporary files.
        - **no input formatting**: sequences are manipulated directly as strings, which
          leverages the issue of formatting your input to FASTA for Prodigal.
        - **lower memory usage**: Pyrodigal is slightly more conservative when it comes
          to using memory, which can help process very large sequences. It also lets
          you save some more memory when running several *meta*-mode analyses
        
        ### 📋 Features
        
        The library now features everything needed to run Prodigal in single or
        metagenomic mode. It is still missing some features of the CLI:
        
        **Roadmap**:
        
        - ✔️ Metagenomic mode
        - ✔️ Single mode
        - ❌ External training file support (`-t` flag)
        - ❌ Region masking (`-m` flag)
        
        ### 🐏 Memory
        
        Contrary to the Prodigal command line, Pyrodigal attempts to be more conservative
        about memory usage. This means that most of the allocations will be lazy, and
        that some functions will reallocate their results to exact-sized arrays when
        it's possible. This leads to Pyrodigal using about 30% less memory, but with
        a little bit more overhead to compute the size of buffers in advance.
        
        ### 🧶 Thread-safety
        
        `pyrodigal.Pyrodigal` instances are thread-safe, and use an internal lock to
        prevent parallel calls to their methods from overwriting the internal buffers.
        However, a better solution to process sequences in parallel is to use a
        consumer/worker pattern, and have on `Pyrodigal` instance in each worker.
        Using a pool spawning `Pyrodigal` instances on the fly is also fine,
        but prevents recycling memory:
        ```python
        with multiprocessing.pool.ThreadPool() as pool:
            pool.map(lambda s: Pyrodigal(meta=True).find_genes(s), sequences)
        ```
        
        ## 🔧 Installing
        
        Pyrodigal can be installed directly from [PyPI](https://pypi.org/project/pyrodigal/),
        which hosts some pre-built CPython wheels for x86-64 Unix and Windows platforms,
        as well as the code required to compile from source with Cython:
        ```console
        $ pip install pyrodigal
        ```
        
        Otherwise, Pyrodigal is also available as a [Bioconda](https://bioconda.github.io/)
        package:
        ```console
        $ conda install -c bioconda pyrodigal
        ```
        
        ## 💡 Example
        
        Using [Biopython](https://biopython.org/), load a sequence from a
        [GenBank](http://www.insdc.org/files/feature_table.html) file, import Pyrodigal
        to find all genes it contains, and print the proteins in pseudo-FASTA format:
        ```python
        record = Bio.SeqIO.read("sequence.gbk", "genbank")
        p = pyrodigal.Pyrodigal(meta=True)
        
        for i, gene in enumerate(p.find_genes(str(record.seq))):
            print(f"> {record.id}_{i+1}")
            print(textwrap.fill(record.translate()))
        ```
        
        To use `Pyrodigal` in single mode, you must explicitly call `Pyrodigal.train`
        with the sequence you want to use for training before trying to find genes,
        or you will get a [`RuntimeError`](https://docs.python.org/3/library/exceptions.html#RuntimeError):
        ```python
        p = pyrodigal.Pyrodigal()
        p.train(str(record.seq))
        genes = p.find_genes(str(record.seq))
        ```
        
        ## 📜 License
        
        This library, like the original Prodigal software, is provided under the
        [GNU General Public License v3.0](https://choosealicense.com/licenses/gpl-3.0/).
        
Keywords: bioinformatics,genome,orf,gene,prodigal
Platform: any
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Programming Language :: C
Classifier: Programming Language :: Cython
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Programming Language :: Python :: Implementation :: PyPy
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: Topic :: Scientific/Engineering :: Medical Science Apps.
Classifier: Typing :: Typed
Requires-Python: >=3.6
Description-Content-Type: text/markdown
