Metadata-Version: 2.1
Name: superseeker
Version: 0.1.dev3
Summary: SuperSeeker_Pipeline is a Python library for identifying subclonal evolution in cancer.
Home-page: https://github.com/gageblack/superseeker_pipeline
Author: Gage Black
Author-email: gage.black@utah.edu
License: MIT
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: numpy
Requires-Dist: pandas


# SuperSeeker Pipeline

The SuperSeeker pipeline is a Python library for identifying subclonal evolution in cancer. It integrates various tools and scripts to process VCF files, cluster variants using PyClone-VI, and reconstruct subclonal evolutionary trees using SuperSeeker.

## Installation

### Step 1: Install External Tools

#### Installing PyClone-VI

PyClone-VI is used for clustering genetic variants. You can install it using Conda:

```bash
conda install -c bioconda pyclone-vi
```

#### Installing SuperSeeker

SuperSeeker is used for reconstructing the evolutionary history of subclones. You can install it from its source repository:

```bash
git clone https://github.com/yiq/SuperSeeker.git
cd SuperSeeker
```

Use the `requirements.txt` file to install the necessary Python packages:

```bash
pip install -r requirements.txt
```

### Step 3: Install the SuperSeeker Python Package

Finally, install the SuperSeeker Pipeline Python package:

```bash
pip install superseeker_pipeline
```
## Usage

To use the SuperSeeker pipeline, you can run the `run_pipeline` function from the `superseeker.pipeline` module. Here's an example:

```python
from superseeker.pipeline import run_pipeline

run_pipeline(
    patient='patient1',
    vcf_file='input_file.vcf',
    restarts=100,
    clusters=10
)
```

## License

This project is licensed under the MIT License - see the [LICENSE.txt](LICENSE.txt) file for details.
