Metadata-Version: 2.1
Name: data_science_project_template
Version: 0.2.3
Summary: Data Science Project Template
Author: Shailaja Janjirala
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas
Requires-Dist: numpy
Requires-Dist: scikit-learn
Requires-Dist: mlflow
Requires-Dist: sqlalchemy
Requires-Dist: gcsfs

# Data Science Project Template

A Python package for streamlining data science projects by providing a standardized folder structure and essential tooling.

## Features

- Standardized project structure for data science workflows.
- Pre-configured directories for data, models, notebooks, and more.
- Ready-to-use integrations for cloud services (e.g., GCP), MLFlow, and S3 buckets.
- Modular and flexible for a variety of use cases.

## Installation

Install the package using pip:

```bash
pip install data-science-project-template==0.1.0

## Usage
Initialize a new data science project using the template:

```bash
from data_science_project_template import initialize_project

initialize_project('my_new_project')

##Folder Structure

├── data/                   # Raw, processed, and interim data
├── models/                 # Trained models and serialized outputs
├── notebooks/              # Jupyter notebooks for exploration and analysis
├── src/                    # Source code for ETL, feature engineering, and model building
├── config/                 # Configuration files for experiments and pipelines
└── docs/                   # Documentation and references

## Contributing

Contributions are welcome! Please open an issue or submit a pull request for any feature requests or bug fixes.

## License

This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.

