Metadata-Version: 2.1
Name: shrike
Version: 2.0.0.dev4
Summary: Python utilities for compliant Azure machine learning
Home-page: https://github.com/ai-platform-ml-platform/shrike
Author: AML Data Science
Author-email: aml-ds@microsoft.com
License: MIT
Platform: UNKNOWN
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Requires-Python: ~=3.6
Description-Content-Type: text/markdown
Requires-Dist: azureml-core (>=1.37.0)
Requires-Dist: ruamel.yaml (~=0.16)
Requires-Dist: tqdm (>=4)
Provides-Extra: build
Requires-Dist: argparse-dataclass (~=0.1) ; extra == 'build'
Requires-Dist: omegaconf (<2.2) ; extra == 'build'
Requires-Dist: rsa (~=4.7) ; extra == 'build'
Requires-Dist: ruamel.yaml (~=0.16) ; extra == 'build'
Requires-Dist: semver (~=2.13) ; extra == 'build'
Requires-Dist: gitpython (~=3.1) ; extra == 'build'
Requires-Dist: jsonpath-ng (~=1.5) ; extra == 'build'
Requires-Dist: chardet (~=4.0) ; extra == 'build'
Requires-Dist: packaging (~=20.0) ; extra == 'build'
Requires-Dist: opencensus-ext-azure (==1.0.7) ; extra == 'build'
Requires-Dist: mldesigner[pipeline] ; extra == 'build'
Requires-Dist: azure-core (>=1.26.2) ; extra == 'build'
Provides-Extra: dev
Requires-Dist: argparse-dataclass (~=0.1) ; extra == 'dev'
Requires-Dist: gitpython (~=3.1) ; extra == 'dev'
Requires-Dist: omegaconf (~=2.0) ; extra == 'dev'
Requires-Dist: rsa (~=4.7) ; extra == 'dev'
Requires-Dist: ruamel.yaml (~=0.16) ; extra == 'dev'
Requires-Dist: semver (~=2.13) ; extra == 'dev'
Requires-Dist: black (==22.3.0) ; extra == 'dev'
Requires-Dist: flake8 (==3.9.1) ; extra == 'dev'
Requires-Dist: typeguard (==2.11.1) ; extra == 'dev'
Requires-Dist: typing-extensions (>=4.0) ; extra == 'dev'
Requires-Dist: markdown-include (==0.6.0) ; extra == 'dev'
Requires-Dist: mkdocstrings-python-legacy (==0.2.2) ; extra == 'dev'
Requires-Dist: mkdocstrings (==0.18.1) ; extra == 'dev'
Requires-Dist: mkdocs-material (==9.1.8) ; extra == 'dev'
Requires-Dist: livereload (==2.6.3) ; extra == 'dev'
Requires-Dist: markdown (==3.3.7) ; extra == 'dev'
Requires-Dist: pytest (~=6.2) ; extra == 'dev'
Requires-Dist: pytest-cov (~=2.11) ; extra == 'dev'
Requires-Dist: pytest-order (==0.10.0) ; extra == 'dev'
Requires-Dist: blake3 (==0.2.1) ; extra == 'dev'
Requires-Dist: twine (~=3.3) ; extra == 'dev'
Requires-Dist: wheel (~=0.36) ; extra == 'dev'
Requires-Dist: pyspark (==3.1.3) ; extra == 'dev'
Requires-Dist: vaex-core (==4.6.0) ; extra == 'dev'
Requires-Dist: vaex (==4.3.0) ; extra == 'dev'
Requires-Dist: toolz (==0.11.1) ; extra == 'dev'
Requires-Dist: numpy (==1.24.0) ; extra == 'dev'
Requires-Dist: pandas (==1.1.5) ; extra == 'dev'
Requires-Dist: scikit-learn (==0.24.2) ; extra == 'dev'
Requires-Dist: azure-mgmt-core (==1.3.0) ; extra == 'dev'
Provides-Extra: distributed
Requires-Dist: mpi4py (>=3.1.3) ; extra == 'distributed'
Requires-Dist: ray (>=1.9.2) ; extra == 'distributed'
Requires-Dist: dask[complete] (>=2022.1.0) ; extra == 'distributed'
Requires-Dist: protobuf (>=3.19.5) ; extra == 'distributed'
Provides-Extra: pipeline
Requires-Dist: azureml-pipeline-core (>=1.37.0) ; extra == 'pipeline'
Requires-Dist: azure-ml-component (==0.9.18) ; extra == 'pipeline'
Requires-Dist: gitpython (~=3.1) ; extra == 'pipeline'
Requires-Dist: hydra-core (~=1.0.3) ; extra == 'pipeline'
Requires-Dist: importlib-resources (~=5.9.0) ; extra == 'pipeline'
Requires-Dist: flatten-dict (~=0.4) ; extra == 'pipeline'
Requires-Dist: jsonpath-ng (~=1.5) ; extra == 'pipeline'
Requires-Dist: coolname (~=1.1) ; extra == 'pipeline'
Requires-Dist: omegaconf (~=2.1.0) ; extra == 'pipeline'
Requires-Dist: toposort (~=1.7) ; extra == 'pipeline'
Requires-Dist: azureml-dataset-runtime (~=1.40) ; extra == 'pipeline'
Requires-Dist: opencensus-ext-azure (==1.0.7) ; extra == 'pipeline'
Requires-Dist: pytest (~=6.2) ; extra == 'pipeline'
Requires-Dist: azure-mgmt-core (>=1.3.0) ; extra == 'pipeline'
Requires-Dist: azure-core (>=1.20.1) ; extra == 'pipeline'
Requires-Dist: argcomplete (>=1.12.3) ; extra == 'pipeline'
Requires-Dist: applicationinsights (>=0.11.10) ; extra == 'pipeline'
Requires-Dist: azure-cli-core (>=2.30.0) ; extra == 'pipeline'
Requires-Dist: azure-ai-ml (==1.7.2) ; extra == 'pipeline'

# Shrike: incubation for Azure ML

[![CodeQL](https://github.com/ai-platform-ml-platform/shrike/actions/workflows/codeql-analysis.yml/badge.svg)](https://github.com/ai-platform-ml-platform/shrike/actions/workflows/codeql-analysis.yml)
[![docs](https://github.com/ai-platform-ml-platform/shrike/actions/workflows/docs.yml/badge.svg)](https://github.com/ai-platform-ml-platform/shrike/actions/workflows/docs.yml)
[![python](https://github.com/ai-platform-ml-platform/shrike/actions/workflows/python.yml/badge.svg)](https://github.com/ai-platform-ml-platform/shrike/actions/workflows/python.yml)
[![Component Governance](https://dev.azure.com/msdata/Vienna/_apis/build/status/aml-ds/Azure.shrike%20Component%20Governance?branchName=main)](https://dev.azure.com/msdata/Vienna/_build/latest?definitionId=16088&branchName=main)
[![Python versions](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)
[![code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
[![codecov](https://codecov.io/gh/ai-platform-ml-platform/shrike/branch/main/graph/badge.svg?token=sSq0BKlfTu)](https://codecov.io/gh/ai-platform-ml-platform/shrike)
[![PyPI - Downloads](https://img.shields.io/pypi/dm/shrike)](https://pypi.org/project/shrike/)
[![PyPI version](https://badge.fury.io/py/shrike.svg)](https://badge.fury.io/py/shrike)
[![license: MIT](https://img.shields.io/badge/License-MIT-purple.svg)](LICENSE)

The `shrike` library is a set of Python utilities for running experiments in the 
[Azure Machine Learning](https://github.com/Azure/azureml-examples) platform (_a.k.a._ Azure ML). This
library contains four elements, which are:

-  `shrike.compliant_logging`: utilities for compliant logging and 
exception handling;
-  `shrike.pipeline`: helper code for managing, validating and submitting Azure
ML pipelines based on 
[azure-ml-component](https://aka.ms/azure-ml-component-reference) (_a.k.a._ the Component SDK);
-  `shrike.build`: helper code for packaging, building, validating, signing and
registering Azure ML components.
- `shrike.spark`: utilities for running jobs, especially those leveraging Spark
  .NET, in HDInsight and later Synapse.

## Documentation
For the full documentation of `shrike` with detailed examples and API reference, 
please see the [docs page](https://shrike-docs.com/).

For a list of problems (along with guidance and solutions) designed specifically
to help you learn how to use shrike, please refer to the information in
[this README file](https://github.com/Azure/azure-ml-problem-sets/blob/main/README.md#azure-ml-problems-aimed-at-learning-shrike)
(located in another GitHub repository).

## Installation

The `shrike` library is publicly available in PyPi. There are three optional extra dependencies: `pipeline`, `build`, and `dev`.
The `pipeline` dependency is for submitting Azure ML pipelines, `build` is for signing and registering components, 
and `dev` is for the development environment of `shrike`.

- If you are only planning on using the compliant-logging feature, please `pip install` without any extras:
```pwsh
pip install shrike
```
- If you are planning on signing and registering components, please `pip install` with `[build]`:
```pwsh
pip install shrike[build]
```
- If you are planning on submitting Azure ML pipelines, please `pip install` with `[pipeline]`:
```pwsh
pip install shrike[pipeline]
```
- If you would like to contribute to the source code, please `pip install` with all the dependencies:
```pwsh
pip install shrike[pipeline,build,dev]
```

Alternatively, for local development, you may use the Conda environment defined
in [environment.yml](./environment.yml). It pins the appropriate versions of
pip, Python, and installs all shrike together with all extras as an editable
package.

:warning: If you are using a ZSH terminal, please consider adding quotes,
e.g., `pip install "shrike[pipeline,build,dev]"` to avoid the accidental shell expansion.

## Migration from `aml-build-tooling`, `aml-ds-pipeline-contrib`, and `confidential-ml-utils`
If you have been using the `aml-build-tooling`, `aml-ds-pipeline-contrib`, or `confidential-ml-utils` libraries, 
please use the migration script ([migration.py](https://github.com/ai-platform-ml-platform/shrike/blob/main/migration.py)) to convert your repo or files and
adopt the `shrike` package with one simple command:
```pwsh
python migraton.py --input_path PATH/TO/YOUR/REPO/OR/FILE
```
:warning: This command will update files **in-place**. Please make a copy of your repo/file if you do not want to do so.

## Need Support?
If you have any feature requests, technical questions, or find
any bugs, please do not hesitate to reach out to us.

- For bug reports and feature requests, you are welcome to open an [issue](https://github.com/ai-platform-ml-platform/shrike/issues). 
- If you are a Microsoft employee, please refer to the 
[support page](https://aka.ms/aml/support) for details;
- If you are outside Microsoft, please send an email
to [aims-team@microsoft.com](mailto:aims-team@microsoft.com). 


## Contributing

This project welcomes contributions and suggestions. Most contributions require
you to agree to a Contributor License Agreement (CLA) declaring that you have
the right to, and actually do, grant us the rights to use your contribution.
For details, visit https://cla.opensource.microsoft.com.

To contribute, please start by creating a self-assigned [issue](https://github.com/ai-platform-ml-platform/shrike/issues/new/choose)
giving a high-level overview of what you'd like to do.
Once any discussion there concludes, follow up with a PR.

Please join the security group "aml-ds-guests" on [IDweb](https://idweb.microsoft.com/IdentityManagement/default.aspx), if you have difficulty
in creating a branch. When you submit a pull request, 
a CLA bot will automatically determine whether
you need to provide a CLA and decorate the PR appropriately (e.g., status check,
comment). Simply follow the instructions provided by the bot. You will only need
to do this once across all repos using our CLA.

This project has adopted the
[Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).
For more information see the
[Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/) or
contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any
additional questions or comments.


## Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft 
trademarks or logos is subject to and must follow 
[Microsoft's Trademark & Brand Guidelines](https://www.microsoft.com/en-us/legal/intellectualproperty/trademarks/usage/general).
Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship.
Any use of third-party trademarks or logos are subject to those third-party's policies.


