Metadata-Version: 2.1
Name: swak
Version: 0.2.4
Summary: Swiss army knife for functional data-science projects.
Author-email: yedivanseven <yedivanseven@outlook.de>
Project-URL: Documentation, https://yedivanseven.github.io/swak/
Project-URL: Repository, https://github.com/yedivanseven/swak.git
Project-URL: Issues, https://github.com/yedivanseven/swak/issues
Project-URL: Changelog, https://github.com/yedivanseven/swak/blob/main/CHANGELOG.md
Keywords: functional,tools,utilities,helpers
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Utilities
Classifier: License :: OSI Approved :: MIT License
Requires-Python: >=3.12
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas>=2.2
Requires-Dist: pyarrow>=17.0
Requires-Dist: pyyaml>=6.0
Requires-Dist: tomli-w>=1.1
Requires-Dist: tqdm>=4.6
Provides-Extra: cloud
Requires-Dist: google-cloud-bigquery>=3.25; extra == "cloud"
Requires-Dist: google-cloud-storage>=2.18; extra == "cloud"
Requires-Dist: pandas-gbq>=0.23; extra == "cloud"

![GitHub Pages](https://github.com/yedivanseven/swak/actions/workflows/publish-documentation.yml/badge.svg)
![PyPI](https://github.com/yedivanseven/swak/actions/workflows/publish-package.yml/badge.svg)

# swak
_Swiss army knife for functional data-science projects._

## Introduction
This package is a collection of small, modular, and composable building
blocks implementing frequently occurring operations in typical data-science
applications. In abstracting away boiler-plate code, it thus saves time and effort.
* Consolidate all ways to configure your project (command-line arguments,
  environment variables, and config files) with the `cli` and `text`
  packages, respectively.
* Wrap the project config into a versatile `jsonobject`.
* Focus on writing small, configurable, modular, reusable, and testable 
  building blocks. Then use the flow controls in `funcflow` to compose them
  into arbitrarily complex workflows, that are still easy to maintain and to expand.
* Quickly set up projects on Google BigQuery and Google Cloud Storage, and
  efficiently download lots of data in parallel with `cloud.gcp`.
* Build powerful neural-network architectures from the elements in `pt` and
  train your deep-learning models with early stopping and checkpointing.
  From feature embedding, over feature importance, to repeated residual blocks,
  a broad variety of options is available.
* And much more ...

## Installation
* Create a new virtual environment running at least `python 3.12`.
* The easiest way of installing `swak` is from the python package index
[PyPI](https://pypi.org/project/swak/), where it is hosted. Simply type
  ```shell
  pip install swak
  ```
  or treat it like any other python package in your dependency management.
* If you need support for interacting with the Google Cloud Project,
in particular Google BigQuery and Google Cloud Storage, install
_extra_ dependencies with:
  ```shell
  pip install swak[cloud]
  ```
* In order to use the subpackage `swak.pt`, you need to have [PyTorch](https://pytorch.org/) installed.
Because there is no way of knowing whether you want to run it on CPU only or also on GPU and, if so,
which version of CUDA (or ROC) you have installed on your machine and how, it is not an explicit
dependency of `swak`. You will have to install it yourself, _e.g._, following
[these instructions](https://pytorch.org/get-started/locally/).
If you are using `pipenv` for dependency management, you can also have a look at the
[Pipfile](https://github.com/yedivanseven/swak/blob/main/Pipfile) in the root of the `swak`
[repository](https://github.com/yedivanseven/swak) and taylor it to your needs. Personally, I go
  ```shell
  pipenv sync --categories=cpu
  ```
  for a CPU only installation of PyTorch and
  ```shell
  pipenv sync --categories=cuda
  ```
  if I want GPU support.

## Usage
Try making a new repository using the [swak-template](https://github.com/yedivanseven/swak-template)
as a, well, template.

## Documentation
The API documentation to `swak` is hosted on [GitHub Pages](https://yedivanseven.github.io/swak/).
