Metadata-Version: 2.1
Name: tributary
Version: 0.1.3
Summary: Analytics library
Home-page: https://github.com/timkpaine/tributary
Author: Tim Paine
Author-email: timothy.k.paine@gmail.com
License: Apache 2.0
Keywords: analytics tools plotting
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Description-Content-Type: text/markdown
Requires-Dist: aiohttp (>=3.5.4)
Requires-Dist: aiofiles (>=0.4.0)
Requires-Dist: asyncpg (>=0.20.1)
Requires-Dist: boltons (>=20.1.0)
Requires-Dist: confluent-kafka (>=0.11.6)
Requires-Dist: future (>=0.17.1)
Requires-Dist: gevent (>=1.3.7)
Requires-Dist: graphviz (>=0.10.1)
Requires-Dist: ipython (>=7.0.1)
Requires-Dist: ipydagred3 (>=0.1.5)
Requires-Dist: numpy (>=1.15.3)
Requires-Dist: pandas (>=0.19.0pygraphviz>=1.5)
Requires-Dist: scipy (>1.2.0)
Requires-Dist: six (>=1.11.0)
Requires-Dist: socketIO-client-nexus (>=0.7.6)
Requires-Dist: sympy (>=1.5.1)
Requires-Dist: temporal-cache (>=0.0.6)
Requires-Dist: tornado (>=5.1.1)
Provides-Extra: dev
Requires-Dist: flake8 (>=3.7.8) ; extra == 'dev'
Requires-Dist: mock ; extra == 'dev'
Requires-Dist: pybind11 (>=2.4.0) ; extra == 'dev'
Requires-Dist: pytest (>=4.3.0) ; extra == 'dev'
Requires-Dist: pytest-cov (>=2.6.1) ; extra == 'dev'
Requires-Dist: Sphinx (>=1.8.4) ; extra == 'dev'
Requires-Dist: sphinx-markdown-builder (>=0.5.2) ; extra == 'dev'
Requires-Dist: aiohttp (>=3.5.4) ; extra == 'dev'
Requires-Dist: aiofiles (>=0.4.0) ; extra == 'dev'
Requires-Dist: asyncpg (>=0.20.1) ; extra == 'dev'
Requires-Dist: boltons (>=20.1.0) ; extra == 'dev'
Requires-Dist: confluent-kafka (>=0.11.6) ; extra == 'dev'
Requires-Dist: future (>=0.17.1) ; extra == 'dev'
Requires-Dist: gevent (>=1.3.7) ; extra == 'dev'
Requires-Dist: graphviz (>=0.10.1) ; extra == 'dev'
Requires-Dist: ipython (>=7.0.1) ; extra == 'dev'
Requires-Dist: ipydagred3 (>=0.1.5) ; extra == 'dev'
Requires-Dist: numpy (>=1.15.3) ; extra == 'dev'
Requires-Dist: pandas (>=0.19.0pygraphviz>=1.5) ; extra == 'dev'
Requires-Dist: scipy (>1.2.0) ; extra == 'dev'
Requires-Dist: six (>=1.11.0) ; extra == 'dev'
Requires-Dist: socketIO-client-nexus (>=0.7.6) ; extra == 'dev'
Requires-Dist: sympy (>=1.5.1) ; extra == 'dev'
Requires-Dist: temporal-cache (>=0.0.6) ; extra == 'dev'
Requires-Dist: tornado (>=5.1.1) ; extra == 'dev'

# <a href="https://tributary.readthedocs.io"><img src="docs/img/icon.png" width="300"></a>
Python Data Streams

[![Build Status](https://dev.azure.com/tpaine154/tributary/_apis/build/status/timkpaine.tributary?branchName=master)](https://dev.azure.com/tpaine154/tributary/_build/latest?definitionId=2&branchName=master)
[![GitHub issues](https://img.shields.io/github/issues/timkpaine/tributary.svg)](https://github.com/timkpaine/tributary/issues)
[![Coverage](https://img.shields.io/azure-devops/coverage/tpaine154/tributary/2/master)](https://dev.azure.com/tpaine154/tributary/_build?definitionId=2&_a=summary)
[![PyPI](https://img.shields.io/pypi/l/tributary.svg)](https://pypi.python.org/pypi/tributary)
[![PyPI](https://img.shields.io/pypi/v/tributary.svg)](https://pypi.python.org/pypi/tributary)
[![Docs](https://img.shields.io/readthedocs/tributary.svg)](https://tributary.readthedocs.io)

Tributary is a library for constructing dataflow graphs in python. Unlike many other DAG libraries in python ([airflow](https://airflow.apache.org), [luigi](https://luigi.readthedocs.io/en/stable/), [prefect](https://docs.prefect.io), [dagster](https://docs.dagster.io), [dask](https://dask.org), [kedro](https://github.com/quantumblacklabs/kedro), etc), tributary is not designed with data/etl pipelines or scheduling in mind. Instead, tributary is more similar to libraries like [mdf](https://github.com/man-group/mdf), [pyungo](https://github.com/cedricleroy/pyungo), [streamz](https://streamz.readthedocs.io/en/latest/), or [pyfunctional](https://github.com/EntilZha/PyFunctional), in that it is designed to be used as the implementation for a data model. One such example is the [greeks](https://github.com/timkpaine/greeks) library, which leverages tributary to build data models for [options pricing](https://www.investopedia.com/articles/optioninvestor/07/options_beat_market.asp). 

![](https://raw.githubusercontent.com/timkpaine/tributary/master/docs/img/example.gif)


# Installation
Install from pip:

`pip install tributary`

or from source

`python setup.py install`

# Stream Types
Tributary offers several kinds of streams:

## Streaming
These are synchronous, reactive data streams, built using asynchronous python generators. They are designed to mimic complex event processors in terms of event ordering.

## Functional
These are functional streams, built by currying python functions (callbacks). 

## Lazy
These are lazily-evaluated python streams, where outputs are propogated only as inputs change. They are implemented as directed acyclic graphs.

# Examples
- [Streaming](docs/examples/streaming/streaming.md): In this example, we construct a variety of forward propogating reactive graphs.
- [Lazy](docs/examples/lazy/lazy.md): In this example, we construct a variety of lazily-evaluated directed acyclic computation graphs. 
- [Automatic Differentiation](docs/examples/autodiff/autodiff.md): In this example, we use `tributary` to perform automatic differentiation on both lazy and streaming graphs.

# Graph Visualization
You can visualize the graph with Graphviz. All streaming and lazy nodes support a `graphviz` method.

Streaming and lazy nodes also support [ipydagred3](https://github.com/timkpaine/ipydagred3) for live update monitoring.

## Streaming
![](https://raw.githubusercontent.com/timkpaine/tributary/master/docs/img/streaming/dagred3.gif)

Here green indicates executing, yellow indicates stalled for backpressure, and red indicates that `StreamEnd` has been propogated (e.g. stream has ended).

## Lazy
![](https://raw.githubusercontent.com/timkpaine/tributary/master/docs/img/lazy/dagred3.gif)

Here green indicates executing, and red indicates that the node is dirty. Note the the determination if a node is dirty is also done lazily (we can check with `isDirty` whcih will update the node's graph state.

# Sources and Sinks
## Sources
- Python Function/Generator/Async Function/Async Generator
- Random - generates a random dictionary of values
- File - streams data from a file, optionally loading each line as a json
- Kafka - streams data from kafka
- Websocket - strams data from a websocket
- Http - polls a url with GET requests, streams data out
- SocketIO - streams data from a socketIO connection

## Sinks
- File - data to a file
- Kafka - streams data to kafka
- Http - POSTs data to an url
- Websocket - streams data to a websocket
- SocketIO - streams data to a socketIO connection

# Transforms
## Modulate
- Delay - Streaming wrapper to delay a stream
- Apply - Streaming wrapper to apply a function to an input stream
- Window - Streaming wrapper to collect a window of values
- Unroll - Streaming wrapper to unroll an iterable stream
- UnrollDataFrame - Streaming wrapper to unroll a dataframe into a stream
- Merge - Streaming wrapper to merge 2 inputs into a single output
- ListMerge - Streaming wrapper to merge 2 input lists into a single output list
- DictMerge - Streaming wrapper to merge 2 input dicts into a single output dict. Preference is given to the second input (e.g. if keys overlap)
- Reduce - Streaming wrapper to merge any number of inputs

## Calculations
### Arithmetic Operators
- Noop (unary) - Pass input to output
- Negate (unary) - -1 * input
- Invert (unary) - 1/input
- Add (binary) - add 2 inputs
- Sub (binary) - subtract second input from first
- Mult (binary) - multiple inputs
- Div (binary) - divide first input by second
- RDiv (binary) - divide second input by first
- Mod (binary) - first input % second input
- Pow (binary) - first input^second input
- Sum (n-ary) - sum all inputs
- Average (n-ary) - average of all inputs

### Boolean Operators
- Not (unary) - `Not` input
- And (binary) - `And` inputs
- Or (binary) - `Or` inputs

### Comparators
- Equal (binary) - inputs are equal
- NotEqual (binary) - inputs are not equal
- Less (binary) - first input is less than second input
- LessOrEqual (binary) - first input is less than or equal to second input
- Greater (binary) - first input is greater than second input
- GreaterOrEqual (binary) - first input is greater than or equal to second input

### Math
- Log (unary)
- Sin (unary)
- Cos (unary)
- Tan (unary)
- Arcsin (unary)
- Arccos (unary)
- Arctan (unary)
- Sqrt (unary)
- Abs (unary)
- Exp (unary)
- Erf (unary)

## Converters
- Int (unary)
- Float (unary)
- Bool (unary)
- Str (unary)

## Python Builtins
- Len (unary)

## Rolling
- RollingCount - Node to count inputs
- RollingMin - Node to take rolling min of inputs
- RollingMax - Node to take rolling max of inputs
- RollingSum - Node to take rolling sum inputs
- RollingAverage - Node to take the running average
- SMA - Node to take the simple moving average over a window
- EMA - Node to take an exponential moving average over a window

## Node Type Converters
- Lazy->Streaming


