Metadata-Version: 2.1
Name: nodestream
Version: 0.12.1
Summary: A Fast, Declarative ETL for Graph Databases.
Home-page: https://github.com/nodestream-proj/nodestream
License: GPL-3.0-only
Keywords: etl,neo4j,declarative,data,kafka,ingest
Author: Zach Probst
Author-email: Zach_Probst@intuit.com
Requires-Python: >=3.10,<4.0
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: License :: OSI Approved :: GNU Affero General Public License v3
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Database
Requires-Dist: Jinja2 (>=3,<4)
Requires-Dist: boto3 (>=1.26.137,<2.0.0)
Requires-Dist: cleo (>=2.0.1,<3.0.0)
Requires-Dist: confluent-kafka (>=2.3.0,<3.0.0)
Requires-Dist: cookiecutter (>=2.1.1,<3.0.0)
Requires-Dist: httpx (>=0.24.1,<0.25.0)
Requires-Dist: jmespath (>=1.0.1,<2.0.0)
Requires-Dist: pandas (>=2,<3)
Requires-Dist: psutil (>=5.9.6,<6.0.0)
Requires-Dist: pyarrow (>=15.0.0,<16.0.0)
Requires-Dist: python-json-logger (>=2.0.4,<3.0.0)
Requires-Dist: pyyaml (>=6.0,<7.0)
Requires-Dist: schema (>=0.7.5,<0.8.0)
Requires-Dist: uvloop (>=0.17.0,<=0.18.0) ; sys_platform == "darwin"
Requires-Dist: uvloop (>=0.17.0,<=0.18.0) ; sys_platform == "linux"
Project-URL: Documentation, https://nodestream-proj.github.io/nodestream
Project-URL: Repository, https://github.com/nodestream-proj/nodestream
Description-Content-Type: text/markdown


# Nodestream

_Fast, Declarative ETL for Graph Databases._

[![Demo](https://raw.githubusercontent.com/nodestream-proj/nodestream/e94d0faa024c0f8da1e83a4ff6d83746504d197e/docs/img/demo.gif)](https://badge.fury.io/py/nodestream)

#### Badges

[![Continuous Integration](https://github.com/nodestream-proj/nodestream/actions/workflows/ci.yaml/badge.svg)](https://github.com/nodestream-proj/nodestream/actions/workflows/ci.yaml)
[![codecov](https://codecov.io/gh/nodestream-proj/nodestream/branch/main/graph/badge.svg?token=HAPEVKQ6OQ)](https://codecov.io/gh/nodestream-proj/nodestream)
[![ApacheV2 License](https://img.shields.io/badge/License-Apache%202.0-yellow.svg)](https://opensource.org/license/apache-2-0/)
[![PyPI version](https://badge.fury.io/py/nodestream.svg)](https://badge.fury.io/py/nodestream)

## Features

- Flexible and extensible YAML based DSL for ETL jobs
- Connect to data sources like Kafka, AWS Athena, flat files, and more.
- Developer friendly
- Highly optimized with async and tuned query generation


## Getting Started

Install nodestream with `pip`

```bash
  pip install nodestream
  nodestream new --database neo4j my_project && cd my_project
  nodestream run sample -v
```

## Documentation

Visit our [Documentation](https://nodestream-proj.github.io/docs/) on Github Pages.


## Contributing

Contributions are always welcome!

See `contributing.md` for ways to get started.

Please adhere to this project's `code of conduct`.


## Authors

- Zach Probst ([@zprobst](https://www.github.com/zprobst))
- Chad Cloes ([@ccloes](https://www.github.com/ccloes))
- Oshri Rozenberg([@orozen](https://www.github.com/orozen))
- Kevin Neal ([@khneal](https://www.github.com/khneal))
- Grant Hoffman ([@grantleehoffman](https://www.github.com/grantleehoffman))


