Metadata-Version: 2.1
Name: pgformat
Version: 0.2.1
Summary: Property Graph Exchange Format (PG) parser and serializer
Author-email: Jakob Voß <jakob.voss@gbv.de>
Description-Content-Type: text/markdown
Classifier: License :: OSI Approved :: MIT License
Requires-Dist: lark
Project-URL: Homepage, https://github.com/pg-format/pg-format-py
Project-URL: Issues, https://github.com/pg-format/pg-format-py/issues

# pgformat

[![Test](https://github.com/pg-format/pg-format-py/actions/workflows/test.yml/badge.svg)](https://github.com/pg-format/pg-format-py/actions/workflows/test.yml)
[![PyPI](https://img.shields.io/pypi/v/pgformat?label=pypi%20package)](https://pypi.org/project/pgformat/)

> Property Graph Exchange Format (PG) parser and serializer

This package implements parsers and serializers to for labeled property graphs in [Property Graph Exchange Format](https://pg-format.github.io/).

[PG Format]: https://pg-format.github.io/specification/#pg-format
[PG JSONL]: https://pg-format.github.io/specification/#pg-jsonl
[PG JSON]: https://pg-format.github.io/specification/#pg-json

## Table of Contents

- [Install](#install)
- [Usage](#usage)
- [API](#api)
  - [parseGraph](#parseGraph)
  - [parseStatements](#parseStatements)
  - [serializeGraph](#serializeGraph)

## Install

Requires Python 3.8 or above.

~~~sh
pip install pgformat
~~~

<!-- TODO: install with conda -->

## Usage

The package comes with a simple command line client to convert between serialization forms of Property Graph Exchange format and an [API](#api) to be used in python code. The command line client by default reads [PG Format] and emits [PG JSONL].

~~~sh
usage: pgformat [-h] [-t {ndjson,json,pg}] [source]

Parse and transform property graphs in PG Format.

positional arguments:
  source                input file (default: - for standard input)

options:
  -h, --help            show this help message and exit
  -t {ndjson,json,pg}, --to {ndjson,json,pg}
                        target format
~~~

The functionality is a subset of the [pgraph](https://github.com/pg-format/pgraphs) command line tool written in NodeJS.

The command line client can also be called via `python -m pgformat`.

## API

The package exports the functions `parseGraph`, `parseStatements`, and `serializeGraph`:

~~~python
from pgformat import parseGraph, parseStatements, serializeGraph

statements = parseStatements("x\na -> b")   # list of statements
graph = parseGraph("a -> b")                # full graph

pg = serializeGraph(graph)                  # back to PG Format
~~~

The internal format of graphs and their statements is a data structure of plain lists and arrays for direct conversion to JSON.
This may be changed in a future version.

### parseGraph

~~~python
parseGraph(str: str, sort=True, implicitNodes=True) -> dict
~~~

Parse a property graph given as string in PG Format. Returns a dict with keys `nodes` and `edges` ([PG JSON] format).

Optional parameter `sort` sorts nodes and edges by their identifier, and labels by their value. Optional parameter `implicitNodes` adds nodes for node identifiers referenced in edges only.

### parseStatements

~~~python
parseStatements(str: str, implicitNodes=True, mergeNodes=True, duplicatedEdges=False) -> list
~~~

Parse a list of statements given as string in PG Format. Returns a list of node and edge statements in order of appearance ([PG JSON] format).

Optional parameter `mergeNodes` applies merging of nodes with same identifiers. Optional parameter `duplicatedEdges` throws an exception if set to `False`.

### serializeGraph

~~~python
serializeGraph(graph: dict) -> str
~~~

Convert a property graph to PG Format.

## License

Licensed under the MIT License.


