Metadata-Version: 2.1
Name: ratschlab-common
Version: 0.1.0
Summary: Small library of common functionalities used in various projects in the ratschlab
Home-page: https://github.com/ratschlab/ratschlab-common
Author: ETH Zurich, Biomedical Informatics Group
Author-email: marc.zimmermann@inf.ethz.ch
License: MIT license
Keywords: ratschlab_common
Platform: UNKNOWN
Classifier: Development Status :: 2 - Pre-Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Requires-Dist: Click (==6.0)
Requires-Dist: dask[complete] (==0.17.1)
Requires-Dist: pandas (==0.22)
Requires-Dist: pyarrow (==0.8.0)
Requires-Dist: tables (==3.4.2)

================
ratschlab common
================

..

   .. image:: https://img.shields.io/pypi/v/ratschlab_common.svg
           :target: https://pypi.python.org/pypi/ratschlab_common

   .. image:: https://img.shields.io/travis/ratschlab/ratschlab_common.svg
           :target: https://travis-ci.org/ratschlab/ratschlab_common

   .. image:: https://readthedocs.org/projects/ratschlab-common/badge/?version=latest
           :target: https://ratschlab-common.readthedocs.io/en/latest/?badge=latest
           :alt: Documentation Status


Small library of common code used in various projects in the `ratschlab
<http://www.ratschlab.org>`_.  

* Free software: MIT license


Features
--------

* Writing parquet and HDF5 files with sensible defaults.
* Support for working with 'chunkfiles', i.e. splitting up a large dataset in smaller chunks which can be processed independently:

  * Repartition records (i.e. increase or decrease number of chunkfiles) while keeping data belonging together in the same file (e.g. data with the same patient id associated)
  * simple indexing for looking up in which chunk to find data belonging e.g. to a patient

* bigmatrix: support for creating and reading large matrices stored in HDF5 having additional metadata on the axes in form of data frames.








