Metadata-Version: 2.1
Name: gwdama
Version: 0.4.1
Summary: A GW data manager package and more
Home-page: https://gwdama.readthedocs.io/en/latest/index.html
Author: Pisa Detchar Group
Author-email: francesco.direnzo@df.unipi.it
License: UNKNOWN
Download-URL: https://gitlab.com/gwprojects/gwdama
Description: This package aims at providing a unified and easy to use interface to access and manipulate Gravitational Wave (GW) data. This can be read from different locations, such as from local `frame files (gwf) <https://lappweb.in2p3.fr/virgo/FrameL/>`_ and especially from the `Gravitational Wave Open Science Center <https://www.gw-openscience.org/>`_ (GWOSC). It will also include some convenient plotting and data analysis methods aimed at Detector Characterization.
        
        .. image:: https://readthedocs.org/projects/gwdama/badge/?version=latest
            :target: https://gwdama.readthedocs.io/en/latest/?badge=latest
            :alt: Documentation Status
            
        .. image:: https://badge.fury.io/py/gwdama.svg
            :target: https://badge.fury.io/py/gwdama
        
        =================================
         GWdata manager package overview
        =================================
        
        **gwdama** currently comprises the main class ``GwDataManager``, which behaves as a multi-purpose and multi-format container for data, not just that related to GW's. This is based on `h5py.File class <http://docs.h5py.org/en/stable/high/file.html>`_ with the addition of the methods and the attributes to import and manipulate GW data, and  some functions for data analysis of time series. Differently from the common application of ``h5py.File`` objects, a ``GwDataManager`` instance is, by default, set to occupy only a temporary file, which is authomatically deleted by python once closed, or some space in the RAM. Refer to the `full documentation <'https://gwdama.readthedocs.io/en/latest/index.html'>`_ for further details. 
        
        Inside ``GwDataManager`` objects, data is stored into `h5py.Dataset like <http://docs.h5py.org/en/stable/high/dataset.html>`_ objects organised into a hierarchical structure of `h5py.Groups <http://docs.h5py.org/en/stable/high/group.html>`_ and sub-groups. These Dataset are created within an instance of ``GwDataManager()`` with the usual methods of ``h5py``: ``create_dataset(name, shape, dtype)``. They contain data, typically of numeric type but also strings, and some attributes (or metadata). For example, for GW data, and in general all time series, it is important the information of when they have been recorded, and at which sampling frequency. A neme and a unit are also usefull. These can be conveniently added and customised. Also a ``GwDatamanager`` object contains attributes for itself. 
        
        When the data have been acquired, pre-processed, and organised into groups, they can be saved to ``hdf5`` format, and later read back.
        
        --------------
         Installation
        --------------
        
        **gwdama** can be installed via `pip <https://docs.python.org/3/installing/index.html>`_:
        
        .. code-block:: console
        
            $ pip install gwdama
        
        and requires Python 3.6.0 or higher. The previous command automatically fulfil all the required dependencies (like on ``numpy``, ``matplotlib``), so you are ready to start generating datasets and making plots.
        
        Further details can be found in the `full documentation <'https://gwdama.readthedocs.io/en/latest/index.html'>`_.
        
        
        --------------------
         Creating a dataset
        --------------------
        
        A dataset of, say, random numbers can be readily created as::
        
            >>> from gwdama.io import GwDataManager
            >>> import numpy as np
            
            >>> dama = GwDataManager("my_dama")
            >>> dama.create_dataset('random_n', data=np.random.normal(0, 1, (10,)))
            >>> dama.create_dataset('a_list', data=[1, 2, 3, 4])
            >>> dama.create_dataset('a_string', data="this is a string")
            
        The *string representation* of the ``GwDataManager`` class provides a quick look at its structure and its attributes. You can call it with the method ``__str__()`` or simply by printing the object::
        
            >>> print(dama)
            my_dama:
              ├── a_list
              ├── a_string
              └── random_n
        
              Attributes:
                 dama_name : my_dama
                time_stamp : 20-07-28_19h36m47s
            
        Other attributes can be added to both the ``GwDataManager`` object and the Datasets therein::
        
            >>> dama.attrs['owner'] = 'Francesco'
            >>> dama.show_attrs
            my_dama:
              ├── a_list
              ├── a_string
              └── random_n
        
              Attributes:
                 dama_name : my_dama
                     owner : Francesco
                time_stamp : 20-07-28_19h36m47s  
                
        Datasets can be accessed from their *keys*, as reported in the structure shown above, with a syntax similar to that for dictionaries in Python::
        
            >>> dset = dama['random_n']      # 'random_n' is the dataset key
            >>> dset.attrs['t0'] = 0         # It is conveninet to use gps times
            >>> dset.attrs['fsample'] = 10   # measured in Hz
            
            >>> dset.show_attrs
            fsample : 10
                 t0 : 0
        
        To get the data contained in this dataset, call its attribute ``data``:
        ::
        
            >>> dset.data
            array([-0.73796689, -1.34206706, -0.97898291, -0.19846702,
                   -0.85056961,  0.20206334,  0.84720009,  0.19527366,
                   -0.9246727 , -0.04808732])
        
        ------------------------------
         Writing and reading datasets
        ------------------------------
        
        Now it is time to *write your data* to disc. Let's say we want to store the information contained in our ``GwdataManager`` object into the `hdf5 file <https://www.hdfgroup.org/>`_ ``out_dataset.h5``:
        ::
        
            >>> out_f = 'out_dataset.h5'
            >>> write_gwdama_dataset(out_f)
            
        Then remember to **close your previous file** before leaving the session:
        ::
        
            >>> dama.close()
            >>> del dama       # Redundant but usefull
        
        .. note:: This operation is automatically performed every time the session is closed. However, it is good practice to do this manually every time there is no more need of a certain variable.
        
        To *read data* back your data::
        
            >>> new_dama = GwDataManager(out_f)
            Reading dama
            >>> print(new_dama)
            my_dama:
              ├── a_list
              ├── a_string
              └── random_n
        
              Attributes:
                 dama_name : my_dama
                     owner : Francesco
                time_stamp : 20-07-30_12h19m32s
        
        
        
        ----------------
         Read open data 
        ----------------
        
        Open data can be accessed from both online and local virtual disks provided by `CVMFS <https://cernvm.cern.ch/fs/>`_. 
        
        From online GWOSC
        -----------------
        
        ::
        
            >>> event_gps = 1186746618                                      # GW170814
        
            >>> dama = GwDataManager()                                      # Default name 'mydama' assigned
            >>> dama.read_gwdata(event_gps - 50, event_gps +10, ifo='L1',   # Required params
                                 m_data_source="gwosc-remote",              # data source
                                 dts_key='online')                          # group key (optional, but useful)
        
        
        From local CVMFS
        ----------------
         
        CernVM-FS must be installed and configured on your computer. Refer to its `description on the GWOSC website <https://www.gw-openscience.org/cvmfs/>`_ 
        or to `this Quick start guide <https://cernvm.cern.ch/portal/filesystem/quickstart>`_.
        
        Assuming your data are stored at the following path::
        
           cvmfs_path = '/data2/cvmfs/gwosc.osgstorage.org/gwdata/' 
        
        data can be read with:
        
        ::
        
            >>> start='2017-06-08 01:00:00'  # starting time as a string
            >>> end='2017-06-08 02:00:00'    # ending time as a string
            >>> ifo='H1'                     # interfereometer tag
        
            >>> rate='4k'                    # sample rate: 4k or 16k
            >>> frmt='hdf5'                  # format of the data: gwf or hdf5
            
            >>> dama.read_gwdata(start, end, m_data_source="gwosc-cvmfs", ifo=ifo, m_data_format=frmt)
            
            
        -----------
         Changelog
        -----------
        
        0.4.1
        -----
        * Methods:
            - ``hist`` method to generate plot of Dataset object;
            - ``duration`` method to get the duration in seconds of the dataset.
        * Attributes:
            - ``gropus``, returning a list with all the groups and subgroups (included dataset) contained in a GwDataManager onject.
        * Preprocessing functions:
            - PSD
            - whiten
            - taper
        
        0.4.0
        -----
        
        * Implemented support for data on Virgo Farm.
        
        0.3.0
        -----
        * Only open data can be imported either from online or via CVMFS;
        * New methods to access data and attributes of datasets.
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/x-rst
