Metadata-Version: 2.1
Name: data-view
Version: 0.0.8
Summary: Automated view of dataset
Home-page: http://github.com/urevoleg/data-view
Author: Urev Oleg
Author-email: urevolegg@gmail.com
License: UNKNOWN
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown
Requires-Dist: pandas
Requires-Dist: numpy
Requires-Dist: matplotlib
Requires-Dist: ipython

Description
===========
Module, provides the function view, which displays general information on the data:
  - Result of method info()
  - Result of method describe()
  - for numeric / categorical signs - The number of missions in the data (number and percentage for each column)
  - Top-5 of the most frequent categorical signs (for each)

 Parameters:
 * d - table with data
 * only_numeric - True / False, default: True. True - information output only by numerical signs, False - information output by numerical and categorical signs.
 * full_stats - True / False, default: False. False - output information on numerical characteristics without interquartile range, data boundaries without outliers, True - complete output with data character.
 * histograms - True / False, default: True. True - output with building histograms for numerical signs, False - without building histograms

Top-5 elements of categorical signs

 The table is formed as follows. The postfix (_name / _count) is assigned to the name of the data column:
  - __name_ - category name
  - __count_ - number of elements in this category If there are less than 5 elements in the attribute, then the values ​​in the _count field are filled -1

Usage
-----
	$ pip install data_view

    $ python3

	import pandas as pd

	import numpy as np

    from data_view import view

    d = pd.DataFrame(np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]), columns=['a', 'b', 'c'])

    view(d, only_numeric=True, histograms=False)


