Metadata-Version: 2.1
Name: sudachi_c_imitator
Version: 0.2.1
Summary: Imitate Japanese morphological analysis of Sudachi SplitMode.C and SudachiDict-full with a small onnx model.
Author-email: Yutaka Nakano <nknytk.dev@gmail.com>
License: CC-BY-SA 3.0
        
        This model was created using data from Wikipedia CirrusSearch.
        https://dumps.wikimedia.org/other/cirrussearch/
        
Project-URL: Homepage, https://github.com/nknytk/ma-imitator
Project-URL: Bug Tracker, https://github.com/nknytk/ma-imitator/issues
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Operating System :: OS Independent
Requires-Python: <=3.11,>=3.8
Description-Content-Type: text/markdown
License-File: LICENSE

# Sudachi C Imitator

This package imitates the Japanese morphological analysis of Sudachi SplitMode.C and SudachiDict-full with a small ONNX model.  
It adds tokenization and part-of-speech estimation to your environment  
with only 4 MB of additional disk space, provided onnxruntime is already installed.

## Installation

```
$ pip install sudachi_c_imitator
```

## Usage Examples

```python
>>> import sudachi_c_imitator
>>> tagger = sudachi_c_imitator.Tagger()
>>> sample_text = '使い方のサンプルです。'
>>> tagger.parse(sample_text)
[('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'), ('です', '助動詞'), ('。', '補助記号')]
```
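
As the example shows, `parse` returns a list of `(surface, part-of-speech)` tuples, so the result can be post-processed with ordinary list operations. The sketch below filters tokens by tag; the `tokens` list is copied from the output above rather than produced by a live call, and `filter_by_pos` is a hypothetical helper, not part of this package.

```python
# Output copied from the usage example; in practice this would be
# the return value of tagger.parse(sample_text).
tokens = [('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'),
          ('です', '助動詞'), ('。', '補助記号')]


def filter_by_pos(tokens, pos):
    """Return the surface forms whose part-of-speech tag equals `pos`."""
    return [surface for surface, tag in tokens if tag == pos]


# Keep only the nouns (名詞).
nouns = filter_by_pos(tokens, '名詞')
print(nouns)  # ['使い方', 'サンプル']
```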

The length of the input string must be 256 or less.
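
Longer text therefore has to be split before it is passed to `parse`. The following is a minimal sketch of one way to do that, assuming the limit is counted in Python characters (`len(text)`), which this README does not state explicitly. `split_for_tagger` and its `delimiters` default are hypothetical and not part of this package.

```python
def split_for_tagger(text, max_len=256, delimiters='。！？\n'):
    """Split `text` into chunks of at most `max_len` characters.

    Each chunk ends just after the last delimiter found inside the
    current window; if the window contains no delimiter, the text is
    cut at exactly `max_len` characters.
    """
    chunks = []
    start = 0
    while start < len(text):
        end = min(start + max_len, len(text))
        if end < len(text):
            # Position of the last delimiter in text[start:end], or -1.
            cut = max(text.rfind(d, start, end) for d in delimiters)
            if cut >= start:
                end = cut + 1
        chunks.append(text[start:end])
        start = end
    return chunks


print(split_for_tagger('abc。def。ghi', max_len=5))  # ['abc。', 'def。', 'ghi']

# Possible use with the tagger (requires the package to be installed):
# tagger = sudachi_c_imitator.Tagger()
# tokens = [tok for chunk in split_for_tagger(long_text)
#           for tok in tagger.parse(chunk)]
```

A hard cut can fall in the middle of a word, so tokens near such a boundary may differ from what a single pass over the full text would give.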
