Metadata-Version: 1.0
Name: bert-sent-encoding
Version: 0.1.2
Summary: A bert sentence encoding tool
Home-page: https://gitlab.leihuo.netease.com/shaojianzhi/bert-sent-encoding
Author: Shao Jianzhi
Author-email: shaojianzhi2012@163.com
License: BSD
Description-Content-Type: text/markdown
Description: This is a bert sentence encoding tool.
        
        ## How to use it
        ### install package
        
            pip install --index-url https://pypi.python.org/simple/ bert-sent-encoding==0.1.1
        or
        
            git clone ssh://git@gitlab.leihuo.netease.com:32200/shaojianzhi/bert-sent-encoding.git
            cd bert-sent-encoding
            python setup.py install
        ### then use it
        
            from bert_sent_encoding import bert_sent_encoding
            bse = bert_sent_encoding(model_path='bert_sent_encoding/model/chinese_L-12_H-768_A-12', seq_length=64, batch_size=8, word_vector=False, layer=-1)
            vector = bse.get_vector('你好')   # 1. get vector of string
            vectors = bse.get_vector(['你好', '哈哈'])  # 2. get vector list of strings
            bse.write_txt2vector(input_file, output_file)   # 3. get and write vectors of strings
            
            
        ### for line:
            bse = bert_sent_encoding(model_path='bert_sent_encoding/model/chinese_L-12_H-768_A-12', seq_length=64, batch_size=8)
        model_path is required, seq_length, batch_size and word_vector are optional
        
        ### for line:
            bse.write_txt2vector(input_file, output_file)   # 3. get and write vectors of strings
        path of **input_file** and **output_file** are defined by user and below is content of **input_file**:
            
            the first line text
            the second line text
            ...
        
Platform: UNKNOWN
