Metadata-Version: 2.1
Name: peduncle
Version: 0.0.1
Summary: Simple Python content extractor for html
Project-URL: Homepage, https://github.com/midstreeeam/peduncle
Project-URL: Bug Tracker, https://github.com/midstreeeam/peduncle/issues
Author-email: Midstream <midstream.lou@gmail.com>
License-File: LICENSE
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Requires-Python: >=3.7
Description-Content-Type: text/markdown

# peduncle

very very very simple DOM based HTML content extraction tool (less than 100 line)

easy but useable

work with python 3.7+

## usage

```Python
import requests
from grader import Grader

# obtain the raw html
url="https://blog.rust-lang.org/2023/05/29/RustConf.html"
html = requests.get(url).text

# then you get content
G = Grader(html)
print(G.main_node.text)
```

