Metadata-Version: 2.1
Name: contentfetch
Version: 0.0.4
Summary: Extracting the content from the webpage
Home-page: UNKNOWN
Author: aidotio
Author-email: neneranadheer@gmail.com
License: UNKNOWN
Keywords: webpage,scrape,crawl
Platform: UNKNOWN
Classifier: Development Status :: 1 - Planning
Classifier: Intended Audience :: Developers
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: Unix
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: Microsoft :: Windows
Description-Content-Type: text/markdown
Requires-Dist: beautifulsoup4
Requires-Dist: chromedriver-binary
Requires-Dist: fake-useragent
Requires-Dist: news-please
Requires-Dist: newspaper3k
Requires-Dist: pandas
Requires-Dist: requests
Requires-Dist: selenium
Requires-Dist: unidecode


# content-fetch


This package extracts the text from a web page. The `extract_content` function accepts either a path to an HTML file on your disk (`html`) or the URL of a web page (`url`).

Install the package by running the following command:

```
pip install contentfetch
```


Use the following snippet to parse the text from a local HTML file:

```
import contentfetch

# Placeholder: replace with the path to your own HTML file
html_file = "page.html"

results_json = contentfetch.extract_content(html=html_file)
print(results_json)

### OR

from contentfetch import extract_content

results_json = extract_content(html=html_file)
print(results_json)
```

Use the following snippet to parse the text from a web page given its URL:

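```
import contentfetch

# Placeholder: replace with the URL of the page you want to parse
webpage_url = "https://example.com"

results_json = contentfetch.extract_content(url=webpage_url)
print(results_json)

### OR

from contentfetch import extract_content

results_json = extract_content(url=webpage_url)
print(results_json)
```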
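
If the source may be either a file path or a URL, the choice between the `html` and `url` keywords can be made automatically. The following is a minimal sketch, not part of the package: `source_kwargs` and `fetch` are hypothetical helper names, and the rule "anything starting with `http://` or `https://` is a URL, everything else is a local path" is an assumption.

```python
from urllib.parse import urlparse


def source_kwargs(source: str) -> dict:
    """Map a source string to the keyword argument used in the snippets above."""
    scheme = urlparse(source).scheme.lower()
    # Assumption: only http(s) strings are URLs; anything else is a local path.
    if scheme in ("http", "https"):
        return {"url": source}
    return {"html": source}


def fetch(source: str):
    """Hypothetical convenience wrapper around contentfetch.extract_content."""
    # Imported lazily so source_kwargs can be used without the package installed.
    from contentfetch import extract_content

    return extract_content(**source_kwargs(source))
```

With this helper, `fetch("page.html")` and `fetch("https://example.com")` both go through the same call; only the keyword passed to `extract_content` differs.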



