Metadata-Version: 2.1
Name: opencf
Version: 0.3.3a1
Summary: A collection of Python scripts for file conversion tasks, built on top of the opencf-core framework.
Home-page: https://github.com/Hermann-web/opencf
License: MIT
Keywords: file conversion,data processing,file formats
Author: Hermann Agossou
Author-email: agossouhermann7@gmail.com
Maintainer: Hermann Agossou
Maintainer-email: agossouhermann7@gmail.com
Requires-Python: >=3.9,<3.12
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Provides-Extra: document
Provides-Extra: media
Provides-Extra: tabular
Requires-Dist: aspose-words (>=24.4.0,<25.0.0) ; extra == "document"
Requires-Dist: generalimport (==0.5.2)
Requires-Dist: imageio (>=2.30.0,<3.0.0) ; extra == "media"
Requires-Dist: numpy (>=1.24.0,<2.0.0) ; extra == "media"
Requires-Dist: opencf-core (>=0.3.2,<0.4.0)
Requires-Dist: opencv-python (>=4.8.1.78,<5.0.0.0) ; extra == "media"
Requires-Dist: openpyxl (>=3.1.3,<4.0.0) ; extra == "tabular"
Requires-Dist: pandas (>=2.0.0,<3.0.0) ; extra == "tabular"
Requires-Dist: pdf2docx (>=0.5.8,<0.6.0) ; extra == "document"
Requires-Dist: pillow (>=10.3.0,<11.0.0) ; extra == "media"
Requires-Dist: pymupdf (>=1.24.5,<2.0.0) ; extra == "document"
Requires-Dist: pypdf (>=4.2.0,<5.0.0) ; extra == "document"
Requires-Dist: svgtrace (>=2024,<2025) ; extra == "media"
Project-URL: Documentation, https://opencf.readthedocs.io
Project-URL: Repository, https://github.com/Hermann-web/opencf
Description-Content-Type: text/markdown

# OpenCF: The File Conversion Scripts

The `opencf` package provides a collection of Python scripts for file conversion tasks, built on top of the [opencf-core](https://test.pypi.org/project/opencf-core/) framework. These scripts offer functionalities to convert various file formats, including text, CSV, JSON, XML, Excel, and image files, making it easy to handle different types of data transformations efficiently.

## Features

- **Extensive Conversion Support**: The package includes scripts for converting between various file formats, including text, XML, JSON, CSV, and Excel.
- **Integration with `opencf-core`**: Utilizes classes from the `opencf-core` package for file I/O operations, MIME type detection, and exception handling.
- **Modular Converter Classes**: Each conversion script is backed by a custom converter class tailored to handle specific file format conversions, extending the base converter class provided by `opencf-core`.
- **Flexible Input/Output Handling**: Supports reading from and writing to different file formats seamlessly, leveraging the capabilities of `opencf-core`.
- **Custom File Handlers**: Implements custom file reader and writer classes for Excel and image files, demonstrating extensibility and versatility.
- **Command-Line Interface**: Offers a command-line interface for executing file conversion tasks, allowing users to specify input and output file paths and types conveniently.
- **Extensibility**: Other input/output pairs converters can be easily added to augment the existing functionality, providing flexibility for handling a wider range of file formats.
- **Direct Integration**: For specific projects, each converter script can directly leverage the separated package `opencf-core` to build custom converters tailored to project requirements.

## Conversion Handlers

This module provides classes for converting between different file formats. It includes [concrete implementations of conversion classes](./opencf/converters.py) for various file types.

### Text Conversion

- `TextToTextConverter`: Merges text-based files (txt, md, xml, json, ...) into a single text file.

### XML and JSON Conversion

- `XMLToJSONConverter`: Converts XML files to JSON format. (Reader: XmlToTreeReader, Writer: DictToJsonWriter)

### CSV and Excel Conversion

- `CSVToXMLConverter`: Converts CSV files to XML format. (Reader: CsvToDictReader, Writer: TreeToXmlWriter)
- `XLSXToCSVConverter`: Converts Excel files to CSV format. (Reader: XlsxToDictReader, Writer: DictToCsvWriter)
- `CSVToXLSXConverter`: Converts CSV files to EXCEL format. (Reader: CsvToDictReader, Writer: DictToXlsxWriter)

### Image Conversion

- `ImageToPDFConverterWithPyPdf`: Converts image files to PDF format using PyPDF. (Reader: ImageToPillowReader, Writer: PyPdfToPdfWriter)
- `ImageToPDFConverterWithPillow`: Converts img files to pdf format using Pillow. (Reader: ImageToPillowReader, Writer: PillowToPDFWriter)

### PDF Conversion

- `PDFToImageConverterwithPymupdf`: Converts PDF files to image format using PyMuPDF. (Reader: PdfToPymupdfReader, Writer: PymupdfToImageWriter)
- `PDFToImageExtractorwithPypdf`: Extracts images from PDF files using PyPDF. (Reader: PdfToPyPdfReader, Writer: PypdfToImageExtractorWriter)
- `PDFToImageExtractorwithPymupdf`: Extracts images from PDF files using PyMuPDF. (Reader: PdfToPymupdfReader, Writer: PymupdfToImageExtractorWriter)
- `PDFToDocxConvertorwithPdf2docx`: Converts PDF files to docx format using pdf2docx. (Reader: Pdf2DocxReader, Writer: Pdf2DocxWriter)
- `PDFToDocxWithAspose`: Converts PDF files to docx format using Aspose.Words for Python. (Reader: AsposeReader, Writer: AsposeWriter)

### PDF Manipulation

- `MergePDFswithPypdf`: Merges multiple PDF files into a single PDF. (Reader: PdfToPyPdfReader, Writer: PyPdfToPdfWriter)

## Video Generation

- `ImageToVideoConverterWithPillow`: Converts image files to video format using Pillow. (Reader: ImageToPillowReader, Writer: VideoArrayWriter)
- `ImageToVideoConverterWithOpenCV`: Converts image files to video format using OpenCV. (Reader: ImageToOpenCVReader, Writer: VideoArrayWriter)
- `VideoToGIFConverter`: Converts image files to GIF format using OpenCV and ImageIO. (Reader: VideoToFramesReaderWithOpenCV, Writer: FramesToGIFWriterWithImageIO)
- ...

## Getting Started

To use the `opencf` package, follow these steps:

1. Install the package along with its dependencies using your preferred package manager.
2. Import the required classes into your Python scripts or applications, ensuring that the `opencf-core` package is accessible.
3. Utilize the provided converter classes to perform file format conversions as needed, specifying input and output file paths and types.
4. Execute the scripts either programmatically or via the command-line interface, providing necessary arguments for file conversion tasks.

## Example Usage

Here's an example demonstrating how to use the `opencf` package for converting an PNG file to PDF:

```bash
# Install the package along with its dependencies (if not already installed)
pip install -i https://test.pypi.org/simple/ opencf

# Run the converter script
opencf examples/input/example.png -o examples/output/example.pdf
# or
opencf examples/input/example.png -ot pdf
```

This command executes the `ConverterApp` class, initiating the conversion process from an PNG file to a PDF file using the appropriate converter classes.

## Other examples

- pdf to png

You can set, the output argument as a folder. So, the png files would be in that folder. Using a filepath instead of a folder may yield an error.

```bash
opencf examples/input/example.pdf -o examples/output -ot png
```

This command will write png files into the `examples/output` as mentioned

## Contribution

Contributions to the `opencf` package are welcome! Feel free to submit bug reports, feature requests, or pull requests via the GitHub repository. Additionally, consider extending the functionality by adding support for additional file formats or improving existing converter classes.

