Metadata-Version: 2.3
Name: stacchip
Version: 0.1.38
Summary: Dynamically create image chips from STAC items
Project-URL: Homepage, https://github.com/Clay-foundation/stacchip
Project-URL: Bug Tracker, https://github.com/Clay-foundation/stacchip/issues
Author-email: Daniel Wiesmann <danielwiesmann@developmentseed.org>
License-File: LICENSE
Requires-Python: >=3.10
Requires-Dist: boto3>=1.29.0
Requires-Dist: geoarrow-pyarrow>=0.1.2
Requires-Dist: geopandas>=0.14.1
Requires-Dist: numpy<2.0,>=1.26.0
Requires-Dist: planetary-computer>=1.0.0
Requires-Dist: pyarrow>=14.0.1
Requires-Dist: pystac-client>=0.7.5
Requires-Dist: pystac>=1.9.0
Requires-Dist: rasterio>=1.3.9
Requires-Dist: rio-stac>=0.9.0
Provides-Extra: dev
Requires-Dist: build; extra == 'dev'
Requires-Dist: mock; extra == 'dev'
Requires-Dist: pytest; extra == 'dev'
Requires-Dist: ruff; extra == 'dev'
Requires-Dist: types-python-dateutil; extra == 'dev'
Provides-Extra: docs
Requires-Dist: mkdocs; extra == 'docs'
Requires-Dist: mkdocs-jupyter; extra == 'docs'
Requires-Dist: mkdocs-material; extra == 'docs'
Requires-Dist: nbconvert; extra == 'docs'
Requires-Dist: pdocs; extra == 'docs'
Requires-Dist: pygments; extra == 'docs'
Description-Content-Type: text/markdown

# stacchip

Dynamically create image chips for earth observation machine learning
applications using a custom chip index based on STAC items.

Get a STAC item, index its contents, and create chips dynamically
like so

```python
# Get item from an existing STAC catalog
item = stac.search(...)

# Index all chips that could be derived from the STAC item
index = Indexer(item).create_index()

# Use the index to get RGB array for a specific chip
chip = Chipper(index).chip(x=23, y=42)
```

## Installation

Stacchip is available on pypi

```bash
pip install stacchip
```

## Motivation

Remote sensing imagery is typically distributed in large files (scenes)
that typically have the order of 10 thousand of pixels in both the x and y
directions. This is true for systems like Landsat, Sentinel 1 and 2, and
aerial imagery such as NAIP.

Machine learning models operate on much smaller image sizes. Many use
256x256 pixels, and the largest inputs are in the range of 1000 pixels.

This poses a challenge to modelers, as they have to cut the larger scenes
into pieces before passing them to their models. The smaller image snippets
are typically referred to as "chips". A term we will use throughout this
documentation.

Creating imagery chips tends to be a tedious and slow process, and it is
specific for each model. Models will have different requirements on image
sizes, datatypes, and the spectral bands to include. A set of chips that
works for one model might be useless for the next.

Systemizing how chips are tracked, and making the chip creation more dynamic
is a way to work around these difficulties. This is the goal fo stacchip. It
presents an approach that leverages cloud optimized technology to make chipping
simpler, faster, and less static.

## Overview

Stacchip relies on three cloud oriented technologies. Cloud Optimized Geotiffs
(COG), Spatio Temporal Asset Catalogs (STAC), and GeoParquet. Instead of pre-creating millions of files of a fixed size, chips are indexed first in tables, and then created dynamically from the index files when needed. The imagery data itsel is kept in its original format and referenced in STAC items.

Creating chips with stacchip is composed of two steps:

1. Create a stacchip index from a set of STAC
2. Dynamically create pixel arrays for any chip in the stacchip index

Indexes can be created separately for different imagery sources, and combined
into larger indexes when needed. This makes mixing different imagery sources
simple, and allows for flexibility during the modeling process, as imagery sources
can be added and removed by only updating the combined index.

The mechanism is purposefully kept as generic as possible. The index creation
is done based on a STAC item alone, no other input is needed. Obtaining image
data for a chip that is registered in a stacchip index only requires a few
lines of code.

For more information, please consult the [documentation](https://clay-foundation.github.io/stacchip/)


## Build and release

The following steps to release the latest version

```bash
tag=0.1.34
hatch version $tag
git commit -am "Bump version number"
git push
git tag $tag
git push origin $tag
rm -rf dist
python -m build
python3 -m twine upload --repository testpypi dist/*
python3 -m twine upload --repository pypi dist/*
```
