PEST: Preprocessing Engine for Spherinator Training
PEST preprocess simulation data and generate training data for Spherinator & HiPSter, including arbitrary single- and multi-channel images, 3D PPP and PPV cubes, and point clouds.
Installation
PEST can be installed via pip:
pip install astro-pest
Converters
PEST provides converters to transform data from various formats into the Apache Parquet format, which is used internally by Spherinator and HiPSter. The converters can handle different types of data, such as CSV files, FITS images, and more.
Example 1: Convert Gaia CSV to Parquet
from pest import GaiaConverter
gaia_converter = GaiaConverter(
with_flux_error=True,
number_of_workers=1,
)
gaia_converter.convert_all("data/gaia/csv", "data/gaia/parquet")
Example 2: Convert IllustrisTNG SKIRT fits images to Parquet
from pest import FitsConverter
FitsConverter(image_size=128).convert_all(
"data/illustris/fits/TNG100/sdss/snapnum_099/data", "data/illustris/parquet"
)
Coming soon: Generators
PEST provides generators to create training data for Spherinator and HiPSter from cosmological simulations.