Petastorm is a library that enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can also be used from pure Python code.
For example, to trigger installation of the GPU version of TensorFlow together with OpenCV, use the following pip command:
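A minimal sketch of that command, assuming the optional dependency groups are named `opencv` and `tf_gpu` as in Petastorm's `setup.py` (check the extras list for the version you install, since the names may differ):

```shell
# Assumption: the extras are named "opencv" and "tf_gpu"; verify against your Petastorm version.
# The quotes keep shells such as zsh from interpreting the square brackets.
pip install "petastorm[opencv,tf_gpu]"
```

Extras are comma-separated inside the brackets, so further optional groups can be requested in the same command.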
Petastorm supports extensible data codecs. These enable a user to use one of the s…