MozzaVID: Mozzarella Volumetric Image Dataset

Abstract

Influenced by the complexity of volumetric imaging, there is a shortage of established datasets useful for benchmarking volumetric deep-learning models. As a consequence, new and existing models are not easily comparable, limiting the development of architectures optimized specifically for volumetric data. To counteract this trend, we introduce MozzaVID -- a large, clean, and versatile volumetric classification dataset. Our dataset contains X-ray computed tomography (CT) images of mozzarella microstructure and enables the classification of 25 cheese types and 149 cheese samples. We provide data in three different resolutions, resulting in three dataset instances containing from 591 to 37,824 images. While targeted for developing general purpose volumetric algorithms, the dataset also facilitates investigating the properties of mozzarella microstructure. The complex and disordered nature of food structures brings a unique challenge, where a choice of appropriate imaging method, scale, and sample size is not trivial. With this dataset, we aim to address these complexities, contributing to more robust structural analysis models and a deeper understanding of food structure.

Structural analysis

Apart from providing a general benchmark, the dataset is specifically targeted towards developing and evaluating methods for deep learning-based structural analysis. The structure of materials directly influences their functional properties, and structural analysis is an important use case for volumetric images. At the same time, CT imaging has become a common choice for food analysis, where many products, such as meat, bread, pastries, and cheese (e.g. mozzarella) are heavily defined by their structural properties. With 34% of greenhouse gas emissions linked to food, understanding its structural properties is crucial for developing environmentally friendly alternatives to known structured foods that are also pleasant to eat.

BibTeX

@misc{pieta2024b, title={MozzaVID: Mozzarella Volumetric Image Dataset}, author={Pawel Tomasz Pieta and Peter Winkel Rasmussen and Anders Bjorholm Dahl and Jeppe Revall Frisvad and Siavash Arjomand Bigdeli and Carsten Gundlach and Anders Nymark Christensen}, year={2024}, howpublished={arXiv:2412.04880 [cs.CV]}, eprint={2412.04880}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2412.04880}, }

MozzaVID: Mozzarella Volumetric Image Dataset

Mozzarella microstructure in example coarse and fine-grained classes. MozzaVID is a benchmark volumetric (3D) classification dataset containing mozzarella CT images at three dataset splits (sizes) and two classification targets.

Abstract

We use the 590 high-resolution synchrotron scans of the mozzarella structure to create three dataset instances. The raw volume is downscaled and split, ensuring that in each case the final volumes have the shape of 192 cubed. The three instances contain 591, 4.728, and 37.824 volumes, respectively.

Structural analysis

We use the experimental design of the mozzarella types to express the similarity between classes. The parameter space is compressed with PCA and colored to enable further analysis.

BibTeX