posted on 2023-11-30, 08:48authored byDavid TowersDavid Towers, Rob Geada, Amir Atapour-Abarghouei, Andrew Stephen McGough
<p dir="ltr">Dataset containing the images and labels for the AddNIST data used in the CVPR NAS workshop Unseen-data challenge under the codename "Adaline"</p><p dir="ltr">The AddNIST dataset is a constructed dataset from MNIST Images. The intention of this dataset is to require machine learning models to do more than just image classification but also perform a calculation, in this case addition. For each image, three MNIST Images were randomly chosen and combined together through the colour channels, resulting in a three colour-channel image so each MNIST image represents one colour channel. </p><p dir="ltr">The data is in a channels-first format with a shape of (n, 3, 28, 28) where n is the number of samples in the corresponding set (45,000 for training, 15,000 for validation, and 10,000 for testing).</p><p dir="ltr">There are twenty classes in the dataset, with 3,500 examples of each, distributed evenly between the three subsets.</p><p dir="ltr">The label of each image is generated using the formula "(r + b + g) - 1" where r, g, and b are the red, green, and blue colour channels respectively. An example of an AddNIST Image would be a rgb configuation of 3, 7, and 4 respectively, which would result in a label of 13 ((3 + 7 + 4) - 1).</p>