Question 1 part 3

I don’t want to convert the mxnet dataset to an ndarray and then manually filter out the elements not of interest.

My first attempt was to relabel with a transform function and then supply a custom batchify function. But MXNet's design makes this painful: the default batchify only exposes the labels after it has already stacked the predictor variables into one big ndarray.
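For reference, a relabelling transform of the kind described might look like the sketch below. The label set {2, 5, 6, 7} comes from later in the thread; the mapping to contiguous ids 0..3 and the `-1` sentinel for unwanted samples are my assumptions, not something the thread specifies.

```python
# Hypothetical relabelling transform for a gluon-style dataset.
# Dataset.transform passes one (data, label) pair per sample.
LABELS_OF_INTEREST = [2, 5, 6, 7]  # assumed classes from the thread
RELABEL = {old: new for new, old in enumerate(LABELS_OF_INTEREST)}

def relabel(data, label):
    # Map the four wanted labels onto 0..3; everything else gets -1
    # so it can (in principle) be recognised and dropped later.
    return data, RELABEL.get(int(label), -1)
```

The catch, as noted below, is that dropping the `-1` samples inside batchify would leave batches of uneven size.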

How do I write a batchify function to remove the unwanted data points?

Edit: I’m aware that this won’t work because the batches would differ in size after removing unwanted elements, but I can’t think of a reasonable solution, so I’m giving up on this approach.

There are probably several ways to do this, but you can access the data directly after importing with:

train_labels = mnist_train._label
train_data = mnist_train._data

and then from there select only the indices of interest (i.e. where the label is 2, 5, 6, or 7, I believe).
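Concretely, that selection can be done with a boolean mask. The sketch below uses NumPy for clarity; with mxnet, `mnist_train._label` should already be a NumPy array, while `_data` is an NDArray that can be converted with `.asnumpy()` if plain-array indexing is needed (the helper name `filter_by_label` is mine, not an mxnet API).

```python
import numpy as np

labels_of_interest = [2, 5, 6, 7]  # classes assumed from the thread

def filter_by_label(data, labels, wanted):
    # Boolean mask over all rows: True where the label is wanted.
    mask = np.isin(labels, wanted)
    # Fancy-indexing with the mask keeps only the matching rows.
    return data[mask], labels[mask]
```

Filtering once up front like this sidesteps the variable-batch-size problem entirely, since the DataLoader never sees the unwanted rows.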

I used that method to keep only the rows of interest, but how do we feed it into the trainer?

I attempted using gdata.DataLoader(gdata.ArrayDataset(…)) (where gdata is mxnet.gluon.data), as in homework 4’s train method, to get train_iter and test_iter, because I had split the data and labels. The result was an incompatible datatype error (expected uint8 but got float32).

I checked that the data and labels are uint8, but I’m still receiving this error.

@rdutta

I ran into the same error and googled it. The error message is backward (it actually expects float32, but the data is uint8). Casting the arrays to float32 fixed it for me.
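Putting the fix together, the cast might look like the sketch below. NumPy stands in for the filtered arrays here (the array contents are synthetic placeholders); the mxnet wrapping is left as comments since it follows the same pattern as the homework code.

```python
import numpy as np

# Placeholder arrays standing in for the filtered MNIST rows
# (shape and uint8 dtype match the raw dataset).
train_data = np.random.randint(0, 256, size=(100, 28, 28), dtype=np.uint8)
train_labels = np.random.randint(0, 10, size=(100,)).astype(np.uint8)

# Cast to float32 before building the dataset; scaling the pixel
# values to [0, 1] at the same time is conventional for MNIST.
train_data = train_data.astype('float32') / 255.0
train_labels = train_labels.astype('float32')

# Then wrap as before (mxnet assumed available):
#   dataset = gdata.ArrayDataset(train_data, train_labels)
#   train_iter = gdata.DataLoader(dataset, batch_size=64, shuffle=True)
```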

See here for more info: https://stackoverflow.com/questions/49961351/mxnet-augmentations-expected-uint8-got-float32
