While training a model I got this warning: "UserWarning: An input could not be retrieved. It could be because a worker has died. We do not have any information on the lost sample." After showing this warning, the model starts training. What does this warning mean? Is it something that will affect my training, and do I need to worry about it?
8 Answers
This is just a user warning that is usually thrown when fetching the inputs and targets during training. It appears because a timeout is set for the queuing mechanism, which is specified inside data_utils.py.
For more details you can refer to the data_utils.py file inside the keras/utils folder:
https://github.com/keras-team/keras/blob/master/keras/utils/data_utils.py
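For illustration, the mechanism looks roughly like this (a simplified sketch of the idea, not the actual Keras source):

    import queue
    import warnings

    # Simplified sketch of the queuing mechanism: worker processes put
    # prepared batches on a queue, and the main process fetches them
    # with a timeout (see keras/utils/data_utils.py for the real code).
    def get_next_batch(batch_queue, timeout=30):
        try:
            return batch_queue.get(block=True, timeout=timeout)
        except queue.Empty:
            # On a timed-out fetch, Keras warns and keeps training
            # instead of crashing.
            warnings.warn(
                'An input could not be retrieved. It could be because a '
                'worker has died. We do not have any information on the '
                'lost sample.', UserWarning)
            return None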
I got the same warning when training a model in Google Colab. The problem was that I tried to fetch the data from my Google Drive, which I had mounted to the Colab session. The solution was to move the data into Colab's working directory and use it from there. This can be done simply via !cp -r path/to/google_drive_data_dir/ path/to/colab_data_dir in the notebook; a Python sketch of the same copy is shown below. Note that you will have to do this each time a new Colab session is created.
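If you prefer plain Python over the shell magic, the same copy can be done with shutil (the paths here are placeholders for your own layout):

    import shutil

    # Hypothetical paths; adjust to your own Drive/Colab layout.
    src = '/content/drive/My Drive/dataset'
    dst = '/content/dataset'
    shutil.copytree(src, dst)  # copy the whole directory tree to local disk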
This may or may not be the problem Rahul was asking about, but I think this might be helpful to others who face the issue.
- I am using my Google Drive as storage. Where else would I put this? Colab uses Google Drive as a hard disk, right? – Apr 22, 2020 at 10:00
- Sorry, I thought that I had answered the first question already. AFAIK, opening a Google Colab session spins up a virtual machine to which you can mount your Google Drive. However, the mount is not a physical one (fast); the files need to be transferred over the internet (slow). It is this file transfer that causes the bottleneck. To avoid it, it's best to copy the files from Drive to the Colab session's disk (any folder you prefer), after which you can access them faster. – mjkvaak May 25, 2020 at 13:17
If you are running the training on a GPU, the warning can occur. You have to know that there are two processes running in parallel during fit_generator:
- the GPU trains on the image batches, step by step in each epoch;
- the CPU prepares the image batches, one batch at a time.
Since these are parallel tasks, if the CPU's throughput is lower than the GPU's, the warning occurs.
Solution:
Just set your batch_size smaller or upgrade your CPU configuration.
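As a rough sketch (the paths, sizes, and model here are placeholders, not a prescription), the batch size is set where the generator is created:

    from keras.models import Sequential
    from keras.layers import Conv2D, Flatten, Dense
    from keras.preprocessing.image import ImageDataGenerator

    # Placeholder pipeline: a smaller batch_size gives the CPU-side
    # workers less data to prepare per step, so the GPU starves less often.
    datagen = ImageDataGenerator(rescale=1. / 255)
    train_generator = datagen.flow_from_directory(
        'data/train',             # placeholder path
        target_size=(224, 224),
        batch_size=16)            # reduce this if the warning keeps appearing

    model = Sequential([
        Conv2D(8, 3, activation='relu', input_shape=(224, 224, 3)),
        Flatten(),
        Dense(train_generator.num_classes, activation='softmax'),
    ])
    model.compile(optimizer='adam', loss='categorical_crossentropy')
    model.fit_generator(train_generator, epochs=10)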
Make sure the path of the dataset you have given is correct; this definitely helps. Example: train_data_dir = "/content/drive/My Drive/Colab Notebooks/dataset"
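A quick, hypothetical check that the path actually exists before training:

    import os

    train_data_dir = "/content/drive/My Drive/Colab Notebooks/dataset"
    assert os.path.isdir(train_data_dir), f"Dataset path not found: {train_data_dir}"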
I faced the same issue while training a deep neural network on my machine using Keras, and it took me a while to figure it out. The images I was loading via keras.preprocessing's ImageDataGenerator with target_size=(256, 256) were of a lower resolution, say 100*100, and I was trying to convert them into 256*256; apparently there is no inbuilt support provided for this. As soon as I fixed the output shape of the images returned by the ImageDataGenerator, the warning vanished.
Note: the figures 100*100 and 256*256 are just for explanation.
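For reference, in the Keras API the target_size argument belongs to flow_from_directory rather than the ImageDataGenerator constructor; pulling one batch and checking its shape (a sketch with a placeholder path) confirms the generator output matches the model input:

    from keras.preprocessing.image import ImageDataGenerator

    datagen = ImageDataGenerator(rescale=1. / 255)
    generator = datagen.flow_from_directory(
        'data/train',            # placeholder path
        target_size=(256, 256),  # files on disk may be any resolution
        batch_size=32)

    # Pull one batch and verify its shape against the model's input layer.
    x_batch, y_batch = next(generator)
    print(x_batch.shape)  # expected: (32, 256, 256, 3)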
You can reduce the number of workers and the max_queue_size to solve the problem.
- May we know why reducing the number of workers and max_queue_size will solve the problem? – Fernand Mar 23, 2020 at 4:18
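Both are arguments of fit_generator (the Keras defaults are workers=1 and max_queue_size=10); a sketch, assuming a model and train_generator like those in the earlier example:

    model.fit_generator(
        train_generator,
        epochs=10,
        workers=1,           # background workers preparing the data
        max_queue_size=4,    # max number of prepared batches held in memory
        use_multiprocessing=False)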
I got this warning when I was training on a number of data samples that was smaller than the batch size.
(The training actually seemed to start, but then got stuck before even showing the progress bar for the first epoch.)
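A minimal sanity check for this case (the names and numbers are placeholders):

    n_samples = 20   # e.g. a small debugging subset
    batch_size = 32  # larger than the dataset: the generator can't fill a batch

    # Clamp the batch size so at least one full step per epoch is possible.
    batch_size = min(batch_size, n_samples)
    steps_per_epoch = max(1, n_samples // batch_size)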
I faced the same issue. When I changed my Keras version from 2.3.1 to 2.2.4, the warning disappeared, and my CUDA and cuDNN also worked normally again.
If the issue is still not resolved, see this additional reference: https://github.com/keras-team/keras/issues/13878
My system information:
OS : Win10
TensorFlow version: 1.15.0
Keras version: 2.2.4
Python version: 3.6
CUDA version: 10.0.130
cuDNN version: 7.6.5