[Question] Threading error after last train #284
Comments
Dear @GarryJAY502, batchgenerators is used for augmentation and data loading in nnDetection and is therefore essential for proper functionality. As the message already indicates: "RuntimeError: One or more background workers are no longer alive. Exiting. Please check the print statements above for the actual error message". The passage you posted does not contain the actual error; please provide the full error message. Best,
The error message only contains the part shown in the figure, with no further details. This happened after the last epoch of training. What tasks does nnDetection perform after this?
Dear @GarryJAY502, that is indeed curious. It may be a problem within batchgenerators, which might not shut down its workers correctly in combination with PyTorch Lightning. nnDetection does not use batchgenerators once training has finished. After training, the empirical parameters need to be determined, and whole-patient inference is performed to produce the final validation results. Best,
Thanks, Michael
If the training runs through completely, you can continue. The screenshot you posted only shows epoch 1, which is definitely not sufficient; the full schedule contains 60 epochs.
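For readers hitting the same traceback, a minimal sketch of the mechanism suspected in the replies above may help. It is not the batchgenerators implementation: `FakeWorker`, `check_once`, and `shutdown` are hypothetical names invented for illustration. The sketch assumes a results thread that polls worker liveness and treats any dead worker as a failure unless an abort flag was set beforehand, which would explain an error at shutdown with no earlier error message.

```python
import threading


class FakeWorker:
    """Hypothetical stand-in for a background worker process."""

    def __init__(self):
        self._alive = True

    def is_alive(self):
        return self._alive

    def terminate(self):
        self._alive = False


def check_once(workers, abort_event):
    """One iteration of a hypothetical results loop (an assumption, not library code).

    Returns "aborted" if shutdown was requested and "ok" if all workers are alive.
    Raises RuntimeError if a worker is dead and no shutdown was requested.
    """
    if abort_event.is_set():
        return "aborted"
    if not all(w.is_alive() for w in workers):
        abort_event.set()
        raise RuntimeError("One or more background workers are no longer alive.")
    return "ok"


def shutdown(workers, abort_event, set_flag_first):
    """Stop all workers, optionally signalling the results loop beforehand."""
    if set_flag_first:
        abort_event.set()
    for w in workers:
        w.terminate()


def error_on_shutdown(set_flag_first):
    """Return True if the results loop raises after the workers are stopped."""
    workers = [FakeWorker(), FakeWorker()]
    abort_event = threading.Event()
    shutdown(workers, abort_event, set_flag_first)
    try:
        check_once(workers, abort_event)
    except RuntimeError:
        return True
    return False
```

In this sketch, `error_on_shutdown(False)` reports the error even though nothing went wrong inside a worker, while `error_on_shutdown(True)` exits quietly. If the real cause is similar, the message at the very end of training would be harmless, but that remains an assumption until the full log is available.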
❓ Question
Hi
I have a question. In the last stage of training, there was an error when using batchgenerators. I noticed that someone had mentioned this issue before. Is there a solution now?
Exception in thread Thread-3:
Traceback (most recent call last):
File "/home/liuyvjie/opt/miniforge3/envs/nndet_venv/lib/python3.9/threading.py", line 980, in _bootstrap_inner
self.run()
File "/home/liuyvjie/opt/miniforge3/envs/nndet_venv/lib/python3.9/threading.py", line 917, in run
self._target(*self._args, **self._kwargs)
File "/home/liuyvjie/opt/miniforge3/envs/nndet_venv/lib/python3.9/site-packages/batchgenerators/dataloading/multi_threaded_augmenter.py", line 92, in results_loop
raise RuntimeError("One or more background workers are no longer alive. Exiting. Please check the print"
RuntimeError: One or more background workers are no longer alive. Exiting. Please check the print statements above for the actual error message
Exception in thread Thread-4:
Traceback (most recent call last):
File "/home/liuyvjie/opt/miniforge3/envs/nndet_venv/lib/python3.9/threading.py", line 980, in _bootstrap_inner
self.run()
File "/home/liuyvjie/opt/miniforge3/envs/nndet_venv/lib/python3.9/threading.py", line 917, in run
self._target(*self._args, **self._kwargs)
File "/home/liuyvjie/opt/miniforge3/envs/nndet_venv/lib/python3.9/site-packages/batchgenerators/dataloading/multi_threaded_augmenter.py", line 92, in results_loop
raise RuntimeError("One or more background workers are no longer alive. Exiting. Please check the print"
RuntimeError: One or more background workers are no longer alive. Exiting. Please check the print statements above for the actual error message
What is the purpose of the batchgenerators package, and will this error affect my training process or the saving of results and outputs?