#The dataloader, train_dataloader, does not have many workers which may be a bottleneck


heavy niche

I'm seeing this message when I run TI (textual inversion) training. The full message:
The dataloader, train_dataloader, does not have many workers which may be a bottleneck. Consider increasing the value of the `num_workers` argument` (try 10 which is the number of cpus on this machine) in the `DataLoader` init to improve performance.

I'm on an M1 Max MacBook Pro. I'm not sure how to make the change it's recommending. Would this actually improve training performance on my machine?

I'm also curious whether it's possible to parallelize GPU work (if that's not happening already). In Activity Monitor, I can tell that multiple CPU cores are in use when a process shows >100% CPU. But I never see anything exceed 100% GPU, even though the GPU in this machine has 24 cores. This is probably just something fundamental I'm not understanding, but it makes me wonder whether InvokeAI is underutilizing my hardware.

icy summit

This link will walk you through what to change if you want to go that direction.
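
For anyone who lands here without the link: the warning is asking for a larger `num_workers` value where the PyTorch `DataLoader` is created. The thread doesn't say where InvokeAI's TI script builds its loader, so this is only a generic sketch. The helper names `suggest_num_workers` and `build_train_loader` are made up for illustration, and reserving two cores for the main process is an assumption, not a recommendation from InvokeAI or PyTorch.

```python
import os
from typing import Optional


def suggest_num_workers(reserve: int = 2, cpu_count: Optional[int] = None) -> int:
    """Pick a worker count that leaves `reserve` cores free (hypothetical helper)."""
    total = cpu_count if cpu_count is not None else (os.cpu_count() or 1)
    return max(0, total - reserve)


def build_train_loader(dataset, batch_size: int = 1, num_workers: Optional[int] = None):
    """Create a DataLoader with an explicit num_workers (hypothetical helper)."""
    # Imported lazily so suggest_num_workers() works without PyTorch installed.
    from torch.utils.data import DataLoader

    workers = suggest_num_workers() if num_workers is None else num_workers
    return DataLoader(
        dataset,
        batch_size=batch_size,
        shuffle=True,
        num_workers=workers,  # the argument the warning refers to
        # persistent_workers is only valid when num_workers > 0
        persistent_workers=workers > 0,
    )
```

On a 10-CPU machine like the one in the warning, `suggest_num_workers()` would return 8. On macOS, worker processes are started with the `spawn` method, so the script that creates the loader generally needs an `if __name__ == "__main__":` guard. Whether more workers speed up training depends on whether data loading is actually the bottleneck; with a small TI image set it may make little difference.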