RTX 3070 + GPU version


#1

How do I run an RTX 3070 with the Deep Art Effects GPU version? It does not seem to work with either CUDA 9.0 or CUDA 11.2, even though I followed all the instructions.


#2

Which instructions?

https://peardox.com/cuda-cudnn-windows-10/

DAE GPU will only work with CUDA 9.0

If you’ve got 11.2 installed, changing your CUDA_PATH environment variable may help (if it’s not already correct)

In Windows Explorer, right-click on ‘This PC’ and select Properties, then click ‘Advanced System Settings’ in the left menu and finally ‘Environment Variables’ in the System Properties window. Now check/alter your CUDA_PATH in the System Variables section of the Environment Variables window.
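
If you would rather check the variable from a script than click through the dialogs, here is a minimal sketch. It assumes Python is installed, and the helper name `cuda_env` is made up for illustration. It only reads the environment of the current process, so open a new terminal after changing the variable. Versioned entries such as CUDA_PATH_V9_0 are typically created by the CUDA installer on Windows, but treat that as an assumption and check what your machine actually has.

```python
import os

def cuda_env(environ=os.environ):
    """Return every environment variable whose name starts with CUDA_PATH."""
    return {
        name: value
        for name, value in environ.items()
        if name.upper().startswith("CUDA_PATH")
    }

# Print what the current process sees; an empty result means CUDA_PATH is not set.
for name, value in sorted(cuda_env().items()):
    print(f"{name} = {value}")
```

If CUDA_PATH points at the 11.2 folder while DAE expects 9.0, that mismatch is the thing to change in the dialog described above.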


#3

I will try this, thank you!


#4

Unfortunately, it does not work. It keeps trying to render indefinitely but never actually starts.


#5

Try a smaller image

One thing I do when having problems like this is to work out some rough limits to the image size.

  1. TestSize = 1024
  2. Start with an image that’s TestSize x TestSize
  3. Try rendering it with DAE
  4. Did it work?
    Yes) Make TestSize = TestSize + (TestSize / 2), go to (2)
    No) Make TestSize = TestSize - (TestSize / 2), restart DAE, go to (2)

At some point TestSize will be a small enough number to make repeated testing pointless

You can use the same image over and over for testing. Start with a big original and keep on resizing it to TestSize x TestSize and saving that version to test with

The above method is known as a binary chop. You should get to a sensible number in under 10 tries (actually, you’ll most likely get a number a lot faster than that, 10 is the number of times to get it down to 1 pixel accuracy)

Note - the choice of 1024 to start is very deliberate - I already know 1540x1540 will fail if 1024x1024 passes.
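
The search above can be sketched in code. This is a step-halving variant rather than the exact rule in the list: it starts at 1024 with a step of 512 and halves the step after every try, which is what gets you down to 1 pixel accuracy. The `renders_ok` callback is hypothetical; in practice it stands for "resize the test image to size x size, render it in DAE, and report whether it worked". The sketch also assumes that if one size fails, every larger size fails too.

```python
from typing import Callable

def find_max_size(renders_ok: Callable[[int], bool], start: int = 1024) -> int:
    """Return the largest working size below 2 * start, or 0 if none works.

    Assumes renders_ok is monotonic: once a size fails, all larger sizes fail.
    """
    size = start
    step = start // 2
    best = 0
    while step >= 1:
        if renders_ok(size):
            best = max(best, size)  # remember the largest size that worked
            size += step            # try something bigger
        else:
            size -= step            # try something smaller
        step //= 2                  # halve the step after every try
    # The loop stops before testing the final candidate, so check it here.
    if size > 0 and renders_ok(size):
        best = max(best, size)
    return best

# Stand-in for real DAE renders: pretend anything up to 1300 pixels works.
print(find_max_size(lambda size: size <= 1300))
```

With a start of 1024 the search cannot report anything above 2047, so pass a larger `start` if 1024 and everything the search tries above it keeps passing.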


#6

I tried. I believe the reason is that the RTX 3070 is not supported by CUDA 9.0 (at least, the installer reported that no supported GPU was found). I tested with 400x400px images and it still does not work.


#7

Oh well, I suppose DAE GPU had to eventually run into this issue.

To be fair it’s equally NVIDIA’s fault for not making CUDA backwards compatible

I guess you’ll just have to take up Bitcoin mining on your 3070 and buy a 1080 with the profits :)


#8

I just got 10.0 working


#9

I’ve now got an RTX 3060-equipped laptop

CUDA is not supported even with 10.0 installed - I wasn’t sure what would happen; the answer is, at present, absolutely nothing (won’t render - just sulks)

The logs look like this…

2021-10-09 00:48:58.219019: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cudart64_100.dll
2021-10-09 00:48:58.342943: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library nvcuda.dll
2021-10-09 00:48:59.561645: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1639] Found device 0 with properties:
name: NVIDIA GeForce RTX 3060 Laptop GPU major: 8 minor: 6 memoryClockRate(GHz): 1.702
pciBusID: 0000:01:00.0
2021-10-09 00:48:59.561715: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cudart64_100.dll
2021-10-09 00:48:59.563926: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cublas64_100.dll
2021-10-09 00:48:59.566039: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cufft64_100.dll
2021-10-09 00:48:59.567147: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library curand64_100.dll
2021-10-09 00:48:59.570950: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cusolver64_100.dll
2021-10-09 00:48:59.573412: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cusparse64_100.dll
2021-10-09 00:48:59.580361: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cudnn64_7.dll
2021-10-09 00:48:59.580453: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1767] Adding visible gpu devices: 0

At this point DAE stops doing anything…

The ‘gpu devices: 0’ actually means it found one (numbering starts at zero; it’d say -1 if nothing was found, AFAIK)

So, a GPU was discovered but TF didn’t know what to do with it as it’s a 3060 (CUDA 10.0 only supports up to the 2xxx series). CUDA 11.x appears to be required for a 3xxx card.
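
The "major: 8 minor: 6" in the log is the card’s compute capability, and it can be compared against what each CUDA release targets. The sketch below pulls the value out of a log line and checks it against a small lookup table. The table reflects my understanding of NVIDIA’s published limits (9.0 up to 7.0, 10.0 up to 7.5, 11.0 up to 8.0, 11.2 up to 8.6), and the function names are made up for illustration. It only covers architectures a toolkit targets natively, so treat it as a rough guide rather than a definitive compatibility check.

```python
import re
from typing import Optional, Tuple

# Highest compute capability each toolkit release targets natively
# (assumed from NVIDIA's published tables; verify before relying on it).
MAX_NATIVE_CC = {
    "9.0": (7, 0),
    "10.0": (7, 5),
    "11.0": (8, 0),
    "11.2": (8, 6),
}

def parse_compute_capability(log_line: str) -> Optional[Tuple[int, int]]:
    """Extract (major, minor) from a TensorFlow 'Found device' log line."""
    match = re.search(r"major:\s*(\d+)\s+minor:\s*(\d+)", log_line)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))

def natively_supported(cuda_version: str, cc: Tuple[int, int]) -> bool:
    """True if the given toolkit release targets this compute capability."""
    return cc <= MAX_NATIVE_CC[cuda_version]

line = ("name: NVIDIA GeForce RTX 3060 Laptop GPU major: 8 minor: 6 "
        "memoryClockRate(GHz): 1.702")
cc = parse_compute_capability(line)
for version in MAX_NATIVE_CC:
    print(version, natively_supported(version, cc))
```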

There is a huge potential problem here in that TensorFlow 1.x (used by DAE) only goes up to CUDA 10 (and lower, like the recommended version 9). For newer cards they moved TensorFlow to version 2.x, and that is not backward compatible with TF1.

This implies that DAE will need to update to TF2 or become obsolete as 3xxx cards become the norm.

It should also be noted that CUDA 9.0 only supports 1xxx cards, so without a CUDA 10.0 version of TensorFlow it seems possible that 2xxx cards may not function out of the box (I have CUDA 10.0 versions of TF but no 2xxx card to test on)

More investigation will be required as I’ve not had the new laptop long enough to look into the situation in depth.

I’m going to compile up a CUDA 11.2 version of TF2 just to see what happens when I have some free time (still installing everything ATM)