0

We currently have a trained ResnetV250 image classification model that performs as expected on CPU and with GPU support on Turing based cards with CUDA 10.1 and cudnn 7.6.4.

When transferring this to machines with Ampere based cards, we get similar expected results for CPU runs but when enabling GPU support we get fixed value results no matter what inputs are applied.

The drivers on all cards are up to date and relevant CUDA and cudnn versions are installed. On first start up with the Ampere cards we see the Compute cache being constructed and used on subsequent classification runs.

Could there be an issue with nvcc in constructing the PTX?

0

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.