[QE-users] [QE 7.0/GPU] Run built with CMake fails with cuFFT error

Robert MIJAKOVIC robert.mijakovic at lxp.lu
Mon May 23 17:23:20 CEST 2022


# Summary
QE 7.0/GPU compiled with CMake fails on our system in "routine fft_scalar_cuFFT: cft_1z_gpu (8)".

# Version
qe-7.0-ReleasePack.tgz

# Environment
## Hardware
1. 2xAMD EPYC 7452
2. 4xNVIDIA A100
3. 512 GB RAM
## Software
1. OS: Rocky Linux release 8.5 (Green Obsidian)
2. NVHPC 22.3
3. OpenMPI 4.1.3 built with NVHPC 22.3
4. CUDA 11.3.1 with Driver 470.82.01
5. libxc 5.1.5
6. CMake 3.20.1
7. M4 1.4.19

# Steps to reproduce
## Configured with:
`-DQE_ENABLE_CUDA=1 -DQE_FFTW_VENDOR=Internal -DQE_ENABLE_LIBXC=1 -DQE_ENABLE_OPENMP=1 `
## Prebuild options
`cp $EBROOTLIBXC/include/*.mod Modules/mod/qe_modules && export FPP='nvfortran -Mpreprocess -E' && export CPP='cpp -E' && export FCPP='cpp -E' && `
## make options
`make all epw`
## Execute
srun

## Input files
QEF AUSURF112 benchmark

# Observed behaviour
When example is started it fails with:
```
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Error in routine fft_scalar_cuFFT: cft_1z_gpu (8):
cufftPlanMany failed
```

# Questions
1. What do I do wrong?
2. Why is there no option to set FFTW_VENDOR to cuFFT?
3. Why it got linked against cuFFT if FFTW_VENDOR is set to Internal?

Dr. rer. nat. Robert Mijaković | HPC System Software Architect

LuxProvide
3, Op der Poukewiss | L-7795 Bissen
Grand-Duchy of Luxembourg
M (+352) 691 396 474
robert.mijakovic at lxp.lu<mailto:robert.mijakovic at lxp.lu> | www.luxprovide.lu<http://www.luxprovide.lu/>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.quantum-espresso.org/pipermail/users/attachments/20220523/6fa38adc/attachment.html>


More information about the users mailing list