[QE-users] [SUSPECT ATTACHMENT REMOVED] [QE-GPU] OpenMP is not working with my compilation
Takahiro Chiba
takahiro_chiba at eis.hokudai.ac.jp
Mon Aug 2 01:00:00 CEST 2021
Dear experienced users,
I have trouble in utilizing OpenMP with my compilation. From the
output file, pw.x 6.8 recognizes "OMP_NUM_THREADS=2", but it took same
time as "OMP_NUM_THREADS=1", and according to PBS batch queue, only
100% (not 200%) of CPU is used. Therefore, QE 6.8 with GPU is not as
fast as expected.
I used nvidia HPC SDK 20.9, cuda 10.1, and Intel MKL 2021.2. The node
has two Xeon Gold 6248, one Tesla V100 32GB, and 768GB of RAM.
Benchmark results and make.inc are attached as tarball.
Could you please point out my mistake?
---Sender---
Takahiro Chiba
1st-year student at grad. school of chem. sci. and eng., Hokkaido Univ.
Expected graduation date: Mar. 2023
takahiro_chiba at eis.hokudai.ac.jp
-----
More information about the users
mailing list