[Pw_forum] Problem with parallel execution
akohlmey at gmail.com
Sun Feb 5 04:05:08 CET 2012
On Sat, Feb 4, 2012 at 8:37 PM, Elie M <elie.moujaes at hotmail.co.uk> wrote:
> Dear all,
> I am running a parallel execution (pw.x) on a SLURM LINUX interface and once
> I run the command sbatch filename.srm, the calculation starts running and
> then stops with the follwing error:
> "mpiexec_veredas5: cannot connect to local mpd
this is a local system setup issue and has nothing to do
with quantum espresso. i would recommend to talk to your
local system administrator or somebody else that has
experience running in parallel on your local machine.
failing that, you may try to get help on the MPICH
mailing list and/or study the MPICH2 documentation
very carefully and first try running a small (trivial)
MPI test program. once you figured out how to make
that work, it should be straightforward to get Q-E to run.
> (/tmp/mpd2.console_sushil); possible causes:
> 1. no mpd is running on this host
> 2. an mpd is running but was started without a "console" (-n option)
> In case 1, you can start an mpd on this host with:
> mpd &
> and you will be able to run jobs just on this host.
> For more details on starting mpds on a set of hosts, see
> the MPICH2 Installation Guide."
> I saw a previous message posted in 2009 about this error. I followed what
> prof. andrea did: I created a file elie.mpd.hosts and included one line in
> it (localhosts) then ran the coomad mpdboot -f ~/elie.mpd.hosts and run
> sbatch command again but in 3 seconds time, it stops with the same error.
> can anyone help..
> N.B: veredas 5 is the node at which I am executing the command but whatever
> node I try on , I get the same error
> Elie Moujaes
> University of Nottingham
> NG7 2RD
> Pw_forum mailing list
> Pw_forum at pwscf.org
Dr. Axel Kohlmeyer
akohlmey at gmail.com http://goo.gl/1wk0
College of Science and Technology
Temple University, Philadelphia PA, USA.
More information about the users