[SCore-users-jp] [SCore-users] ia32/ia64 cluster
Nick Birkett
nrcb @ streamline-computing.com
2003年 5月 30日 (金) 06:24:31 JST
We need some help on configuring mixed clusters.
Here is our test system:
8 dual Xeon ia32 cluster nodes.
1 quad Itanium2 cluster node.
Score 5.4
Network = gigabit
The Xeon cluster has been running Score MPI jobs for the last 2 months.
The Itanium has kernel 2.4.19-1SCORE patched and compiled for Itanium as in
user documents.
The Itanium has been added, and is running SCore multiprocessor kernel (we have run several OpenMP codes and it works fine).
The front end server is called server and is a Xeon 32 bit.
scout works fine:
server nrcb:$ cat hosts
itanium01
server nrcb:$ scout -F hosts
done.
SCOUT: session started.
server nrcb:$ scout uname -a
[itanium01]:
Linux tiger4.streamline 2.4.19-1SCORE_ia64 #2 SMP Mon May 26 12:36:20 PDT 2003 ia64 unknown
server nrcb:$
Also mpi compilers work fine on Itanium:
[nrcb @ tiger4 mpi]$ mpif77 -compiler intel7 -O3 -w -o jacobi_mpi_64 jacobi_mpi_param.f
program JACOBI
external subroutine OUTPUT
external function UEXACT
external function FEXT
external subroutine INITIALISE
external subroutine ERROR
external subroutine ITERATE
2057 Lines Compiled
[nrcb @ tiger4 mpi]$ ./jacobi_mpi
<0:0> SCORE: One local node ready.
Running with nprocs= 1
Array size nxg,nyg = 4096 4096
Iteration count = 128
cpus= 1 : Iteration = 1 2.24994472335565D+015
cpus= 1 : Iteration = 2 2.24924498762084D+015
cpus= 1 : Iteration = 3 2.24870040652968D+015
cpus= 1 : Iteration = 4 2.24823845276853D+015
cpus= 1 : Iteration = 5 2.24783004116974D+015
cpus= 1 : Iteration = 6 2.24745998334097D+015
cpus= 1 : Iteration = 7 2.24711915332825D+015
So code runs both on Itanium and Xeon (2 binaries).
I have copied 32 bit and 64 bit binaries to:
/opt/score/bin/bin.ia64-redhat-linux2_4/jacobi_mpi.exe
/opt/score/bin/bin.i386-redhat7-linux2_4/jacobi_mpi.exe
on itanium 64 bit and server 32 bit
and set link for .wrapper :
[nrcb @ tiger4 mpi]$ ls -al jacobi_mpi
lrwxrwxrwx 1 nrcb streamc 23 May 29 11:46 jacobi_mpi -> /opt/score/bin/.wrapper
(server and Itanium share a common user filesystem via nfs).
I have an entry for scorehosts.db for Itanium:
server mpi:$ grep IA /opt/score/etc/scorehosts.db
itanium01.streamline HOST_8 network=gigabit,shmem0,shmem1,shmem2,shmem3 group=_scoreall_,IA64,SHMEM smp=4 MSGBSERV
server mpi:$
scoreboard and msgbserv services are restarted.
Running a Xeon application on a Xeon node works fine:
server mpi:$ scout -F hosts -e scrun -nodes=2 ./jacobi_mpi
done.
FEP: Unable to connect with SCore-D (comp07)
FEP:WARNING checkpoint option is ignored in single-user mode.
SCore-D 5.4.0 connected.
<0:0> SCORE: 2 nodes (1x2) ready.
Running with nprocs= 2
Array size nxg,nyg = 4096 4096
Iteration count = 128
Running with nprocs= 2
Trying to run Itanium application from Xeon server gives this:
server mpi:$ scout -F hosts -e scrun -nodes=2 ./jacobi_mpi
done.
FEP: Unable to connect with SCore-D (comp07)
FEP:WARNING checkpoint option is ignored in single-user mode.
<0> SCore-D:WARNING Unable to open a network configuration file (2):
network='gigabit', attribute='-config:file'
<0> SCore-D:ERROR No PM device opened.
Itanium has these modules loaded:
pm_shmem 42976 0 (unused)
pm_ethernet_dev 148296 0 (unused)
pm_memory 19216 0 [pm_shmem pm_ethernet_dev]
Do scoreboard and pm config files need to be running on Itanium as well as Xeon
main server ?
Any help appreciated.
Many thanks,
Nick
_______________________________________________
SCore-users mailing list
SCore-users @ pccluster.org
http://www.pccluster.org/mailman/listinfo/score-users
SCore-users-jp メーリングリストの案内