[SCore-users-jp] [SCore-users] (no subject)

master.of.brainless.things @ gmx.net master.of.brainless.things @ gmx.net
2002年 6月 26日 (水) 23:27:55 JST


sorry, for coming so late with this return mail:

We don't know if we got the way you want to do what, but we think like this:

at first copy officialname to all compute hosts:

>[root @ frontend bin]# rsh-all -g pcc -norsh rdist -c
/opt/score/bin/officialname /opt/score/bin/bin.*/officialname @host:
>comp4.cluster.domain
>comp5.cluster.domain
>comp6.cluster.domain
>comp7.cluster.domain
>comp4.cluster.domain: comp4.cluster.domain: updating host
comp4.cluster.domain
>comp5.cluster.domain: comp5.cluster.domain: updating host
comp5.cluster.domain
>comp6.cluster.domain: comp6.cluster.domain: updating host
comp6.cluster.domain
>comp7.cluster.domain: comp7.cluster.domain: updating host
comp7.cluster.domain
>comp4.cluster.domain: comp4.cluster.domain: LOCAL ERROR:
/opt/score/bin/bin.*/officialname: lstat failed: No such file or directory
>comp4.cluster.domain: comp4.cluster.domain: updating of
comp4.cluster.domain finished
>comp5.cluster.domain: comp5.cluster.domain: LOCAL ERROR:
/opt/score/bin/bin.*/officialname: lstat failed: No such file or directory
>comp5.cluster.domain: comp5.cluster.domain: updating of
comp5.cluster.domain finished
>comp6.cluster.domain: comp6.cluster.domain: LOCAL ERROR:
/opt/score/bin/bin.*/officialname: lstat failed: No such file or directory
>comp6.cluster.domain: comp6.cluster.domain: updating of
comp6.cluster.domain finished
>comp7.cluster.domain: comp7.cluster.domain: LOCAL ERROR:
/opt/score/bin/bin.*/officialname: lstat failed: No such file or directory
>comp7.cluster.domain: comp7.cluster.domain: updating of
comp7.cluster.domain finished

some errors, and this isn't really good, isn't it?!

>[root @ frontend bin]# scout -g pcc
>SCOUT: Spawning done.
>SCOUT: session started.
>[root @ frontend bin]# scout officialname `scorehosts -g pcc`
>4 hosts found.
>[comp4-7]:
>bash: officialname: command not found

now, we realize, that officialname is a symbolic link to .wrapper. on the
nodes officialname in this state was a
broken link to .wrapper.
so we run

>[root @ frontend bin]# rsh-all -g pcc -norsh rdist -c /opt/score/bin/.wrapper
/opt/score/bin/bin.*/.wrapper @host:

there still is missing something, so we copy the whole bin directory to the
hosts:
[root @ frontend bin]# rsh-all -g pcc -norsh rdist -c /opt/score/bin/* @host:

now running:
[root @ frontend bin]# scout officialname `scorehosts -g pcc`
4 hosts found.
[comp4]:
comp4.cluster.domain
client05.cluster.domain
client06.cluster.domain
client07.cluster.domain
[comp5]:
client04.cluster.domain
comp5.cluster.domain
client06.cluster.domain
client07.cluster.domain
[comp6]:
client04.cluster.domain
client05.cluster.domain
comp6.cluster.domain
client07.cluster.domain
[comp7]:
client04.cluster.domain
client05.cluster.domain
client06.cluster.domain
comp7.cluster.domain

now we see, what's the problem (which is a little bit stupid).
it seems that yp commands are wrong interpreted, so that the old /etc/hosts
is used to determine the hostnames.

so we updated the yp database, and modified hosts, and now:

[root @ frontend etc]# scout officialname `scorehosts -g pcc`
4 hosts found.
[comp4-7]:
comp4.cluster.domain
comp5.cluster.domain
comp6.cluster.domain
comp7.cluster.domain
[root @ frontend etc]# scstest -network myrinet2k
SCSTEST: BURST on myrinet2k(chan=0,ctx=0,len=16)
50 K packets.
100 K packets.
150 K packets.
200 K packets.
250 K packets.
300 K packets.
350 K packets.
400 K packets.
450 K packets.
500 K packets.
.
.
.

and the mpi demos work, too.

Really  big, big thanks to everyone (especially Mr. Kameyama, Mr. Hori and
Mr. Sumimoto) for helping and supporting us.
And we hope, that this group will dure a long time to help other
score-"newbie's" like us.

Peer Ueberholz & Alex Golks

_______________________________________________
SCore-users mailing list
SCore-users @ pccluster.org
http://www.pccluster.org/mailman/listinfo/score-users



SCore-users-jp メーリングリストの案内