[SCore-users-jp] kernel 不具合?

vqm_mp vqm_mp @ yahoo.co.jp
2006年 9月 12日 (火) 16:41:43 JST


お世話様です.明大の鈴木です.

2重起動,実行にならないように注意しながら,PM_DEBUGを
1にして,
  scrun -nodes=4,scoredtrace=100 ./a.out
を行いました.何も反応がない(プロンプトが返ってこず,
scrun.exeが動いたままの状態になっている)か,あるいは
以下の出力になります.

通常の実行
  scrun -nodes=4 ./a.out
においては,正常に動くか,何も反応がないか,あるいは,
  [root @ server test]# scrun -nodes=4 ./a.out
  SCore-D 5.8.3 connected.
  <0> SCORE: Program signaled (SIGSEGV).
となります.

たびたび申し訳ございませんが,もう一度,診断をお願い
します.

<1> SCore-D:DEBUG fd_max(NULL) = 199
<0> SCore-D:DEBUG fd_max(NULL) = 199
<3> SCore-D:DEBUG fd_max(NULL) = 199
<2> SCore-D:DEBUG fd_max(NULL) = 199
<3> SCore-D:TRACE(../fep.cc:458)
<3> SCore-D:TRACE(../fep.cc:468)
<3> SCore-D:DEBUG control=(null)
SCore-D 5.8.3 connected.
<3> SCore-D:DEBUG >> user_control
<0> SCore-D:DEBUG >> createSubjob(JID=1,subjobID=0)
<3> SCore-D:DEBUG
isnot_ready_to_run(jid=1,wchan=0,gchan=0,temp=0,death=0)
<3> SCore-D:DEBUG
reset_wchan(jid=1,wchan=0,gchan=0,temp=0,death=0)
<3> SCore-D:DEBUG run_fep(jid=1,status=1,kill=0)
<0> SCore-D:DEBUG << createSubjob(JID=1,subjobID=0)
<0> SCORE-D:DEBUG set_process_group_id(13381,13381)
<0> SCore-D:DEBUG set_process_group_id(13381,13381)
<3> SCore-D:DEBUG fep_stopped(key=406038567,jid=1,uid=0)
<1> SCORE-D:DEBUG set_process_group_id(13373,13373)
<1> SCore-D:DEBUG set_process_group_id(13373,13373)
<3> SCore-D:DEBUG
isnot_ready_to_run(jid=1,wchan=0,gchan=0,temp=0,death=0)
<3> SCore-D:DEBUG
reset_wchan(jid=1,wchan=0,gchan=0,temp=0,death=0)
<3> SCore-D:DEBUG run_fep(jid=1,status=3,kill=0)
<2> SCORE-D:DEBUG set_process_group_id(13354,13354)
<2> SCore-D:DEBUG set_process_group_id(13354,13354)
<3> SCORE-D:DEBUG set_process_group_id(13391,13391)
<3> SCore-D:DEBUG set_process_group_id(13391,13391)
<3> SCore-D:DEBUG TSS timer STARTS (jid=1)
<3> SCore-D:DEBUG wakeup_job(jid=1,ident=1)
<3> SCore-D:DEBUG TSS timer EXPIRES (jid=1)
<3> SCore-D:DEBUG fep_stopped(key=406038567,jid=1,uid=0)
<3> SCore-D:DEBUG
isnot_ready_to_run(jid=1,wchan=0,gchan=1,temp=0,death=0)
<3> SCore-D:DEBUG
reset_wchan(jid=1,wchan=0,gchan=1,temp=0,death=0)
<3> SCore-D:DEBUG run_fep(jid=1,status=3,kill=0)
<3> SCore-D:DEBUG
check_checkpoint(ckpt_on=0,checkpointing=0,debug=0,cpu_time=10.151[S],next=0.0[m])
<3> SCore-D:DEBUG TSS timer STARTS (jid=1)
<0> SCORE-D:DEBUG putenv
LD_LIBRARY_PATH=/opt/score/deploy/lib.i386-fedoracore3-linux2_6
<1> SCORE-D:DEBUG putenv
LD_LIBRARY_PATH=/opt/score/deploy/lib.i386-fedoracore3-linux2_6
<3> SCore-D:DEBUG wakeup_job(jid=1,ident=1)
<2> SCORE-D:DEBUG putenv
LD_LIBRARY_PATH=/opt/score/deploy/lib.i386-fedoracore3-linux2_6
<3> SCORE-D:DEBUG putenv
LD_LIBRARY_PATH=/opt/score/deploy/lib.i386-fedoracore3-linux2_6
<1> SCORE-D:DEBUG <0> SCORE-D:DEBUG <3> SCORE-D:DEBUG
umask=022
<3> SCORE-D:DEBUG
exec(/var/scored/singleuser/0/jobs/jid-1/a.out.1=./a.out,(null))
<1> SCore-D:TRACE(../idle.cc:628) fd_syscall is closed
<1> SCore-D:TRACE(../idle.cc:679) fd_socket
<1> SCore-D:DEBUG IDLE:FD_SYSCALL:-1=>203
<1> SCore-D:DEBUG IDLE:FD_SCWAIT:-1=>203
<1> SCore-D:TRACE(../idle.cc:679) fd_socket
<1> SCore-D:DEBUG IDLE:FD_SAVE:200=>205
<1> SCore-D:DEBUG IDLE:FD_RSTR:201=>205
<1> SCore-D:TRACE(../idle.cc:679) fd_socket
<1> SCore-D:TRACE(../idle.cc:742) IDLE:FD_NETWORK
<2> SCORE-D:DEBUG umask=022
<1> SCORE-D:DEBUG
exec(/var/scored/singleuser/0/jobs/jid-1/a.out.1=./a.out,(null))
umask=022
<0> SCORE-D:DEBUG
exec(/var/scored/singleuser/0/jobs/jid-1/a.out.1=./a.out,(null))
umask=022
<2> SCORE-D:DEBUG
exec(/var/scored/singleuser/0/jobs/jid-1/a.out.1=./a.out,(null))
<1> SCore-D:DEBUG score_send_fd()
<1> SCore-D:DEBUG score_send_fd()
<2> SCore-D:TRACE(../idle.cc:628) fd_syscall is closed
<0> SCore-D:TRACE(../idle.cc:628) fd_syscall is closed
<0> SCore-D:TRACE(../idle.cc:679) fd_socket
<0> SCore-D:DEBUG IDLE:FD_SYSCALL:-1=>204
<0> SCore-D:DEBUG IDLE:FD_SCWAIT:-1=>204
<0> SCore-D:TRACE(../idle.cc:679) fd_socket
<0> SCore-D:DEBUG IDLE:FD_SAVE:201=>206
<0> SCore-D:DEBUG IDLE:FD_RSTR:202=>206
<0> SCore-D:TRACE(../idle.cc:679) fd_socket
<0> SCore-D:TRACE(../idle.cc:742) IDLE:FD_NETWORK
<0> SCore-D:DEBUG score_send_fd()
<0> SCore-D:DEBUG score_send_fd()
<2> SCore-D:TRACE(../idle.cc:679) fd_socket
<2> SCore-D:DEBUG IDLE:FD_SYSCALL:-1=>203
<2> SCore-D:DEBUG IDLE:FD_SCWAIT:-1=>203
<2> SCore-D:TRACE(../idle.cc:679) fd_socket
<2> SCore-D:DEBUG IDLE:FD_SAVE:200=>205
<2> SCore-D:DEBUG IDLE:FD_RSTR:201=>205
<2> SCore-D:TRACE(../idle.cc:679) fd_socket
<2> SCore-D:TRACE(../idle.cc:742) IDLE:FD_NETWORK
<2> SCore-D:DEBUG score_send_fd()
<2> SCore-D:DEBUG score_send_fd()
<3> SCore-D:TRACE(../idle.cc:628) fd_syscall is closed
<3> SCore-D:TRACE(../idle.cc:679) fd_socket
<3> SCore-D:DEBUG IDLE:FD_SYSCALL:-1=>205
<3> SCore-D:DEBUG IDLE:FD_SCWAIT:-1=>205
<3> SCore-D:TRACE(../idle.cc:679) fd_socket
<3> SCore-D:DEBUG IDLE:FD_SAVE:202=>207
<3> SCore-D:DEBUG IDLE:FD_RSTR:203=>207
<3> SCore-D:TRACE(../idle.cc:679) fd_socket
<3> SCore-D:TRACE(../idle.cc:742) IDLE:FD_NETWORK
<3> SCore-D:DEBUG score_send_fd()
<3> SCore-D:DEBUG score_send_fd()
<0:0> SCORE: 4 nodes (4x1) ready.
hello, world (from node 1)
<1> SCore-D:TRACE(../idle.cc:628) fd_syscall is closed
<0> SCore-D:TRACE(../idle.cc:628) fd_syscall is closed
hello, world (from node 3)
<3> SCore-D:TRACE(../idle.cc:628) fd_syscall is closed
<2> SCore-D:TRACE(../idle.cc:628) fd_syscall is closed
hello, world (from node 0)
<3> SCore-D:DEBUG fep_stopped(key=406038567,jid=1,uid=0)
hello, world (from node 2)
<3> SCore-D:DEBUG TSS timer EXPIRES (jid=1)
<3> SCore-D:DEBUG exit_status()=0
<3> SCore-D:DEBUG <<<<<<<<<<< TERMINATED (jid=1)
>>>>>>>>>>>
<3> SCore-D:DEBUG
remove_job_file(/var/scored/singleuser/0/job-descs/jid-1)
<3> SCore-D:DEBUG >> free_fep(jid=1,node=0,exit=0x0)
<3> SCore-D:TRACE(../fep.cc:838)  free_fep()
<3> SCore-D:TRACE(../fep.cc:841)  free_fep()
<0> SCore-D:TRACE(../fepio.cc:443) >> flush_fepio()
<0> SCore-D:DEBUG    flush_fepio(status=3)
<0> SCore-D:TRACE(../fepio.cc:466) << flush_fepio()
<0> SCore-D:TRACE(../subjob.cc:221) >> free_subjob()
<0> SCore-D:DEBUG free_pegroup(flag_dontclear=0)
<0> SCore-D:DEBUG killpg(13381,9)=3
<0> SCore-D:DEBUG >> free_pe(scio=0)
<1> SCore-D:DEBUG free_pegroup(flag_dontclear=0)
<0> SCore-D:TRACE(../pe.cc:487) >> flush_pe()
<0> SCore-D:TRACE(../pe.cc:512)    flush_pe()
<0> SCore-D:TRACE(../pe.cc:516) << flush_pe()
<3> SCore-D:TRACE(../fepio.cc:443) >> flush_fepio()
<1> SCore-D:DEBUG killpg(13373,9)=3
<1> SCore-D:DEBUG >> free_pe(scio=0)
<1> SCore-D:TRACE(../pe.cc:487) >> flush_pe()
<0> SCore-D:TRACE(../pe.cc:531)    free_pe
<0> SCore-D:TRACE(../pe.cc:535)    free_pe
<0> SCore-D:DEBUG >> close_attach_fds(netset_num=1)
<1> SCore-D:TRACE(../pe.cc:512)    flush_pe()
<0> SCore-D:DEBUG    close_attach_fds(dev=1,np=1)
<0> SCore-D:DEBUG   
close_attach_fds(dev=0,np=0,cntxt=0x81e5ed8)
<0> SCore-D:DEBUG << close_attach_fds()
<0> SCore-D:TRACE(../pe.cc:539)    free_pe
<0> SCore-D:TRACE(../pe.cc:543)    free_pe
<0> SCore-D:TRACE(../pe.cc:547)    free_pe
<0> SCore-D:TRACE(../pe.cc:551)    free_pe
<0> SCore-D:TRACE(../pe.cc:574)    free_pe
<0> SCore-D:TRACE(../pe.cc:578) << free_pe
<2> SCore-D:DEBUG free_pegroup(flag_dontclear=0)
<2> SCore-D:DEBUG killpg(13354,9)=3
<2> SCore-D:DEBUG >> free_pe(scio=0)
<2> SCore-D:TRACE(../pe.cc:487) >> flush_pe()
<2> SCore-D:TRACE(../pe.cc:512)    flush_pe()
<3> SCore-D:DEBUG    flush_fepio(status=3)
<3> SCore-D:TRACE(../fepio.cc:466) << flush_fepio()
<3> SCore-D:TRACE(../fep.cc:843)  free_fep()
<3> SCore-D:TRACE(../fep.cc:845)  free_fep()
<3> SCore-D:TRACE(../fepio.cc:443) >> flush_fepio()
<3> SCore-D:DEBUG    flush_fepio(status=3)
<3> SCore-D:TRACE(../fepio.cc:466) << flush_fepio()
<3> SCore-D:TRACE(../fep.cc:847)  free_fep()
<3> SCore-D:TRACE(../fep.cc:849)  free_fep()
<3> SCore-D:TRACE(../fep.cc:851)  free_fep()
<3> SCore-D:DEBUG free_pegroup(flag_dontclear=0)
<1> SCore-D:TRACE(../pe.cc:516) << flush_pe()
<1> SCore-D:TRACE(../pe.cc:531)    free_pe
<1> SCore-D:TRACE(../pe.cc:535)    free_pe
<1> SCore-D:DEBUG >> close_attach_fds(netset_num=1)
<1> SCore-D:DEBUG    close_attach_fds(dev=1,np=1)
<1> SCore-D:DEBUG   
close_attach_fds(dev=0,np=0,cntxt=0x81e5e80)
<1> SCore-D:DEBUG << close_attach_fds()
<3> SCore-D:DEBUG killpg(13391,9)=3
<3> SCore-D:DEBUG >> free_pe(scio=0)
<3> SCore-D:TRACE(../pe.cc:487) >> flush_pe()
<3> SCore-D:TRACE(../pe.cc:512)    flush_pe()
<3> SCore-D:TRACE(../pe.cc:516) << flush_pe()
<3> SCore-D:TRACE(../pe.cc:531)    free_pe
<3> SCore-D:TRACE(../pe.cc:535)    free_pe
<3> SCore-D:DEBUG >> close_attach_fds(netset_num=1)
<3> SCore-D:DEBUG    close_attach_fds(dev=1,np=1)
<3> SCore-D:DEBUG   
close_attach_fds(dev=0,np=0,cntxt=0x81e5f38)
<3> SCore-D:DEBUG << close_attach_fds()
<3> SCore-D:TRACE(../pe.cc:539)    free_pe
<3> SCore-D:TRACE(../pe.cc:543)    free_pe
<3> SCore-D:TRACE(../pe.cc:547)    free_pe
<3> SCore-D:TRACE(../pe.cc:551)    free_pe
<3> SCore-D:TRACE(../pe.cc:574)    free_pe
<3> SCore-D:TRACE(../pe.cc:578) << free_pe
<1> SCore-D:TRACE(../pe.cc:539)    free_pe
<1> SCore-D:TRACE(../pe.cc:543)    free_pe
<1> SCore-D:TRACE(../pe.cc:547)    free_pe
<1> SCore-D:TRACE(../pe.cc:551)    free_pe
<1> SCore-D:TRACE(../pe.cc:574)    free_pe
<1> SCore-D:TRACE(../pe.cc:578) << free_pe
<2> SCore-D:TRACE(../pe.cc:516) << flush_pe()
<2> SCore-D:TRACE(../pe.cc:531)    free_pe
<2> SCore-D:TRACE(../pe.cc:535)    free_pe
<2> SCore-D:DEBUG >> close_attach_fds(netset_num=1)
<2> SCore-D:DEBUG    close_attach_fds(dev=1,np=1)
<2> SCore-D:DEBUG   
close_attach_fds(dev=0,np=0,cntxt=0x81e5f38)
<2> SCore-D:DEBUG << close_attach_fds()
<2> SCore-D:TRACE(../pe.cc:539)    free_pe
<2> SCore-D:TRACE(../pe.cc:543)    free_pe
<2> SCore-D:TRACE(../pe.cc:547)    free_pe
<2> SCore-D:TRACE(../pe.cc:551)    free_pe
<2> SCore-D:TRACE(../pe.cc:574)    free_pe
<2> SCore-D:TRACE(../pe.cc:578) << free_pe
<0> SCore-D:TRACE(../subjob.cc:225)    free_subjob()
<0> SCore-D:TRACE(../subjob.cc:232)    free_subjob()
<0> SCore-D:DEBUG fepio_close()
<0> SCore-D:TRACE(../subjob.cc:236)    free_subjob()
<0> SCore-D:TRACE(../subjob.cc:241) << free_subjob()
<0> SCore-D:DEBUG >> finalize_host(0)
<0> SCore-D:TRACE(../scoredir.cc:389) cleanup_scored_dir()
<1> SCore-D:DEBUG >> finalize_host(0)
<0> SCore-D:DEBUG << finalize_host()
<1> SCore-D:TRACE(../scoredir.cc:389) cleanup_scored_dir()
<1> SCore-D:DEBUG << finalize_host()
<3> SCore-D:TRACE(../fep.cc:853)  free_fep()
<2> SCore-D:DEBUG >> finalize_host(0)
<2> SCore-D:TRACE(../scoredir.cc:389) cleanup_scored_dir()
<2> SCore-D:DEBUG << finalize_host()
<3> SCore-D:TRACE(../fep.cc:856)  free_fep()
<3> SCore-D:DEBUG fepio_close()
<3> SCore-D:DEBUG fds_select[199]
<3> SCore-D:TRACE(../fep.cc:862)    free_fep(jobc)
<3> SCore-D:TRACE(../fep.cc:866) << free_fep
<3> SCore-D:DEBUG >> finalize_host(0)
<3> SCore-D:TRACE(../scoredir.cc:389) cleanup_scored_dir()
<3> SCore-D:DEBUG << finalize_host()




--------------------------------------
[10th Anniversary] special auction campaign now!
http://pr.mail.yahoo.co.jp/auction/



SCore-users-jp メーリングリストの案内