Skip to Content.
Sympa Menu

charm - [charm] Question about running BigSim on multiple machines

charm AT lists.cs.illinois.edu

Subject: Charm++ parallel programming system

List archive

[charm] Question about running BigSim on multiple machines


Chronological Thread 
  • From: Xuehan Xu <xxhdx1985126 AT gmail.com>
  • To: charm <charm AT cs.uiuc.edu>
  • Subject: [charm] Question about running BigSim on multiple machines
  • Date: Sat, 12 Nov 2011 11:30:58 +0800
  • List-archive: <http://lists.cs.uiuc.edu/pipermail/charm>
  • List-id: CHARM parallel programming system <charm.cs.uiuc.edu>

Dear Sirs:
I tried to run BigSim on two machines, but the following error came up:

[root@localhost
Cjacobi3D]# ./charmrun +p2 ./jacobi 4 4 4 +x4 +y4 +z4
++remote-shell ssh
Charmrun> started all node programs in 1.226 seconds.
Converse/Charm++ Commit ID:
Warning> Randomization of stack pointer is turned on in kernel, thread
migration may not work! Run 'echo 0 >
/proc/sys/kernel/randomize_va_space' as root to disable it, or try run
with '+isomalloc_sync'.
Charm++> scheduler running in netpoll mode.
BG info> Simulating 4x4x4 nodes with 1 comm + 1 work threads each.
BG info> Network type: bluegene.
alpha: 1.000000e-07 packetsize: 1024
CYCLE_TIME_FACTOR:1.000000e-03.
CYCLES_PER_HOP: 5 CYCLES_PER_CORNER: 75.
BG info> cpufactor is 1.000000.
BG info> floating point factor is 0.000000.
BG info> Using WallTimer for timing method.
CharmLB> Load balancer ignores processor background load.
------------- Processor 1 Exiting: Caught Signal ------------
Signal: segmentation violation
Suggestion: Try running with '++debug', or linking with '-memory
paranoid' (memory paranoid requires '+netpoll' at runtime).
[1] Stack Traceback:
[1:0] [0x81c420]
[1:1] AMPI_threadstart+0x4b [0x8141f7d]
[1:2] /root/charm_emu/net-linux-bigemulator/examples/ampi/Cjacobi3D/./jacobi
[0x8182c01]
[1:3] /root/charm_emu/net-linux-bigemulator/examples/ampi/Cjacobi3D/./jacobi
[0x81a3227]
[1:4] qt_error+0 [0x82a9db2]
Fatal error on PE 1> segmentation violation

The .nodelist file that I wrote is like this:

group main
host 192.168.5.169
host 192.168.5.172

But if I run two BigSim process on the same machine, it can end
normally.
What should I do to deal with it?

Thanks:-)



  • [charm] Question about running BigSim on multiple machines, Xuehan Xu, 11/11/2011

Archive powered by MHonArc 2.6.16.

Top of Page