Skip to Content.
Sympa Menu

charm - RE: [charm] Obtaining Charm++ profile

charm AT lists.cs.illinois.edu

Subject: Charm++ parallel programming system

List archive

RE: [charm] Obtaining Charm++ profile


Chronological Thread 
  • From: Rob Van der Wijngaart <robv AT nvidia.com>
  • To: Ronak Buch <rabuch2 AT illinois.edu>
  • Cc: "charm AT cs.illinois.edu" <charm AT cs.illinois.edu>
  • Subject: RE: [charm] Obtaining Charm++ profile
  • Date: Wed, 13 Jun 2018 14:58:22 +0000
  • Accept-language: en-US
  • Authentication-results: illinois.edu; spf=softfail smtp.mailfrom=robv AT nvidia.com; dmarc=none header.from=nvidia.com
  • Msip_labels: MSIP_Label_6b558183-044c-4105-8d9c-cea02a2a3d86_Enabled=True; MSIP_Label_6b558183-044c-4105-8d9c-cea02a2a3d86_SiteId=43083d15-7273-40c1-b7db-39efd9ccc17a; MSIP_Label_6b558183-044c-4105-8d9c-cea02a2a3d86_Ref=https://api.informationprotection.azure.com/api/43083d15-7273-40c1-b7db-39efd9ccc17a; MSIP_Label_6b558183-044c-4105-8d9c-cea02a2a3d86_Owner=robv AT nvidia.com; MSIP_Label_6b558183-044c-4105-8d9c-cea02a2a3d86_SetDate=2018-06-06T08:50:43.0323109-07:00; MSIP_Label_6b558183-044c-4105-8d9c-cea02a2a3d86_Name=Unrestricted; MSIP_Label_6b558183-044c-4105-8d9c-cea02a2a3d86_Application=Microsoft Azure Information Protection; MSIP_Label_6b558183-044c-4105-8d9c-cea02a2a3d86_Extended_MSFT_Method=Automatic; Sensitivity=Unrestricted

Hi Ronak,

 

To follow up on yesterday’s message, I noticed at the top of ck-core/ckarray.C that it is possible to enable certain messages by editing the block shown below. I will experiment with that now, but have a few questions:

  1. DEBS(x) appears to control debug IO for send/recv/broadcast, but DEBB(x) also controls debug IO for broadcasts. Is the latter simply a subset of the former?
  2. I don’t see a similar debug block in ck.C. Is it possible to obtain similar functionality, though?

 

Thanks!

 

Rob

 

#define ARRAY_DEBUG_OUTPUT 0

 

#if ARRAY_DEBUG_OUTPUT

#   define DEB(x) CkPrintf x  //General debug messages

#   define DEBI(x) CkPrintf x  //Index debug messages

#   define DEBC(x) CkPrintf x  //Construction debug messages

#   define DEBS(x) CkPrintf x  //Send/recv/broadcast debug messages

#   define DEBM(x) CkPrintf x  //Migration debug messages

#   define DEBL(x) CkPrintf x  //Load balancing debug messages

#   define DEBK(x) CkPrintf x  //Spring Cleaning debug messages

#   define DEBB(x) CkPrintf x  //Broadcast debug messages

#   define AA "ArrayBOC on %d: "

#   define AB ,CkMyPe()

#   define DEBUG(x) x

#else

#   define DEB(X) /*CkPrintf x*/

#   define DEBI(X) /*CkPrintf x*/

#   define DEBC(X) /*CkPrintf x*/

#   define DEBS(x) /*CkPrintf x*/

#   define DEBM(X) /*CkPrintf x*/

#   define DEBL(X) /*CkPrintf x*/

#   define DEBK(x) /*CkPrintf x*/

#   define DEBB(x) /*CkPrintf x*/

#   define str(x) /**/

#   define DEBUG(x)

#endif

 

 

 

From: Rob Van der Wijngaart
Sent: Tuesday, June 12, 2018 9:23 PM
To: 'Ronak Buch' <rabuch2 AT illinois.edu>
Cc: 'charm AT cs.illinois.edu' <charm AT cs.illinois.edu>
Subject: RE: [charm] Obtaining Charm++ profile

 

Hi Ronak,

 

I’ve played around a bit with the trace, but it still does not give me quite what I need. I am trying to build a dependency graph, which requires, besides the time stamp and the source of the event, also the target. I am looking at file src/ck-core/ck.C, trying to decide where to insert the requisite print statements. Will I capture all communication events by instrumenting the big case statement in function _processHandler? Thanks!

 

Rob

 

From: Rob Van der Wijngaart
Sent: Wednesday, June 6, 2018 8:51 AM
To: 'Ronak Buch' <rabuch2 AT illinois.edu>
Cc: charm AT cs.illinois.edu
Subject: RE: [charm] Obtaining Charm++ profile

 

OK, thanks, Ronak. I’ll see how this works out once I’ve processed the graph info of my application.

 

Rob

 

From: Ronak Buch <rabuch2 AT illinois.edu>
Sent: Tuesday, June 5, 2018 4:05 PM
To: Rob Van der Wijngaart <robv AT nvidia.com>
Cc: charm AT cs.illinois.edu
Subject: Re: [charm] Obtaining Charm++ profile

 

Yes, you can basically ignore the rest of the fields (they're some mixture of ordering, IDs, performance counters, etc.). The trailing zeros are some of these fields and can usually be ignored.

 

dummy_thread_ep is basically a catchall for things that are traced that the runtime doesn't have enough information for to attribute to something else. It can be meaningful, but it could be a lot of different things depending on what exactly your program is doing, so it's hard for me to make any conclusions without more information. Usually looking at the bookending events that have meaningful attributions is enough to figure out what's going on in those sections.

 

 

 

On Tue, Jun 5, 2018 at 5:51 PM, Rob Van der Wijngaart <robv AT nvidia.com> wrote:

Thanks much, Ronak!

Below is a representative snippet from one of my log files.

The lines that start with 2 or 3 have 8 and 13 fields each. I guess I can ignore the fields that you’re not describing. A lot of the records that start with 2 or 3 refer to ENTRY CHARE 0, which refers to dummy_thread_ep, according to the sts file (the line says: “ENTRY CHARE 0 dummy_thread_ep 0 0”). Is that a meaningful mapping, and do the trailing zeroes have any significance? Thanks.

 

Rob

 

3 0 355 14898768 569 16 0 11638933

2 4 0 14898768 2950 14 0 0 14 369 0 0 0

3 0 0 14898864 2950 14 0 11638933

14 14898866 14

15 14901654 14

2 5 355 14901657 893 16 100 0 0 0 0 0 11641931

1 4 0 14901667 2951 14 0 0

3 0 355 14901669 893 16 0 11641931

2 5 355 14901669 900 16 100 0 0 0 0 0 11641931

1 4 0 14901679 2952 14 0 0

3 0 355 14901681 900 16 0 11641931

2 4 0 14901681 2952 14 0 0 14 2118 0 0 0

3 0 0 14901772 2952 14 0 11641931

2 5 355 14901775 954 16 100 0 0 0 0 0 11641931

1 4 0 14901787 2953 14 0 0

3 0 355 14901787 954 16 0 11641931

2 4 0 14901789 2953 14 0 0 14 1958 0 0 0

3 0 0 14901878 2953 14 0 11641931

2 5 355 14901880 1019 16 100 0 0 0 0 0 11641931

1 4 0 14901890 2954 14 0 0

3 0 355 14901894 1019 16 0 11641931

2 5 355 14901895 1006 16 100 0 0 0 0 0 11641931

1 4 0 14901906 2955 14 0 0

3 0 355 14901906 1006 16 0 11641931

2 4 0 14901908 2955 14 0 0 14 1653 0 0 0

3 0 0 14901993 2955 14 0 11641931

2 4 0 14901995 2955 14 0 0 14 1349 0 0 0

3 0 0 14902085 2955 14 0 11641931

2 4 0 14902086 2955 14 0 0 14 1796 0 0 0

3 0 0 14902175 2955 14 0 11641931

14 14902176 14

15 14902230 14

 

 

From: Ronak Buch <rabuch2 AT illinois.edu>
Sent: Tuesday, June 5, 2018 3:09 PM


To: Rob Van der Wijngaart <robv AT nvidia.com>
Cc: charm AT cs.illinois.edu
Subject: Re: [charm] Obtaining Charm++ profile

 

No, we don't have a good description of the record format right now other than the source code that generates it in Charm (src/ck-perf/trace-projections.C) and the source code that parses it in Projections (GenericLogReader.java). I'll provide a brief summary below, but if there's something specific that you want, I can explain it.

 

In general, each line in the Projections log file represents a specific event. The ones that you're probably interested in are those beginning with 2 or 3, which represent BEGIN_PROCESSING and END_PROCESSING events (the start and end of entry methods). The third field in those lines represents the entry ID, which can be converted a function name by using the sts file (in which ENTRY CHARE lines provide a mapping between an entry ID and the actual name of the entry method). The fourth field represents the timestamp of the event (so the difference between the two values in this position for corresponding BEGIN_PROCESSING and END_PROCESSING events is the total time of the entry method). The sixth value is the PE, which indicates which PE served as that event's source. The seventh field for BEGIN_PROCESSING events is the message length.

 

Hopefully that helps.

 

Thanks,

Ronak

 

On Tue, Jun 5, 2018 at 4:38 PM, Rob Van der Wijngaart <robv AT nvidia.com> wrote:

OK, I got my .log, .sts, and .projrc files for my run! Next question, if you don’t mind. I know of Projections, but I’d like to use my own graph analysis tools to parse the log files. Is there a description of the records in the .log and .sts files? Thanks!

 

Rob

 

From: Rob Van der Wijngaart
Sent: Tuesday, June 5, 2018 2:11 PM
To: 'Ronak Buch' <rabuch2 AT illinois.edu>
Cc: charm AT cs.illinois.edu
Subject: RE: [charm] Obtaining Charm++ profile

 

Thanks, Ronak!

 

Rob

 

From: Ronak Buch <rabuch2 AT illinois.edu>
Sent: Tuesday, June 5, 2018 2:05 PM
To: Rob Van der Wijngaart <robv AT nvidia.com>
Cc: charm AT cs.illinois.edu
Subject: Re: [charm] Obtaining Charm++ profile

 

You built Charm++ with the correct flag ("--enable-tracing"). The issue is that the application, when linked, needs the "-tracemode projections" flag.

 

Specifically, if you're building NAMD (as you seem to be), you can just run "make projections," which includes the "-tracemode projections" flag already.

 

Thanks,

Ronak

 

 

On Tue, Jun 5, 2018 at 12:47 PM, Rob Van der Wijngaart <robv AT nvidia.com> wrote:

Hello Team,

 

I am trying to obtain a profile of a NAMD run using Charm++. I built Charm++ this way:

./build charm++ multicore-linux64 gcc --with-production --enable-tracing -j40 -tracemode summary

According to the man page I should now automatically obtain a profile if I run the application like this:

./namd-2.13-multicore-cuda +traceroot $PWD +p32 +setcpuaffinity +pemap 0-31 +devices 0,1,2,3 stmv/stmv.namd

However, I do not see any profile. If I then add +sumonly to the command line, I get this error message:

WARNING: +sumonly is a command line argument beginning with a '+' but was not parsed by the RTS.

If any of the above arguments were intended for the RTS you may need to recompile Charm++ with different options.

So it appears I did not build Charm++ properly to support tracing. Can you tell me how to do it right?

Second, I am actually interesting in doing projections, not obtaining a summary profile, but I do not understand this instruction in the manual:

Link time option: -tracemode projections

After all, I only issue a build command to create the runtime, with no explicit link calls issued. Can you advise?
This concerns v6.8.2, which I obtained as a tar ball.

Thanks,
Rob

 

 


This email message is for the sole use of the intended recipient(s) and may contain confidential information.  Any unauthorized review, use, disclosure or distribution is prohibited.  If you are not the intended recipient, please contact the sender by reply email and destroy all copies of the original message.


 

 

 




Archive powered by MHonArc 2.6.19.

Top of Page