Skip to Content.
Sympa Menu

charm - Re: [charm] Parallel read

charm AT lists.cs.illinois.edu

Subject: Charm++ parallel programming system

List archive

Re: [charm] Parallel read


Chronological Thread 
  • From: Jozsef Bakosi <jbakosi AT gmail.com>
  • To: Ronak Buch <rabuch2 AT illinois.edu>
  • Cc: "charm AT cs.uiuc.edu" <charm AT cs.uiuc.edu>, rspavel AT lanl.gov
  • Subject: Re: [charm] Parallel read
  • Date: Wed, 22 Feb 2017 14:48:37 -0700

Hi Ronak,

We would like to give the existing parallel read of ckio a try. 

The way I see it at this point, our options are as follows:

1. From a chare group, do a parallel read in which not all group elements participate (how many, e.g., a percentage, controlled by the user),

2. Do the same as 1, but from a node group, using a similar user-adjustable parameter specifying the number of nodes that should participate in the read.

3. Use ckio's read.

Of these three options, I assume option 3 would also be able to overlap communication and computation, so ultimately we would probably like that best.. Also as I see, the above is also in the order of increasing complexity, but potentially, also in the order of larger return over investment.

I think we would like to give ckio's reader a try just to learn more about option 3 and to be able to better scope out what we can get out of it for our purposes using a small test application.

Could you please merge master into that branch so we can give it a try and let us know when we can try it?

Thanks,
Jozsef

On Sat, Feb 11, 2017 at 6:13 PM, Ronak Buch <rabuch2 AT illinois.edu> wrote:
Hi Jozsef,

There was some work on adding read capabilities to CkIO a while back, it's on the gerrit/rohan/ckioread branch of charm.  The reading facility added there should work, but nobody has been using it in a production applications, so it's out of date (and it hasn't been rebased on a long time).  Now that there's some interest in using parallel file reading, I'll update the branch and do some performance tests and get back to you.

Otherwise, a node group level file I/O scheme should work well in the meantime.  Your sketch on how things should be distributed makes sense to me.

Thanks,
Ronak

On Fri, Feb 10, 2017 at 9:37 PM, Jozsef Bakosi <jbakosi AT gmail.com> wrote:
Hi folks,

Currently, I have file/PE-style read of a mesh that is in a single file and about a few tens of GB containing a few hundred million cells and their node coordinates. This is okay but does not scale very well beyond a few thousand cores.

I have read the paper on ckio from 2011 and looked at the source and it is definitely something I would like to explore for writing checkpoints in the future, but I wonder what my options are for input. ckio does not seem to have a capability at this point for reading.

Another option that seems appealing is to use Charm++'s node groups and/or the "Physical Node API" for simply replacing my file/PE read with a node/PE read and then after reading my input file in that fashion in a second step I would distribute the data to group and/or chare array elements.

What do you suggest? At this point I am interested in expl oring both quick/dirty as well as good/longer-term solutions.

Thanks,
Jozsef





Archive powered by MHonArc 2.6.19.

Top of Page