<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" lang="en-gb">
<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8" />
<title>pprocess - Tutorial</title>
<link href="styles.css" rel="stylesheet" type="text/css" />
</head>
<body>

<h1>pprocess - Tutorial</h1>

<p>The <code>pprocess</code> module provides several mechanisms for running
Python code concurrently in several processes. The most straightforward way of
making a program parallel-aware - that is, where the program can take
advantage of more than one processor to simultaneously process data - is to
use the <code>pmap</code> function.</p>

<ul>
<li><a href="#note">A Note on Parallel Processes</a></li>
<li><a href="#pmap">Converting Map-Style Code</a></li>
<li><a href="#Map">Converting Invocations to Parallel Operations</a></li>
<li><a href="#Queue">Converting Arbitrarily-Ordered Invocations</a>
  <ul>
  <li><a href="#Exchange">Replacing Queues with Exchanges</a></li>
  <li><a href="#channel">Using Channels in Callables</a></li>
  </ul>
</li>
<li><a href="#create">Converting Inline Computations</a></li>
<li><a href="#MakeReusable">Reusing Processes in Parallel Programs</a></li>
<li><a href="#continuous">Supporting Continuous Processes in Parallel Programs</a></li>
<li><a href="#BackgroundCallable">Performing Computations in Background Processes</a></li>
<li><a href="#ManagingBackgroundProcesses">Managing Several Background Processes</a></li>
<li><a href="#Summary">Summary</a></li>
</ul>

<p>For a brief summary of each of the features of <code>pprocess</code>, see
the <a href="reference.html">reference document</a>.</p>

<h2 id="note">A Note on Parallel Processes</h2>

<p>The way <code>pprocess</code> uses multiple processes to perform work in
parallel involves the <code>fork</code> system call, which on modern operating
systems involves what is known as "copy-on-write" semantics. In plain language,
when <code>pprocess</code> creates a new <em>child</em> process to perform work
in parallel with other work that needs to be done, this new process will be a
near-identical copy of the original <em>parent</em> process, and the running
code will be able to access data resident in that parent process.</p>

<p>However, when a child process modifies data, instead of changing that data
in such a way that the parent process can see the modifications, the parent
process will, in fact, remain oblivious to such changes. What happens is that
as soon as the child process attempts to modify the data, it obtains its own
separate copy which is then modified independently of the original data. Thus,
a <em>copy</em> of any data is made when an attempt is made to <em>write</em>
to such data. Meanwhile, the parent's copy of that data will be left untouched
by the activities of the child.</p>

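<p>This behaviour can be demonstrated directly with the underlying system
call. The following sketch uses <code>os.fork</code> from the standard
library rather than <code>pprocess</code> itself, and is for illustration
only (it assumes a POSIX platform): the child mutates its own copy of a list
while the parent's copy stays intact.</p>

```python
import os

def fork_and_mutate():
    "Fork a child which appends to a list; the parent's copy is unaffected."
    data = [1, 2]
    pid = os.fork()
    if pid == 0:
        # Child process: this append only touches the child's
        # copy-on-write pages, not the parent's memory.
        data.append(3)
        os._exit(0 if data == [1, 2, 3] else 1)
    # Parent process: wait for the child, then inspect the original list.
    _, status = os.waitpid(pid, 0)
    assert os.WEXITSTATUS(status) == 0  # the child did see its own change
    return data

data = fork_and_mutate()  # data is still [1, 2] in the parent
```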
<p>It is therefore essential to note that any data distributed to other
processes, and which will then be modified by those processes, will not appear
to change in the parent process even if the objects employed are mutable. This
is rather different to the behaviour of a normal Python program: a function
which mutates a list passed to it, for example, changes that list in such a
way that the modifications will still be present upon returning from that
function. For example:</p>

<pre>
def mutator(l):
    l.append(3)

l = [1, 2]
mutator(l) # l is now [1, 2, 3]
</pre>

<p>In contrast, passing a list to a child process will cause the list to
mutate in the child process, but the parent process will not see the list
change. For example:</p>

<pre>
def mutator(l):
    l.append(3)

results = pprocess.Map()
mutator = results.manage(pprocess.MakeParallel(mutator))

l = [1, 2]
mutator(l) # l is now [1, 2]
</pre>

<p>To communicate changes to data between processes, the modified objects must
be explicitly returned from child processes using the mechanisms described in
this documentation. For example:</p>

<pre>
def mutator(l):
    l.append(3)
    return l # the modified object is explicitly returned

results = pprocess.Map()
mutator = results.manage(pprocess.MakeParallel(mutator))

l = [1, 2]
mutator(l)

all_l = results[:] # there are potentially many results, not just one
l = all_l[0] # l is now [1, 2, 3], taken from the first result
</pre>

<p>It is perhaps easiest to think of the communications mechanisms as
providing a gateway between processes through which information can be passed,
with the rest of a program's data being private and hidden from the other
processes (even if that data initially resembles what the other processes also
see within themselves).</p>

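<p>As an illustration of such a gateway, the following sketch uses a plain
pipe and <code>os.fork</code> from the standard library (not
<code>pprocess</code> itself) to pass a pickled result from a child back to
its parent; this is essentially the kind of work the communications
mechanisms perform on the program's behalf:</p>

```python
import os
import pickle

def compute_in_child(func, arg):
    "Run func(arg) in a forked child, returning the result through a pipe."
    r, w = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child: the write end of the pipe is the only gateway back
        # to the parent; everything else remains private to the child.
        os.close(r)
        os.write(w, pickle.dumps(func(arg)))
        os.close(w)
        os._exit(0)
    # Parent: read the pickled result sent through the gateway.
    os.close(w)
    chunks = []
    while True:
        chunk = os.read(r, 4096)
        if not chunk:
            break
        chunks.append(chunk)
    os.close(r)
    os.waitpid(pid, 0)
    return pickle.loads(b"".join(chunks))

doubled = compute_in_child(lambda xs: [x * 2 for x in xs], [1, 2, 3])
```

Note that only the pickled result crosses the pipe; the function itself is
inherited by the child through the fork.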
<h2 id="pmap">Converting Map-Style Code</h2>

<p>Consider a program using the built-in <code>map</code> function and a sequence of inputs:</p>

<pre>
t = time.time()

# Initialise an array.

sequence = []
for i in range(0, N):
    for j in range(0, N):
        sequence.append((i, j))

# Perform the work.

results = map(calculate, sequence)

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_map.py</code> file.)</p>

<p>The principal features of this program involve the preparation of an array
for input purposes, and the use of the <code>map</code> function to iterate
over the combinations of <code>i</code> and <code>j</code> in the array. Even
if the <code>calculate</code> function could be invoked independently for each
input value, we have to wait for each computation to complete before
initiating a new one. The <code>calculate</code> function may be defined as
follows:</p>

<pre>
def calculate(t):

    "A supposedly time-consuming calculation on 't'."

    i, j = t
    time.sleep(delay)
    return i * N + j
</pre>

<p>In order to reduce the processing time - to speed the code up, in other
words - we can make this code use several processes instead of just one. Here
is the modified code:</p>

<pre>
t = time.time()

# Initialise an array.

sequence = []
for i in range(0, N):
    for j in range(0, N):
        sequence.append((i, j))

# Perform the work.

results = <strong>pprocess.pmap</strong>(calculate, sequence<strong>, limit=limit</strong>)

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_pmap.py</code> file.)</p>

<p>By replacing usage of the <code>map</code> function with the
<code>pprocess.pmap</code> function, and specifying the limit on the number of
processes to be active at any given time (the value of the <code>limit</code>
variable is defined elsewhere), several calculations can now be performed in
parallel.</p>

<h2 id="Map">Converting Invocations to Parallel Operations</h2>

<p>Although some programs make natural use of the <code>map</code> function,
others may employ an invocation in a nested loop. This may also be converted
to a parallel program. Consider the following Python code:</p>

<pre>
t = time.time()

# Initialise an array.

results = []

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        results.append(calculate(i, j))

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple1.py</code> file.)</p>

<p>Here, a computation in the <code>calculate</code> function is performed for
each combination of <code>i</code> and <code>j</code> in the nested loop,
returning a result value. However, we must wait for the completion of this
function for each element before moving on to the next element, and this means
that the computations are performed sequentially. Consequently, on a system
with more than one processor, even if we could call <code>calculate</code> for
more than one combination of <code>i</code> and <code>j</code>
and have the computations executing at the same time, the above program will
not take advantage of such capabilities.</p>

<p>We use a slightly modified version of <code>calculate</code> which employs
two parameters instead of one:</p>

<pre>
def calculate(i, j):

    """
    A supposedly time-consuming calculation on 'i' and 'j'.
    """

    time.sleep(delay)
    return i * N + j
</pre>

<p>In order to reduce the processing time - to speed the code up, in other
words - we can make this code use several processes instead of just one. Here
is the modified code:</p>

<pre id="simple_managed_map">
t = time.time()

# Initialise the results using a map with a limit on the number of
# channels/processes.

<strong>results = pprocess.Map(limit=limit)</strong>

# Wrap the calculate function and manage it.

<strong>calc = results.manage(pprocess.MakeParallel(calculate))</strong>

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        <strong>calc</strong>(i, j)

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_managed_map.py</code> file.)</p>

<p>The principal changes in the above code involve the use of a
<code>pprocess.Map</code> object to collect the results, and a version of the
<code>calculate</code> function which is managed by the <code>Map</code>
object. What the <code>Map</code> object does is to arrange the results of
computations such that iterating over the object or accessing the object using
list operations provides the results in the same order as their corresponding
inputs.</p>

<h2 id="Queue">Converting Arbitrarily-Ordered Invocations</h2>

<p>In some programs, it is not important to receive the results of
computations in any particular order, usually because either the order of
these results is irrelevant, or because the results provide "positional"
information which lets them be handled in an appropriate way. Consider the
following Python code:</p>

<pre>
t = time.time()

# Initialise an array.

results = [0] * N * N

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        i2, j2, result = calculate(i, j)
        results[i2*N+j2] = result

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple2.py</code> file.)</p>

<p>Here, a result array is initialised first and each computation is performed
sequentially. A significant difference to the previous examples is the return
value of the <code>calculate</code> function: the position details
corresponding to <code>i</code> and <code>j</code> are returned alongside the
result. Obviously, this is of limited value in the above code because the
order of the computations and the reception of results is fixed. Moreover, we
get no benefit from parallelisation in the above example.</p>

<p>We can bring the benefits of parallel processing to the above program with
the following code:</p>

<pre id="simple_managed_queue">
t = time.time()

# Initialise the communications queue with a limit on the number of
# channels/processes.

<strong>queue = pprocess.Queue(limit=limit)</strong>

# Initialise an array.

results = [0] * N * N

# Wrap the calculate function and manage it.

<strong>calc = queue.manage(pprocess.MakeParallel(calculate))</strong>

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        <strong>calc(i, j)</strong>

# Store the results as they arrive.

print "Finishing..."
<strong>for i, j, result in queue:</strong>
    <strong>results[i*N+j] = result</strong>

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_managed_queue.py</code> file.)</p>

<p>This revised code employs a <code>pprocess.Queue</code> object whose
purpose is to collect the results of computations and to make them available
in the order in which they were received. The code collecting results has been
moved into a separate loop independent of the original computation loop and
taking advantage of the more relevant "positional" information emerging from
the queue.</p>

<h3 id="Exchange">Replacing Queues with Exchanges</h3>

<p>We can take this example further, illustrating some of the mechanisms
employed by <code>pprocess</code>. Instead of collecting results in a queue,
we can define a class containing a method which is called when new results
arrive:</p>

<pre>
class MyExchange(pprocess.Exchange):

    "Parallel convenience class containing the array assignment operation."

    def store_data(self, ch):
        i, j, result = ch.receive()
        self.D[i*N+j] = result
</pre>

<p>This code exposes the channel paradigm which is used throughout
<code>pprocess</code> and is available to applications, if desired. The effect
of the method is the storage of a result received through the channel in an
attribute of the object. The following code shows how this class can be used,
with differences to the previous program illustrated:</p>

<pre>
t = time.time()

# Initialise the communications exchange with a limit on the number of
# channels/processes.

<strong>exchange = MyExchange(limit=limit)</strong>

# Initialise an array - it is stored in the exchange to permit automatic
# assignment of values as the data arrives.

<strong>results = exchange.D = [0] * N * N</strong>

# Wrap the calculate function and manage it.

calc = <strong>exchange</strong>.manage(pprocess.MakeParallel(calculate))

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        calc(i, j)

# Wait for the results.

print "Finishing..."
<strong>exchange.finish()</strong>

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_managed.py</code> file.)</p>

<p>The main visible differences between this and the previous program are the
storage of the result array in the exchange, the removal of the queue
consumption code from the main program, placing the act of storing values in
the exchange's <code>store_data</code> method, and the need to call the
<code>finish</code> method on the <code>MyExchange</code> object so that we do
not try to access the results too soon. One underlying benefit not visible in
the above code is that we no longer need to accumulate results in a queue or
other structure so that they may be processed and assigned to the correct
positions in the result array.</p>

<h3 id="channel">Using Channels in Callables</h3>

<p>For the curious, we may remove some of the remaining conveniences of the
above program to expose other features of <code>pprocess</code>. First, we
define a slightly modified version of the <code>calculate</code> function:</p>

<pre>
def calculate(ch, i, j):

    """
    A supposedly time-consuming calculation on 'i' and 'j', using 'ch' to
    communicate with the parent process.
    """

    time.sleep(delay)
    ch.send((i, j, i * N + j))
</pre>

<p>This function accepts a channel, <code>ch</code>, through which results
will be sent, and through which other values could potentially be received,
although we choose not to do so here. The program using this function is as
follows, with differences to the previous program illustrated:</p>

<pre>
t = time.time()

# Initialise the communications exchange with a limit on the number of
# channels/processes.

exchange = MyExchange(limit=limit)

# Initialise an array - it is stored in the exchange to permit automatic
# assignment of values as the data arrives.

results = exchange.D = [0] * N * N

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        <strong>exchange.start(calculate, i, j)</strong>

# Wait for the results.

print "Finishing..."
exchange.finish()

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_start.py</code> file.)</p>

<p>Here, we have discarded two conveniences: the wrapping of callables using
<code>MakeParallel</code>, which lets us use functions without providing any
channel parameters, and the management of callables using the
<code>manage</code> method on queues, exchanges, and so on. The
<code>start</code> method still calls the provided callable, but using a
different notation from that employed previously.</p>

paulb@145 | 536 | <h2 id="create">Converting Inline Computations</h2> |
paulb@124 | 537 | |
paulb@124 | 538 | <p>Although many programs employ functions and other useful abstractions which |
paulb@124 | 539 | can be treated as parallelisable units, some programs perform computations |
paulb@124 | 540 | "inline", meaning that the code responsible appears directly within a loop or |
paulb@124 | 541 | related control-flow construct. Consider the following code:</p> |
paulb@124 | 542 | |
paulb@124 | 543 | <pre> |
paulb@124 | 544 | t = time.time() |
paulb@124 | 545 | |
paulb@124 | 546 | # Initialise an array. |
paulb@124 | 547 | |
paulb@124 | 548 | results = [0] * N * N |
paulb@124 | 549 | |
paulb@124 | 550 | # Perform the work. |
paulb@124 | 551 | |
paulb@124 | 552 | print "Calculating..." |
paulb@124 | 553 | for i in range(0, N): |
paulb@124 | 554 | for j in range(0, N): |
paulb@124 | 555 | time.sleep(delay) |
paulb@124 | 556 | results[i*N+j] = i * N + j |
paulb@124 | 557 | |
paulb@124 | 558 | # Show the results. |
paulb@124 | 559 | |
paulb@124 | 560 | for i in range(0, N): |
paulb@124 | 561 | for result in results[i*N:i*N+N]: |
paulb@124 | 562 | print result, |
paulb@124 | 563 | print |
paulb@124 | 564 | |
paulb@124 | 565 | print "Time taken:", time.time() - t |
paulb@124 | 566 | </pre> |
paulb@124 | 567 | |
paulb@124 | 568 | <p>(This code in context with <code>import</code> statements and functions is |
paulb@124 | 569 | found in the <code>examples/simple.py</code> file.)</p> |
paulb@124 | 570 | |
paulb@124 | 571 | <p>To simulate "work", as in the different versions of the |
paulb@124 | 572 | <code>calculate</code> function, we use the <code>time.sleep</code> function |
paulb@124 | 573 | (which does not actually do work, and which will cause a process to be |
paulb@124 | 574 | descheduled in most cases, but which simulates the delay associated with work |
paulb@124 | 575 | being done). This inline work, which must be performed sequentially in the |
paulb@124 | 576 | above program, can be performed in parallel in a somewhat modified version of |
paulb@124 | 577 | the program:</p> |
paulb@124 | 578 | |
paulb@124 | 579 | <pre> |
paulb@124 | 580 | t = time.time() |
paulb@124 | 581 | |
paulb@124 | 582 | # Initialise the results using a map with a limit on the number of |
paulb@124 | 583 | # channels/processes. |
paulb@124 | 584 | |
paulb@124 | 585 | <strong>results = pprocess.Map(limit=limit)</strong> |
paulb@124 | 586 | |
paulb@124 | 587 | # Perform the work. |
paulb@124 | 588 | # NOTE: Could use the with statement in the loop to package the |
paulb@124 | 589 | # NOTE: try...finally functionality. |
paulb@124 | 590 | |
paulb@124 | 591 | print "Calculating..." |
paulb@124 | 592 | for i in range(0, N): |
paulb@124 | 593 | for j in range(0, N): |
paulb@124 | 594 | <strong>ch = results.create()</strong> |
paulb@124 | 595 | <strong>if ch:</strong> |
paulb@124 | 596 | <strong>try: # Calculation work.</strong> |
paulb@124 | 597 | |
paulb@124 | 598 | time.sleep(delay) |
paulb@124 | 599 | <strong>ch.send(i * N + j)</strong> |
paulb@124 | 600 | |
paulb@124 | 601 | <strong>finally: # Important finalisation.</strong> |
paulb@124 | 602 | |
paulb@124 | 603 | <strong>pprocess.exit(ch)</strong> |
paulb@124 | 604 | |
paulb@124 | 605 | # Show the results. |
paulb@124 | 606 | |
paulb@124 | 607 | for i in range(0, N): |
paulb@124 | 608 | for result in results[i*N:i*N+N]: |
paulb@124 | 609 | print result, |
paulb@124 | 610 | print |
paulb@124 | 611 | |
paulb@124 | 612 | print "Time taken:", time.time() - t |
paulb@124 | 613 | </pre> |
paulb@124 | 614 | |
paulb@124 | 615 | <p>(This code in context with <code>import</code> statements and functions is |
paulb@124 | 616 | found in the <code>examples/simple_create_map.py</code> file.)</p> |
paulb@124 | 617 | |
<p>Although seemingly more complicated, the bulk of the changes in this
modified program concern obtaining a channel object, <code>ch</code>, at the
point where the computations are performed, and wrapping the computation code
in a <code>try</code>...<code>finally</code> statement which ensures that the
process associated with the channel exits when the computation is complete.
To collect the results of these computations, a <code>pprocess.Map</code>
object is used, since it maintains the results in the same order as the
initiation of the computations which produced them.</p>

<h2 id="MakeReusable">Reusing Processes in Parallel Programs</h2>

<p>One notable aspect of the above programs, when parallelised, is that each
invocation of a computation creates a new process in which that computation is
performed, regardless of whether existing processes have just finished
producing results and could have been asked to perform new computations. In
other words, processes are created and destroyed instead of being reused.</p>

<p>However, we can request that processes be reused for computations by
enabling the <code>reuse</code> feature of exchange-like objects and employing
suitable reusable callables. Consider this modified version of the <a
href="#simple_managed_map">simple_managed_map</a> program:</p>

<pre>
t = time.time()

# Initialise the results using a map with a limit on the number of
# channels/processes.

results = pprocess.Map(limit=limit<strong>, reuse=1</strong>)

# Wrap the calculate function and manage it.

calc = results.manage(pprocess.Make<strong>Reusable</strong>(calculate))

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        calc(i, j)

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_manage_map_reusable.py</code> file.)</p>

<p>By indicating that processes and channels shall be reused, and by wrapping
the <code>calculate</code> function with the necessary support, the
computations may be performed in parallel using a pool of processes instead of
creating a new process for each computation and then discarding it, only to
create a new process for the next computation.</p>
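
<p>The pooling effect just described can be sketched with the standard
library's <code>multiprocessing</code> module. This is a rough modern-Python
analogue of process reuse, not <code>pprocess</code>'s own API; the
<code>calculate</code> stand-in and the pool size are illustrative
assumptions:</p>

```python
from multiprocessing import Pool

def calculate(args):
    # Stand-in for the tutorial's time-consuming calculation on (i, j).
    i, j = args
    return i * 10 + j

if __name__ == "__main__":
    N = 3
    # Two worker processes are created once and then reused for all nine
    # calculations, instead of one process being created per calculation.
    with Pool(processes=2) as pool:
        results = pool.map(calculate,
                           [(i, j) for i in range(N) for j in range(N)])
    print(results)  # → [0, 1, 2, 10, 11, 12, 20, 21, 22]
```

<p>As with <code>pprocess.Map</code>, the results come back in the order in
which the calculations were initiated, even if the pool completes them out of
order.</p>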

<h2 id="continuous">Supporting Continuous Processes in Parallel Programs</h2>

<p>Although reusable processes offer the opportunity to invoke a callable over
and over within the same created process, they do not fully exploit the
potential of the underlying mechanisms in <code>pprocess</code>: created
processes can communicate multiple values to the creating process and can, in
principle, run within the same callable indefinitely.</p>

<p>Consider this modified form of the <code>calculate</code> function:</p>

<pre>
def calculate(ch, i):

    """
    A supposedly time-consuming calculation on 'i'.
    """

    for j in range(0, N):
        time.sleep(delay)
        ch.send((i, j, i * N + j))
</pre>

<p>This function accepts a channel <code>ch</code> together with an argument
<code>i</code> corresponding to an entire row of the input array, as opposed
to having two arguments (<code>i</code> and <code>j</code>) corresponding to a
single cell in the input array. In this function, a series of calculations are
performed and a number of values are returned through the channel, without the
function terminating until all values have been returned for the row data.</p>
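
<p>The shape of such a continuous callable can be sketched with the standard
library's <code>multiprocessing.Pipe</code>, which stands in for the channel
here; this is a stdlib analogue with illustrative values, not
<code>pprocess</code>'s channel API:</p>

```python
import time
from multiprocessing import Pipe, Process

N, delay = 3, 0.01

def calculate(ch, i):
    # Send one (i, j, value) tuple per cell of row i through the channel,
    # terminating only once the whole row has been produced.
    for j in range(N):
        time.sleep(delay)
        ch.send((i, j, i * N + j))
    ch.close()

if __name__ == "__main__":
    parent_end, child_end = Pipe()
    worker = Process(target=calculate, args=(child_end, 1))
    worker.start()
    child_end.close()  # close the parent's copy so recv() can see end-of-stream
    row = []
    try:
        while True:
            row.append(parent_end.recv())
    except EOFError:  # raised once the child has closed its end
        pass
    worker.join()
    print(row)  # → [(1, 0, 3), (1, 1, 4), (1, 2, 5)]
```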

<p>To use this modified function, a modified version of the
<a href="#simple_managed_queue">simple_managed_queue</a> program is used:</p>

<pre>
t = time.time()

# Initialise the communications queue with a limit on the number of
# channels/processes.

queue = pprocess.Queue(limit=limit<strong>, continuous=1</strong>)

# Initialise an array.

results = [0] * N * N

# Manage the calculate function.

calc = queue.manage(<strong>calculate</strong>)

# Perform the work.

print "Calculating..."
for i in range(0, N):
    <strong>calc(i)</strong>

# Store the results as they arrive.

print "Finishing..."
for i, j, result in queue:
    results[i*N+j] = result

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_continuous_queue.py</code> file.)</p>

<p>Although the inner loop in the work section has been relocated to the
<code>calculate</code> function, the queue still receives outputs from that
function with positional information and a result for the result array. Thus,
no change is needed for the retrieval of the results: they arrive in the queue
as before.</p>

<h2 id="BackgroundCallable">Performing Computations in Background Processes</h2>

<p>Occasionally, it is desirable not only to initiate time-consuming
computations and leave them running in the background, but also to detach the
creating process from them completely (potentially terminating it altogether),
and yet still be able to collect the results of the created processes at a
later time, potentially in a completely different process. For such
situations, we can make use of the <code>BackgroundCallable</code> class,
which converts a parallel-aware callable into a callable which will run in a
background process when invoked.</p>

<p>Consider this excerpt from a modified version of the <a
href="#simple_managed_queue">simple_managed_queue</a> program:</p>

<pre>
<strong>def task():</strong>

    # Initialise the communications queue with a limit on the number of
    # channels/processes.

    queue = pprocess.Queue(limit=limit)

    # Initialise an array.

    results = [0] * N * N

    # Wrap the calculate function and manage it.

    calc = queue.manage(pprocess.MakeParallel(calculate))

    # Perform the work.

    print "Calculating..."
    for i in range(0, N):
        for j in range(0, N):
            calc(i, j)

    # Store the results as they arrive.

    print "Finishing..."
    for i, j, result in queue:
        results[i*N+j] = result

    <strong>return results</strong>
</pre>

<p>Here, we have converted the main program into a function, and instead of
printing out the results, we return the results list from the function.</p>

<p>Now, let us consider the new main program (with the relevant mechanisms
highlighted):</p>

<pre>
t = time.time()

if "--reconnect" not in sys.argv:

    # Wrap the computation and manage it.

    <strong>ptask = pprocess.BackgroundCallable("task.socket", pprocess.MakeParallel(task))</strong>

    # Perform the work.

    ptask()

    # Discard the callable.

    del ptask
    print "Discarded the callable."

if "--start" not in sys.argv:

    # Open a queue and reconnect to the task.

    print "Opening a queue."
    <strong>queue = pprocess.BackgroundQueue("task.socket")</strong>

    # Wait for the results.

    print "Waiting for persistent results"
    for results in queue:
        pass # should only be one element

    # Show the results.

    for i in range(0, N):
        for result in results[i*N:i*N+N]:
            print result,
        print

    print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_background_queue.py</code> file.)</p>

<p>This new main program has two parts: the part which initiates the
computation, and the part which connects to the computation in order to collect
the results. Both parts can be run in the same process, and this should result
in similar behaviour to that of the original
<a href="#simple_managed_queue">simple_managed_queue</a> program.</p>

<p>In the above program, however, we are free to specify <code>--start</code>
as an option when running the program. The effect of this is merely to
initiate the computation in a background process, using
<code>BackgroundCallable</code> to obtain a callable which, when invoked,
creates the background process and runs the computation. The program then
exits, leaving the computation running as a collection of background
processes, with a special file called <code>task.socket</code> remaining in
the current working directory.</p>

<p>When the above program is run using the <code>--reconnect</code> option, an
attempt is made to reconnect to the background processes already created,
contacting them through the previously created <code>task.socket</code>
special file (which is, in fact, a UNIX-domain socket). This is done using the
<code>BackgroundQueue</code> function, which handles the incoming results in a
fashion similar to that of a <code>Queue</code> object. Since only one result
is returned by the computation (as defined by the <code>return</code>
statement in the <code>task</code> function), we need only expect one element
to be collected by the queue: a list containing all of the results produced in
the computation.</p>
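
<p>The underlying mechanism, a background process serving its result over a
UNIX-domain socket to whichever process connects later, can be sketched with
the standard library. The socket path, the <code>task</code> stand-in and the
helper names here are illustrative assumptions, not <code>pprocess</code>'s
API:</p>

```python
import os
import pickle
import socket
import tempfile
import time
from multiprocessing import Process

def task(n):
    # Stand-in for the tutorial's time-consuming computation.
    return [i * i for i in range(n)]

def serve_result(sock_path, func, args):
    # Runs in the background process: compute, then serve the pickled
    # result to the first client that connects to the socket.
    server = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    server.bind(sock_path)
    server.listen(1)
    result = func(*args)
    conn, _ = server.accept()
    conn.sendall(pickle.dumps(result))
    conn.close()
    server.close()

def collect_result(sock_path):
    # Runs in the collecting process, which only needs the socket's name:
    # connect (retrying until the server has bound) and read the result.
    client = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    while True:
        try:
            client.connect(sock_path)
            break
        except (FileNotFoundError, ConnectionRefusedError):
            time.sleep(0.05)
    chunks = []
    while True:
        data = client.recv(4096)
        if not data:
            break
        chunks.append(data)
    client.close()
    return pickle.loads(b"".join(chunks))

if __name__ == "__main__":
    sock_path = os.path.join(tempfile.mkdtemp(), "task.socket")
    worker = Process(target=serve_result, args=(sock_path, task, (4,)))
    worker.start()
    print(collect_result(sock_path))  # → [0, 1, 4, 9]
    worker.join()
```

<p>Since <code>collect_result</code> identifies the computation only by the
socket's filename, it could equally well run in a different process started
after the initiating one has exited.</p>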

<h2 id="ManagingBackgroundProcesses">Managing Several Background Processes</h2>

<p>In the above example, a single background process was used to manage a number
of other processes, with all of them running in the background. However, it can
be desirable to manage more than one background process.</p>

<p>Consider this excerpt from a modified version of the <a
href="#simple_managed_queue">simple_managed_queue</a> program:</p>

<pre>
<strong>def task(i):</strong>

    # Initialise the communications queue with a limit on the number of
    # channels/processes.

    queue = pprocess.Queue(limit=limit)

    # Initialise an array.

    results = [0] * N

    # Wrap the calculate function and manage it.

    calc = queue.manage(pprocess.MakeParallel(calculate))

    # Perform the work.

    print "Calculating..."
    <strong>for j in range(0, N):</strong>
        <strong>calc(i, j)</strong>

    # Store the results as they arrive.

    print "Finishing..."
    <strong>for i, j, result in queue:</strong>
        <strong>results[j] = result</strong>

    <strong>return i, results</strong>
</pre>

<p>Just as in the previous example, a function called <code>task</code> has
been defined to hold a background computation, and this function returns a
portion of the results. However, unlike the previous example or the original
example, the scope of the results collected in the function has been changed:
here, only results for calculations involving a certain value of
<code>i</code> are collected, with that value of <code>i</code> returned along
with the appropriate portion of the results.</p>

<p>Now, let us consider the new main program (with the relevant mechanisms
highlighted):</p>

<pre>
t = time.time()

if "--reconnect" not in sys.argv:

    # Wrap the computation and manage it.

    <strong>ptask = pprocess.MakeParallel(task)</strong>

    <strong>for i in range(0, N):</strong>

        # Make a distinct callable for each part of the computation.

        <strong>ptask_i = pprocess.BackgroundCallable("task-%d.socket" % i, ptask)</strong>

        # Perform the work.

        <strong>ptask_i(i)</strong>

    # Discard the callable.

    del ptask
    print "Discarded the callable."

if "--start" not in sys.argv:

    # Open a queue and reconnect to the task.

    print "Opening a queue."
    <strong>queue = pprocess.PersistentQueue()</strong>
    <strong>for i in range(0, N):</strong>
        <strong>queue.connect("task-%d.socket" % i)</strong>

    # Initialise an array.

    <strong>results = [0] * N</strong>

    # Wait for the results.

    print "Waiting for persistent results"
    <strong>for i, result in queue:</strong>
        <strong>results[i] = result</strong>

    # Show the results.

    for i in range(0, N):
        <strong>for result in results[i]:</strong>
            print result,
        print

    print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_persistent_queue.py</code> file.)</p>

<p>In the first section, the process of making a parallel-aware callable is as
expected, but instead of then invoking a single background version of such a
callable, as in the previous example, we create a version for each value of
<code>i</code> (using <code>BackgroundCallable</code>) and then invoke each one.
The result of this is a total of <code>N</code> background processes, each
running an invocation of the <code>task</code> function with a distinct value of
<code>i</code> (which in turn perform computations), and each employing a
UNIX-domain socket for communication with a name of the form
<code>task-<em>i</em>.socket</code>.</p>

<p>In the second section, since we now have more than one background process, we
must find a way to monitor them after reconnecting to them; to achieve this, a
<code>PersistentQueue</code> is created, which acts like a regular
<code>Queue</code> object but is instead focused on handling persistent
communications. Upon connecting the queue to each of the previously created
UNIX-domain sockets, the queue acts like a regular <code>Queue</code> and
exposes received results through an iterator. Here, the principal difference
from previous examples is the structure of results: instead of collecting each
individual value in a flat <code>i</code> by <code>j</code> array, a list is
returned for each value of <code>i</code> and is stored directly in another
list.</p>
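
<p>The one-socket-per-task arrangement can be sketched with the standard
library as follows. Unlike <code>PersistentQueue</code>, which multiplexes
over all of its connections, this illustrative sketch simply drains each
socket in turn; all names and values are assumptions rather than
<code>pprocess</code>'s API:</p>

```python
import os
import pickle
import socket
import tempfile
import time
from multiprocessing import Process

N = 3

def task(i):
    # Compute one row of results, tagged with its row number.
    return i, [i * N + j for j in range(N)]

def serve(sock_path, i):
    # Background process: serve the pickled (i, row) pair on its own socket.
    server = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    server.bind(sock_path)
    server.listen(1)
    conn, _ = server.accept()
    conn.sendall(pickle.dumps(task(i)))
    conn.close()
    server.close()

def connect(sock_path):
    # Collector: connect (retrying until the server has bound) and read
    # back one pickled result.
    client = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    while True:
        try:
            client.connect(sock_path)
            break
        except (FileNotFoundError, ConnectionRefusedError):
            time.sleep(0.05)
    chunks = []
    while True:
        data = client.recv(4096)
        if not data:
            break
        chunks.append(data)
    client.close()
    return pickle.loads(b"".join(chunks))

if __name__ == "__main__":
    base = tempfile.mkdtemp()
    paths = [os.path.join(base, "task-%d.socket" % i) for i in range(N)]
    workers = [Process(target=serve, args=(p, i)) for i, p in enumerate(paths)]
    for w in workers:
        w.start()
    results = [0] * N
    for p in paths:
        i, row = connect(p)
        results[i] = row
    for w in workers:
        w.join()
    print(results)  # → [[0, 1, 2], [3, 4, 5], [6, 7, 8]]
```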

<h3>Applications of Background Computations</h3>

<p>Background computations are useful because they provide flexibility in the
way the results can be collected. One area in which they can be useful is Web
programming, where a process handling an incoming HTTP request may need to
initiate a computation but then immediately send output to the Web client,
such as a page indicating that the computation is "in progress", without
having to wait for the computation or to allocate resources to monitor it.
Moreover, in some Web architectures, notably those employing the Common
Gateway Interface (CGI), it is necessary for a process handling an incoming
request to terminate before its output will be sent to clients. By using a
<code>BackgroundCallable</code>, a Web server process can initiate a
computation, and subsequent server processes can then reconnect to the
background computation and wait efficiently for results.</p>

<h2 id="Summary">Summary</h2>

<p>The following table indicates the features used in converting each
sequential example program to its parallel equivalent:</p>

<table border="1" cellspacing="0" cellpadding="5">
  <thead>
    <tr>
      <th>Sequential Example</th>
      <th>Parallel Example</th>
      <th>Features Used</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>simple_map</td>
      <td>simple_pmap</td>
      <td>pmap</td>
    </tr>
    <tr>
      <td>simple1</td>
      <td>simple_managed_map</td>
      <td>MakeParallel, Map, manage</td>
    </tr>
    <tr>
      <td rowspan="6">simple2</td>
      <td>simple_managed_queue</td>
      <td>MakeParallel, Queue, manage</td>
    </tr>
    <tr>
      <td>simple_continuous_queue</td>
      <td>Queue, manage (continuous)</td>
    </tr>
    <tr>
      <td>simple_managed</td>
      <td>MakeParallel, Exchange (subclass), manage, finish</td>
    </tr>
    <tr>
      <td>simple_start</td>
      <td>Channel, Exchange (subclass), start, finish</td>
    </tr>
    <tr>
      <td>simple_background_queue</td>
      <td>MakeParallel, BackgroundCallable, BackgroundQueue</td>
    </tr>
    <tr>
      <td>simple_persistent_queue</td>
      <td>MakeParallel, BackgroundCallable, PersistentQueue</td>
    </tr>
    <tr>
      <td>simple</td>
      <td>simple_create_map</td>
      <td>Channel, Map, create, exit</td>
    </tr>
  </tbody>
</table>

</body>
</html>