<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" lang="en-gb">
<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8" />
<title>pprocess - Tutorial</title>
<link href="styles.css" rel="stylesheet" type="text/css" />
</head>
<body>

<h1>pprocess - Tutorial</h1>

<p>The <code>pprocess</code> module provides several mechanisms for running
Python code concurrently in several processes. The most straightforward way of
making a program parallel-aware - that is, where the program can take
advantage of more than one processor to simultaneously process data - is to
use the <code>pmap</code> function.</p>

<ul>
<li><a href="#pmap">Converting Map-Style Code</a></li>
<li><a href="#Map">Converting Invocations to Parallel Operations</a></li>
<li><a href="#Queue">Converting Arbitrarily-Ordered Invocations</a>
<ul>
<li><a href="#Exchange">Replacing Queues with Exchanges</a></li>
<li><a href="#channel">Using Channels in Callables</a></li>
</ul>
</li>
<li><a href="#create">Converting Inline Computations</a></li>
<li><a href="#MakeReusable">Reusing Processes in Parallel Programs</a></li>
<li><a href="#continuous">Supporting Continuous Processes in Parallel Programs</a></li>
<li><a href="#BackgroundCallable">Performing Computations in Background Processes</a></li>
<li><a href="#ManagingBackgroundProcesses">Managing Several Background Processes</a></li>
<li><a href="#Summary">Summary</a></li>
</ul>

<p>For a brief summary of each of the features of <code>pprocess</code>, see
the <a href="reference.html">reference document</a>.</p>

<h2 id="pmap">Converting Map-Style Code</h2>

<p>Consider a program using the built-in <code>map</code> function and a sequence of inputs:</p>

<pre>
t = time.time()

# Initialise an array.

sequence = []
for i in range(0, N):
    for j in range(0, N):
        sequence.append((i, j))

# Perform the work.

results = map(calculate, sequence)

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_map.py</code> file.)</p>

<p>The principal features of this program involve the preparation of an array
for input purposes, and the use of the <code>map</code> function to iterate
over the combinations of <code>i</code> and <code>j</code> in the array. Even
if the <code>calculate</code> function could be invoked independently for each
input value, we have to wait for each computation to complete before
initiating a new one. The <code>calculate</code> function may be defined as
follows:</p>

<pre>
def calculate(t):

    "A supposedly time-consuming calculation on 't'."

    i, j = t
    time.sleep(delay)
    return i * N + j
</pre>
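<p>The value returned by <code>calculate</code> is simply the cell's row-major
index in an <code>N</code> x <code>N</code> array, which is why the result
display can recover row <code>i</code> with the slice
<code>results[i*N:i*N+N]</code>. The following standalone sketch checks this
arithmetic, with an arbitrary <code>N</code> and the artificial delay
omitted:</p>

```python
# Standalone check of the row-major indexing used throughout the tutorial.
# N is arbitrary here, and the artificial delay is omitted.
N = 4

def calculate(t):
    i, j = t
    return i * N + j

sequence = [(i, j) for i in range(N) for j in range(N)]
results = [calculate(t) for t in sequence]

# The slice results[i*N:i*N+N] recovers row i of the conceptual N x N array.
assert results[2*N:2*N+N] == [8, 9, 10, 11]
assert results == list(range(N * N))
```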

<p>In order to reduce the processing time - to speed the code up, in other
words - we can make this code use several processes instead of just one. Here
is the modified code:</p>

<pre>
t = time.time()

# Initialise an array.

sequence = []
for i in range(0, N):
    for j in range(0, N):
        sequence.append((i, j))

# Perform the work.

results = <strong>pprocess.pmap</strong>(calculate, sequence<strong>, limit=limit</strong>)

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_pmap.py</code> file.)</p>

<p>By replacing usage of the <code>map</code> function with the
<code>pprocess.pmap</code> function, and specifying the limit on the number of
processes to be active at any given time (the value of the <code>limit</code>
variable is defined elsewhere), several calculations can now be performed in
parallel.</p>
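<p>The benefit of the <code>limit</code> parameter can be estimated with some
simple arithmetic: with <code>N * N</code> independent calls, each taking
about <code>delay</code> seconds, the sequential version needs roughly
<code>N * N * delay</code> seconds, whereas running <code>limit</code>
processes at a time needs roughly <code>N * N * delay / limit</code> seconds
plus process management overheads. For example (illustrative figures only, not
measurements):</p>

```python
# Back-of-envelope estimate of the speed-up from running "limit" processes
# at a time; the figures are illustrative, not measurements.
N = 10
delay = 1.0
limit = 4

sequential_time = N * N * delay          # one call after another
parallel_time = sequential_time / limit  # ignoring process overheads

assert sequential_time == 100.0
assert parallel_time == 25.0
```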

<h2 id="Map">Converting Invocations to Parallel Operations</h2>

<p>Although some programs make natural use of the <code>map</code> function,
others may employ an invocation in a nested loop. This may also be converted
to a parallel program. Consider the following Python code:</p>

<pre>
t = time.time()

# Initialise an array.

results = []

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        results.append(calculate(i, j))

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple1.py</code> file.)</p>

<p>Here, a computation in the <code>calculate</code> function is performed for
each combination of <code>i</code> and <code>j</code> in the nested loop,
returning a result value. However, we must wait for the completion of this
function for each element before moving on to the next element, and this means
that the computations are performed sequentially. Consequently, on a system
with more than one processor, even if we could call <code>calculate</code> for
more than one combination of <code>i</code> and <code>j</code>
and have the computations executing at the same time, the above program will
not take advantage of such capabilities.</p>

<p>We use a slightly modified version of <code>calculate</code> which employs
two parameters instead of one:</p>

<pre>
def calculate(i, j):

    """
    A supposedly time-consuming calculation on 'i' and 'j'.
    """

    time.sleep(delay)
    return i * N + j
</pre>

<p>In order to reduce the processing time - to speed the code up, in other
words - we can make this code use several processes instead of just one. Here
is the modified code:</p>

<pre id="simple_managed_map">
t = time.time()

# Initialise the results using a map with a limit on the number of
# channels/processes.

<strong>results = pprocess.Map(limit=limit)</strong>

# Wrap the calculate function and manage it.

<strong>calc = results.manage(pprocess.MakeParallel(calculate))</strong>

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        <strong>calc</strong>(i, j)

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_managed_map.py</code> file.)</p>

<p>The principal changes in the above code involve the use of a
<code>pprocess.Map</code> object to collect the results, and a version of the
<code>calculate</code> function which is managed by the <code>Map</code>
object. What the <code>Map</code> object does is to arrange the results of
computations such that iterating over the object or accessing the object using
list operations provides the results in the same order as their corresponding
inputs.</p>
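<p>The role of <code>pprocess.MakeParallel</code> can be understood in
isolation: it adapts an ordinary function so that, when run in a created
process, the function's return value is sent over that process's channel
rather than returned. The following single-process sketch illustrates the
idea; <code>RecordingChannel</code> and <code>make_parallel_sketch</code> are
stand-ins for illustration only, not the library's actual classes:</p>

```python
# Conceptual sketch of MakeParallel: adapt f(*args) into g(channel, *args)
# whose result is sent over the channel instead of being returned.
# RecordingChannel is a stand-in; real pprocess channels connect processes.
class RecordingChannel:
    def __init__(self):
        self.sent = []
    def send(self, value):
        self.sent.append(value)

def make_parallel_sketch(fn):
    def wrapped(channel, *args):
        channel.send(fn(*args))
    return wrapped

N = 10

def calculate(i, j):
    return i * N + j

parallel_calculate = make_parallel_sketch(calculate)
ch = RecordingChannel()
parallel_calculate(ch, 2, 3)
assert ch.sent == [23]
```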

<h2 id="Queue">Converting Arbitrarily-Ordered Invocations</h2>

<p>In some programs, it is not important to receive the results of
computations in any particular order, usually because either the order of
these results is irrelevant, or because the results provide "positional"
information which lets them be handled in an appropriate way. Consider the
following Python code:</p>

<pre>
t = time.time()

# Initialise an array.

results = [0] * N * N

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        i2, j2, result = calculate(i, j)
        results[i2*N+j2] = result

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple2.py</code> file.)</p>

<p>Here, a result array is initialised first and each computation is performed
sequentially. A significant difference from the previous examples is the
return value of the <code>calculate</code> function: the position details
corresponding to <code>i</code> and <code>j</code> are returned alongside the
result. This is of limited value in the above code because the order of the
computations and the reception of results is fixed; consequently, we get no
benefit from parallelisation in the above example.</p>

<p>We can bring the benefits of parallel processing to the above program with
the following code:</p>

<pre id="simple_managed_queue">
t = time.time()

# Initialise the communications queue with a limit on the number of
# channels/processes.

<strong>queue = pprocess.Queue(limit=limit)</strong>

# Initialise an array.

results = [0] * N * N

# Wrap the calculate function and manage it.

<strong>calc = queue.manage(pprocess.MakeParallel(calculate))</strong>

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        <strong>calc(i, j)</strong>

# Store the results as they arrive.

print "Finishing..."
<strong>for i, j, result in queue:</strong>
    <strong>results[i*N+j] = result</strong>

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_managed_queue.py</code> file.)</p>

<p>This revised code employs a <code>pprocess.Queue</code> object whose
purpose is to collect the results of computations and to make them available
in the order in which they were received. The code collecting results has been
moved into a separate loop independent of the original computation loop,
taking advantage of the more relevant "positional" information emerging from
the queue.</p>
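<p>The reassembly loop depends only on the "positional" information carried
with each result, so the order in which results arrive is immaterial. This
standalone sketch demonstrates the principle, with a shuffled list standing in
for the queue of a parallel run:</p>

```python
# Results carrying their own (i, j) positions can be stored correctly no
# matter what order they arrive in; a shuffled list stands in for the queue.
import random

N = 3
expected = [i * N + j for i in range(N) for j in range(N)]

arrivals = [(i, j, i * N + j) for i in range(N) for j in range(N)]
random.shuffle(arrivals)

results = [0] * N * N
for i, j, result in arrivals:
    results[i*N+j] = result

assert results == expected
```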

<h3 id="Exchange">Replacing Queues with Exchanges</h3>

<p>We can take this example further, illustrating some of the mechanisms
employed by <code>pprocess</code>. Instead of collecting results in a queue,
we can define a class containing a method which is called when new results
arrive:</p>

<pre>
class MyExchange(pprocess.Exchange):

    "Parallel convenience class containing the array assignment operation."

    def store_data(self, ch):
        i, j, result = ch.receive()
        self.D[i*N+j] = result
</pre>

<p>This code exposes the channel paradigm which is used throughout
<code>pprocess</code> and is available to applications, if desired. The effect
of the method is the storage of a result received through the channel in an
attribute of the object. The following code shows how this class can be used,
with differences to the previous program illustrated:</p>

<pre>
t = time.time()

# Initialise the communications exchange with a limit on the number of
# channels/processes.

<strong>exchange = MyExchange(limit=limit)</strong>

# Initialise an array - it is stored in the exchange to permit automatic
# assignment of values as the data arrives.

<strong>results = exchange.D = [0] * N * N</strong>

# Wrap the calculate function and manage it.

calc = <strong>exchange</strong>.manage(pprocess.MakeParallel(calculate))

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        calc(i, j)

# Wait for the results.

print "Finishing..."
<strong>exchange.finish()</strong>

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_managed.py</code> file.)</p>

<p>The main visible differences between this and the previous program are the
storage of the result array in the exchange, the removal of the queue
consumption code from the main program (the act of storing values now resides
in the exchange's <code>store_data</code> method), and the need to call the
<code>finish</code> method on the <code>MyExchange</code> object so that we do
not try to access the results too soon. One underlying benefit not visible in
the above code is that we no longer need to accumulate results in a queue or
other structure so that they may be processed and assigned to the correct
positions in the result array.</p>
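<p>The contract for an <code>Exchange</code> subclass is small:
<code>store_data</code> is invoked with a channel that has data ready, and the
method decides what to do with the received value. This sketch exercises that
contract directly; <code>FakeChannel</code> and <code>MyExchangeSketch</code>
are stand-ins for illustration, not the library's actual classes:</p>

```python
# Sketch of the store_data contract: given a channel with data ready,
# receive the value and store it. FakeChannel is a stand-in for a real
# pprocess channel.
class FakeChannel:
    def __init__(self, value):
        self.value = value
    def receive(self):
        return self.value

N = 3

class MyExchangeSketch:
    def __init__(self):
        self.D = [0] * N * N
    def store_data(self, ch):
        i, j, result = ch.receive()
        self.D[i*N+j] = result

exchange = MyExchangeSketch()
exchange.store_data(FakeChannel((1, 2, 5)))
assert exchange.D[1*N+2] == 5
```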

<h3 id="channel">Using Channels in Callables</h3>

<p>For the curious, we may remove some of the remaining conveniences of the
above program to expose other features of <code>pprocess</code>. First, we
define a slightly modified version of the <code>calculate</code> function:</p>

<pre>
def calculate(ch, i, j):

    """
    A supposedly time-consuming calculation on 'i' and 'j', using 'ch' to
    communicate with the parent process.
    """

    time.sleep(delay)
    ch.send((i, j, i * N + j))
</pre>

<p>This function accepts a channel, <code>ch</code>, through which results
will be sent, and through which other values could potentially be received,
although we choose not to do so here. The program using this function is as
follows, with differences to the previous program illustrated:</p>

<pre>
t = time.time()

# Initialise the communications exchange with a limit on the number of
# channels/processes.

exchange = MyExchange(limit=limit)

# Initialise an array - it is stored in the exchange to permit automatic
# assignment of values as the data arrives.

results = exchange.D = [0] * N * N

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        <strong>exchange.start(calculate, i, j)</strong>

# Wait for the results.

print "Finishing..."
exchange.finish()

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_start.py</code> file.)</p>

<p>Here, we have discarded two conveniences: the wrapping of callables using
<code>MakeParallel</code>, which lets us use functions without providing any
channel parameters, and the management of callables using the
<code>manage</code> method on queues, exchanges, and so on. The
<code>start</code> method still calls the provided callable, but using a
different notation from that employed previously.</p>

<h2 id="create">Converting Inline Computations</h2>

<p>Although many programs employ functions and other useful abstractions which
can be treated as parallelisable units, some programs perform computations
"inline", meaning that the code responsible appears directly within a loop or
related control-flow construct. Consider the following code:</p>

<pre>
t = time.time()

# Initialise an array.

results = [0] * N * N

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        time.sleep(delay)
        results[i*N+j] = i * N + j

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple.py</code> file.)</p>

<p>To simulate "work", as in the different versions of the
<code>calculate</code> function, we use the <code>time.sleep</code> function
(which does not actually do work, and which will cause a process to be
descheduled in most cases, but which simulates the delay associated with work
being done). This inline work, which must be performed sequentially in the
above program, can be performed in parallel in a somewhat modified version of
the program:</p>

<pre>
t = time.time()

# Initialise the results using a map with a limit on the number of
# channels/processes.

<strong>results = pprocess.Map(limit=limit)</strong>

# Perform the work.
# NOTE: Could use the with statement in the loop to package the
# NOTE: try...finally functionality.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        <strong>ch = results.create()</strong>
        <strong>if ch:</strong>
            <strong>try: # Calculation work.</strong>

                time.sleep(delay)
                <strong>ch.send(i * N + j)</strong>

            <strong>finally: # Important finalisation.</strong>

                <strong>pprocess.exit(ch)</strong>

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_create_map.py</code> file.)</p>

<p>Although seemingly more complicated, the bulk of the changes in this
modified program are focused on obtaining a channel object, <code>ch</code>,
at the point where the computations are performed, and the wrapping of the
computation code in a <code>try</code>...<code>finally</code> statement which
ensures that the process associated with the channel exits when the
computation is complete. In order for the results of these computations to be
collected, a <code>pprocess.Map</code> object is used, since it will maintain
the results in the same order as the initiation of the computations which
produced them.</p>
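<p>Underneath, <code>create</code> forks the current process, handing the
child a channel back to its parent, and <code>pprocess.exit</code> ensures the
child terminates once its work is done. The plumbing can be sketched directly
with <code>os.fork</code> and a pipe (a bare-bones, POSIX-only illustration;
the library's actual channels additionally handle object serialisation and
multiplexing of many children):</p>

```python
# Bare-bones sketch of the fork-based plumbing behind create()/exit():
# the child performs the "work", writes its result to a pipe and exits;
# the parent reads the result back. POSIX only.
import os
import pickle

r, w = os.pipe()
pid = os.fork()

if pid == 0:
    # Child process: do the "work", send the result, and always exit.
    try:
        os.write(w, pickle.dumps(2 * 21))
    finally:
        os._exit(0)

# Parent process: collect the child's result and reap the child.
os.close(w)
result = pickle.loads(os.read(r, 1024))
os.waitpid(pid, 0)
assert result == 42
```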

<h2 id="MakeReusable">Reusing Processes in Parallel Programs</h2>

<p>One notable aspect of the above programs when parallelised is that each
invocation of a computation in parallel creates a new process in which the
computation is to be performed, regardless of whether existing processes had
just finished producing results and could theoretically have been asked to
perform new computations. In other words, processes were created and destroyed
instead of being reused.</p>

<p>However, we can request that processes be reused for computations by
enabling the <code>reuse</code> feature of exchange-like objects and employing
suitable reusable callables. Consider this modified version of the <a
href="#simple_managed_map">simple_managed_map</a> program:</p>

<pre>
t = time.time()

# Initialise the results using a map with a limit on the number of
# channels/processes.

results = pprocess.Map(limit=limit<strong>, reuse=1</strong>)

# Wrap the calculate function and manage it.

calc = results.manage(pprocess.Make<strong>Reusable</strong>(calculate))

# Perform the work.

print "Calculating..."
for i in range(0, N):
    for j in range(0, N):
        calc(i, j)

# Show the results.

for i in range(0, N):
    for result in results[i*N:i*N+N]:
        print result,
    print

print "Time taken:", time.time() - t
</pre>

<p>(This code in context with <code>import</code> statements and functions is
found in the <code>examples/simple_manage_map_reusable.py</code> file.)</p>

<p>By indicating that processes and channels shall be reused, and by wrapping
the <code>calculate</code> function with the necessary support, the
computations may be performed in parallel using a pool of processes instead of
creating a new process for each computation and then discarding it, only to
create a new process for the next computation.</p>
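<p>Where <code>MakeParallel</code> yields a callable that performs one
computation and finishes, <code>MakeReusable</code> yields one that stays in
its process, accepting fresh arguments over the channel and sending back a
result for each. The loop at the heart of that idea can be sketched in a
single process; <code>LoopChannel</code> and the sentinel convention here are
illustrative stand-ins and differ from the library's actual protocol:</p>

```python
# Conceptual sketch of a reusable worker: the wrapped function loops,
# receiving argument tuples and sending results, until a sentinel arrives.
# LoopChannel and the None sentinel are illustrative stand-ins only.
class LoopChannel:
    def __init__(self, jobs):
        self.jobs = list(jobs)
        self.sent = []
    def receive(self):
        return self.jobs.pop(0) if self.jobs else None
    def send(self, value):
        self.sent.append(value)

def make_reusable_sketch(fn):
    def worker(channel):
        while True:
            args = channel.receive()
            if args is None:  # sentinel: no more work
                break
            channel.send(fn(*args))
    return worker

N = 10

def calculate(i, j):
    return i * N + j

ch = LoopChannel([(0, 1), (2, 3), (4, 5)])
make_reusable_sketch(calculate)(ch)
assert ch.sent == [1, 23, 45]
```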
paulb@124 | 603 | |
paul@159 | 604 | <h2 id="continuous">Supporting Continuous Processes in Parallel Programs</h2> |
paul@159 | 605 | |
paul@159 | 606 | <p>Although reusable processes offer the opportunity to invoke a callable over |
paul@159 | 607 | and over within the same created process, they do not fully support the |
paul@159 | 608 | potential of the underlying mechanisms in <code>pprocess</code>: created |
paul@159 | 609 | processes can communicate multiple values to the creating process and can |
paul@159 | 610 | theoretically run within the same callable forever.</p> |
paul@159 | 611 | |
paul@159 | 612 | <p>Consider this modified form of the <code>calculate</code> function:</p> |
paul@159 | 613 | |
paul@159 | 614 | <pre> |
paul@159 | 615 | def calculate(ch, i): |
paul@159 | 616 | |
paul@159 | 617 | """ |
paul@159 | 618 | A supposedly time-consuming calculation on 'i'. |
paul@159 | 619 | """ |
paul@159 | 620 | |
paul@159 | 621 | for j in range(0, N): |
paul@159 | 622 | time.sleep(delay) |
paul@159 | 623 | ch.send((i, j, i * N + j)) |
paul@159 | 624 | </pre> |
paul@159 | 625 | |
paul@159 | 626 | <p>This function accepts a channel <code>ch</code> together with an argument |
paul@159 | 627 | <code>i</code> corresponding to an entire row of the input array, as opposed |
paul@159 | 628 | to having two arguments (<code>i</code> and <code>j</code>) corresponding to a |
paul@159 | 629 | single cell in the input array. In this function, a series of calculations are |
paul@159 | 630 | performed and a number of values are returned through the channel, without the |
paul@159 | 631 | function terminating until all values have been returned for the row data.</p> |
paul@159 | 632 | |
paul@159 | 633 | <p>To use this modified function, a modified version of the |
paul@159 | 634 | <a href="#simple_managed_queue">simple_managed_queue</a> program is used:</p> |
paul@159 | 635 | |
paul@159 | 636 | <pre> |
paul@159 | 637 | t = time.time() |
paul@159 | 638 | |
paul@159 | 639 | # Initialise the communications queue with a limit on the number of |
paul@159 | 640 | # channels/processes. |
paul@159 | 641 | |
paul@159 | 642 | queue = pprocess.Queue(limit=limit<strong>, continuous=1</strong>) |
paul@159 | 643 | |
paul@159 | 644 | # Initialise an array. |
paul@159 | 645 | |
paul@159 | 646 | results = [0] * N * N |
paul@159 | 647 | |
paul@159 | 648 | # Manage the calculate function. |
paul@159 | 649 | |
paul@159 | 650 | calc = queue.manage(<strong>calculate</strong>) |
paul@159 | 651 | |
paul@159 | 652 | # Perform the work. |
paul@159 | 653 | |
paul@159 | 654 | print "Calculating..." |
paul@159 | 655 | for i in range(0, N): |
paul@159 | 656 | <strong>calc(i)</strong> |
paul@159 | 657 | |
paul@159 | 658 | # Store the results as they arrive. |
paul@159 | 659 | |
paul@159 | 660 | print "Finishing..." |
paul@159 | 661 | for i, j, result in queue: |
paul@159 | 662 | results[i*N+j] = result |
paul@159 | 663 | |
paul@159 | 664 | # Show the results. |
paul@159 | 665 | |
paul@159 | 666 | for i in range(0, N): |
paul@159 | 667 | for result in results[i*N:i*N+N]: |
paul@159 | 668 | print result, |
paul@159 | 669 | print |
paul@159 | 670 | |
paul@159 | 671 | print "Time taken:", time.time() - t |
paul@159 | 672 | </pre> |
paul@159 | 673 | |
paul@159 | 674 | <p>(This code in context with <code>import</code> statements and functions is |
paul@159 | 675 | found in the <code>examples/simple_continuous_queue.py</code> file.)</p> |
paul@159 | 676 | |
paul@165 | 677 | <p>Although the inner loop in the work section has been relocated to the |
paul@159 | 678 | <code>calculate</code> function, the queue still receives outputs from that |
paul@159 | 679 | function with positional information and a result for the result array. Thus, |
paul@159 | 680 | no change is needed for the retrieval of the results: they arrive in the queue |
paul@159 | 681 | as before.</p> |
paul@159 | 682 | |
paulb@145 | 683 | <h2 id="BackgroundCallable">Performing Computations in Background Processes</h2> |
paulb@145 | 684 | |
paulb@145 | 685 | <p>Occasionally, it is desirable to initiate time-consuming computations and to |
paulb@145 | 686 | not only leave such processes running in the background, but to be able to detach |
paulb@145 | 687 | the creating process from them completely, potentially terminating the creating |
paulb@145 | 688 | process altogether, and yet also be able to collect the results of the created |
paulb@145 | 689 | processes at a later time, potentially in another completely different process. |
paulb@145 | 690 | For such situations, we can make use of the <code>BackgroundCallable</code> |
paulb@145 | 691 | class, which converts a parallel-aware callable into a callable which will run |
paulb@145 | 692 | in a background process when invoked.</p> |
paulb@145 | 693 | |
paulb@145 | 694 | <p>Consider this excerpt from a modified version of the <a |
paulb@145 | 695 | href="#simple_managed_queue">simple_managed_queue</a> program:</p> |
paulb@145 | 696 | |
paulb@145 | 697 | <pre> |
paulb@145 | 698 | <strong>def task():</strong> |
paulb@145 | 699 | |
paulb@145 | 700 | # Initialise the communications queue with a limit on the number of |
paulb@145 | 701 | # channels/processes. |
paulb@145 | 702 | |
paulb@145 | 703 | queue = pprocess.Queue(limit=limit) |
paulb@145 | 704 | |
paulb@145 | 705 | # Initialise an array. |
paulb@145 | 706 | |
paulb@145 | 707 | results = [0] * N * N |
paulb@145 | 708 | |
paulb@145 | 709 | # Wrap the calculate function and manage it. |
paulb@145 | 710 | |
paulb@145 | 711 | calc = queue.manage(pprocess.MakeParallel(calculate)) |
paulb@145 | 712 | |
paulb@145 | 713 | # Perform the work. |
paulb@145 | 714 | |
paulb@145 | 715 | print "Calculating..." |
paulb@145 | 716 | for i in range(0, N): |
paulb@145 | 717 | for j in range(0, N): |
paulb@145 | 718 | calc(i, j) |
paulb@145 | 719 | |
paulb@145 | 720 | # Store the results as they arrive. |
paulb@145 | 721 | |
paulb@145 | 722 | print "Finishing..." |
paulb@145 | 723 | for i, j, result in queue: |
paulb@145 | 724 | results[i*N+j] = result |
paulb@145 | 725 | |
paulb@145 | 726 | <strong>return results</strong> |
paulb@145 | 727 | </pre> |
paulb@145 | 728 | |
paulb@145 | 729 | <p>Here, we have converted the main program into a function, and instead of |
paulb@145 | 730 | printing out the results, we return the results list from the function.</p> |
paulb@145 | 731 | |
paulb@145 | 732 | <p>Now, let us consider the new main program (with the relevant mechanisms |
paulb@145 | 733 | highlighted):</p> |
paulb@145 | 734 | |
paulb@145 | 735 | <pre> |
paulb@145 | 736 | t = time.time() |
paulb@145 | 737 | |
paulb@145 | 738 | if "--reconnect" not in sys.argv: |
paulb@145 | 739 | |
paulb@145 | 740 | # Wrap the computation and manage it. |
paulb@145 | 741 | |
paulb@145 | 742 | <strong>ptask = pprocess.BackgroundCallable("task.socket", pprocess.MakeParallel(task))</strong> |
paulb@145 | 743 | |
paulb@145 | 744 | # Perform the work. |
paulb@145 | 745 | |
paulb@145 | 746 | ptask() |
paulb@145 | 747 | |
paulb@145 | 748 | # Discard the callable. |
paulb@145 | 749 | |
paulb@145 | 750 | del ptask |
paulb@145 | 751 | print "Discarded the callable." |
paulb@145 | 752 | |
paulb@145 | 753 | if "--start" not in sys.argv: |
paulb@145 | 754 | |
paulb@145 | 755 | # Open a queue and reconnect to the task. |
paulb@145 | 756 | |
paulb@145 | 757 | print "Opening a queue." |
paulb@145 | 758 | <strong>queue = pprocess.BackgroundQueue("task.socket")</strong> |
paulb@145 | 759 | |
paulb@145 | 760 | # Wait for the results. |
paulb@145 | 761 | |
paulb@145 | 762 | print "Waiting for persistent results" |
paulb@145 | 763 | for results in queue: |
paulb@145 | 764 | pass # should only be one element |
paulb@145 | 765 | |
paulb@145 | 766 | # Show the results. |
paulb@145 | 767 | |
paulb@145 | 768 | for i in range(0, N): |
paulb@145 | 769 | for result in results[i*N:i*N+N]: |
paulb@145 | 770 | print result, |
paulb@145 | 771 | print |
paulb@145 | 772 | |
paulb@145 | 773 | print "Time taken:", time.time() - t |
paulb@145 | 774 | </pre> |
paulb@145 | 775 | |
paulb@145 | 776 | <p>(This code in context with <code>import</code> statements and functions is |
paulb@145 | 777 | found in the <code>examples/simple_background_queue.py</code> file.)</p> |
paulb@145 | 778 | |
paulb@145 | 779 | <p>This new main program has two parts: the part which initiates the |
paulb@145 | 780 | computation, and the part which connects to the computation in order to collect |
paulb@145 | 781 | the results. Both parts can be run in the same process, and this should result |
paulb@145 | 782 | in similar behaviour to that of the original |
paulb@145 | 783 | <a href="#simple_managed_queue">simple_managed_queue</a> program.</p> |
paulb@145 | 784 | |
paulb@145 | 785 | <p>In the above program, however, we are free to specify <code>--start</code> as |
paulb@145 | 786 | an option when running the program, and the result of this is merely to initiate |
paulb@145 | 787 | the computation in a background process, using <code>BackgroundCallable</code> |
paulb@145 | 788 | to obtain a callable which, when invoked, creates the background process and |
paulb@145 | 789 | runs the computation. After doing this, the program will then exit, but it will |
paulb@145 | 790 | leave the computation running as a collection of background processes, and a |
paulb@145 | 791 | special file called <code>task.socket</code> will exist in the current working |
paulb@145 | 792 | directory.</p> |
paulb@145 | 793 | |
paulb@145 | 794 | <p>When the above program is run using the <code>--reconnect</code> option, an |
paulb@145 | 795 | attempt will be made to reconnect to the background processes already created by |
paulb@145 | 796 | attempting to contact them using the previously created <code>task.socket</code> |
paulb@145 | 797 | special file (which is, in fact, a UNIX-domain socket); this being done using |
paulb@145 | 798 | the <code>BackgroundQueue</code> function which will handle the incoming results |
paulb@145 | 799 | in a fashion similar to that of a <code>Queue</code> object. Since only one |
paulb@145 | 800 | result is returned by the computation (as defined by the <code>return</code> |
paulb@145 | 801 | statement in the <code>task</code> function), we need only expect one element to |
paulb@145 | 802 | be collected by the queue: a list containing all of the results produced in the |
paulb@145 | 803 | computation.</p> |
paulb@145 | 804 | |
paulb@145 | 805 | <h2 id="ManagingBackgroundProcesses">Managing Several Background Processes</h2> |
paulb@145 | 806 | |
paulb@145 | 807 | <p>In the above example, a single background process was used to manage a number |
paulb@145 | 808 | of other processes, with all of them running in the background. However, it can |
paulb@145 | 809 | be desirable to manage more than one background process.</p> |
paulb@145 | 810 | |
paulb@145 | 811 | <p>Consider this excerpt from a modified version of the <a |
paulb@145 | 812 | href="#simple_managed_queue">simple_managed_queue</a> program:</p> |
paulb@145 | 813 | |
paulb@145 | 814 | <pre> |
paulb@145 | 815 | <strong>def task(i):</strong> |
paulb@145 | 816 | |
paulb@145 | 817 | # Initialise the communications queue with a limit on the number of |
paulb@145 | 818 | # channels/processes. |
paulb@145 | 819 | |
paulb@145 | 820 | queue = pprocess.Queue(limit=limit) |
paulb@145 | 821 | |
paulb@145 | 822 | # Initialise an array. |
paulb@145 | 823 | |
paulb@145 | 824 | results = [0] * N |
paulb@145 | 825 | |
paulb@145 | 826 | # Wrap the calculate function and manage it. |
paulb@145 | 827 | |
paulb@145 | 828 | calc = queue.manage(pprocess.MakeParallel(calculate)) |
paulb@145 | 829 | |
paulb@145 | 830 | # Perform the work. |
paulb@145 | 831 | |
paulb@145 | 832 | print "Calculating..." |
paulb@145 | 833 | <strong>for j in range(0, N):</strong> |
paulb@145 | 834 | <strong>calc(i, j)</strong> |
paulb@145 | 835 | |
paulb@145 | 836 | # Store the results as they arrive. |
paulb@145 | 837 | |
paulb@145 | 838 | print "Finishing..." |
paulb@145 | 839 | <strong>for i, j, result in queue:</strong> |
paulb@145 | 840 | <strong>results[j] = result</strong> |
paulb@145 | 841 | |
paulb@145 | 842 | <strong>return i, results</strong> |
paulb@145 | 843 | </pre> |
paulb@145 | 844 | |
paulb@145 | 845 | <p>Just as we see in the previous example, a function called <code>task</code> |
paulb@145 | 846 | has been defined to hold a background computation, and this function returns a |
paulb@145 | 847 | portion of the results. However, unlike the previous example or the original |
paulb@145 | 848 | example, the scope of the results of the computation collected in the function |
paulb@145 | 849 | have been changed: here, only results for calculations involving a certain value |
paulb@145 | 850 | of <code>i</code> are collected, with the particular value of <code>i</code> |
paulb@145 | 851 | returned along with the appropriate portion of the results.</p> |
paulb@145 | 852 | |
paulb@145 | 853 | <p>Now, let us consider the new main program (with the relevant mechanisms |
paulb@145 | 854 | highlighted):</p> |
paulb@145 | 855 | |
paulb@145 | 856 | <pre> |
paulb@145 | 857 | t = time.time() |
paulb@145 | 858 | |
paulb@145 | 859 | if "--reconnect" not in sys.argv: |
paulb@145 | 860 | |
paulb@145 | 861 | # Wrap the computation and manage it. |
paulb@145 | 862 | |
paulb@145 | 863 | <strong>ptask = pprocess.MakeParallel(task)</strong> |
paulb@145 | 864 | |
paulb@145 | 865 | <strong>for i in range(0, N):</strong> |
paulb@145 | 866 | |
paulb@145 | 867 | # Make a distinct callable for each part of the computation. |
paulb@145 | 868 | |
paulb@145 | 869 | <strong>ptask_i = pprocess.BackgroundCallable("task-%d.socket" % i, ptask)</strong> |
paulb@145 | 870 | |
paulb@145 | 871 | # Perform the work. |
paulb@145 | 872 | |
paulb@145 | 873 | <strong>ptask_i(i)</strong> |
paulb@145 | 874 | |
paulb@145 | 875 | # Discard the callable. |
paulb@145 | 876 | |
paulb@145 | 877 | del ptask |
paulb@145 | 878 | print "Discarded the callable." |
paulb@145 | 879 | |
paulb@145 | 880 | if "--start" not in sys.argv: |
paulb@145 | 881 | |
paulb@145 | 882 | # Open a queue and reconnect to the task. |
paulb@145 | 883 | |
paulb@145 | 884 | print "Opening a queue." |
paulb@145 | 885 | <strong>queue = pprocess.PersistentQueue()</strong> |
paulb@145 | 886 | <strong>for i in range(0, N):</strong> |
paulb@145 | 887 | <strong>queue.connect("task-%d.socket" % i)</strong> |
paulb@145 | 888 | |
paulb@145 | 889 | # Initialise an array. |
paulb@145 | 890 | |
paulb@145 | 891 | <strong>results = [0] * N</strong> |
paulb@145 | 892 | |
paulb@145 | 893 | # Wait for the results. |
paulb@145 | 894 | |
paulb@145 | 895 | print "Waiting for persistent results" |
paulb@145 | 896 | <strong>for i, result in queue:</strong> |
paulb@145 | 897 | <strong>results[i] = result</strong> |
paulb@145 | 898 | |
paulb@145 | 899 | # Show the results. |
paulb@145 | 900 | |
paulb@145 | 901 | for i in range(0, N): |
paulb@145 | 902 | <strong>for result in results[i]:</strong> |
paulb@145 | 903 | print result, |
paulb@145 | 904 | print |
paulb@145 | 905 | |
paulb@145 | 906 | print "Time taken:", time.time() - t |
paulb@145 | 907 | </pre> |
paulb@145 | 908 | |
paulb@145 | 909 | <p>(This code in context with <code>import</code> statements and functions is |
paulb@145 | 910 | found in the <code>examples/simple_persistent_queue.py</code> file.)</p> |
paulb@145 | 911 | |
paulb@145 | 912 | <p>In the first section, the process of making a parallel-aware callable is as |
paulb@145 | 913 | expected, but instead of then invoking a single background version of such a |
paulb@145 | 914 | callable, as in the previous example, we create a version for each value of |
paulb@145 | 915 | <code>i</code> (using <code>BackgroundCallable</code>) and then invoke each one. |
paulb@145 | 916 | The result of this is a total of <code>N</code> background processes, each |
paulb@145 | 917 | running an invocation of the <code>task</code> function with a distinct value of |
paulb@145 | 918 | <code>i</code> (which in turn perform computations), and each employing a |
paulb@145 | 919 | UNIX-domain socket for communication with a name of the form |
paulb@145 | 920 | <code>task-<em>i</em>.socket</code>.</p> |
paulb@145 | 921 | |
paulb@145 | 922 | <p>In the second section, since we now have more than one background process, we |
paulb@145 | 923 | must find a way to monitor them after reconnecting to them; to achieve this, a |
paulb@145 | 924 | <code>PersistentQueue</code> is created, which acts like a regular |
paulb@145 | 925 | <code>Queue</code> object but is instead focused on handling persistent |
paulb@145 | 926 | communications. Upon connecting the queue to each of the previously created |
paulb@145 | 927 | UNIX-domain sockets, the queue acts like a regular <code>Queue</code> and |
paulb@145 | 928 | exposes received results through an iterator. Here, the principal difference |
paulb@145 | 929 | from previous examples is the structure of results: instead of collecting each |
paulb@145 | 930 | individual value in a flat <code>i</code> by <code>j</code> array, a list is |
paulb@145 | 931 | returned for each value of <code>i</code> and is stored directly in another |
paulb@145 | 932 | list.</p> |
paulb@145 | 933 | |
paulb@145 | 934 | <h3>Applications of Background Computations</h3> |
paulb@145 | 935 | |
paulb@145 | 936 | <p>Background computations are useful because they provide flexibility in the |
paulb@145 | 937 | way the results can be collected. One area in which they can be useful is Web |
paulb@145 | 938 | programming, where a process handling an incoming HTTP request may need to |
paulb@145 | 939 | initiate a computation but then immediately send output to the Web client - such |
paulb@145 | 940 | as a page indicating that the computation is "in progress" - without having to |
paulb@145 | 941 | wait for the computation or to allocate resources to monitor it. Moreover, in |
paulb@145 | 942 | some Web architectures, notably those employing the Common Gateway Interface |
paulb@145 | 943 | (CGI), it is necessary for a process handling an incoming request to terminate |
paulb@145 | 944 | before its output will be sent to clients. By using a |
paulb@145 | 945 | <code>BackgroundCallable</code>, a Web server process can initiate a |
paulb@145 | 946 | communication, and then subsequent server processes can be used to reconnect to |
paulb@145 | 947 | the background computation and to wait efficiently for results.</p> |
paulb@145 | 948 | |
paulb@145 | 949 | <h2 id="Summary">Summary</h2> |
paulb@124 | 950 | |
paulb@124 | 951 | <p>The following table indicates the features used in converting one |
paulb@124 | 952 | sequential example program to another parallel program:</p> |
paulb@124 | 953 | |
paulb@124 | 954 | <table border="1" cellspacing="0" cellpadding="5"> |
paulb@124 | 955 | <thead> |
paulb@124 | 956 | <tr> |
paulb@124 | 957 | <th>Sequential Example</th> |
paulb@124 | 958 | <th>Parallel Example</th> |
paulb@124 | 959 | <th>Features Used</th> |
paulb@124 | 960 | </tr> |
paulb@124 | 961 | </thead> |
paulb@124 | 962 | <tbody> |
paulb@124 | 963 | <tr> |
paulb@124 | 964 | <td>simple_map</td> |
paulb@124 | 965 | <td>simple_pmap</td> |
paulb@124 | 966 | <td>pmap</td> |
paulb@124 | 967 | </tr> |
paulb@124 | 968 | <tr> |
paulb@124 | 969 | <td>simple1</td> |
paulb@124 | 970 | <td>simple_managed_map</td> |
paulb@124 | 971 | <td>MakeParallel, Map, manage</td> |
paulb@124 | 972 | </tr> |
paulb@124 | 973 | <tr> |
paul@159 | 974 | <td rowspan="6">simple2</td> |
paulb@124 | 975 | <td>simple_managed_queue</td> |
paulb@124 | 976 | <td>MakeParallel, Queue, manage</td> |
paulb@124 | 977 | </tr> |
paulb@124 | 978 | <tr> |
paul@159 | 979 | <td>simple_continuous_queue</td> |
paul@159 | 980 | <td>Queue, manage (continuous)</td> |
paul@159 | 981 | </tr> |
paul@159 | 982 | <tr> |
paulb@124 | 983 | <td>simple_managed</td> |
paulb@124 | 984 | <td>MakeParallel, Exchange (subclass), manage, finish</td> |
paulb@124 | 985 | </tr> |
paulb@124 | 986 | <tr> |
paulb@124 | 987 | <td>simple_start</td> |
paulb@124 | 988 | <td>Channel, Exchange (subclass), start, finish</td> |
paulb@124 | 989 | </tr> |
paulb@124 | 990 | <tr> |
paulb@145 | 991 | <td>simple_background_queue</td> |
paulb@145 | 992 | <td>MakeParallel, BackgroundCallable, BackgroundQueue</td> |
paulb@145 | 993 | </tr> |
paulb@145 | 994 | <tr> |
paulb@145 | 995 | <td>simple_persistent_queue</td> |
paulb@145 | 996 | <td>MakeParallel, BackgroundCallable, PersistentQueue</td> |
paulb@145 | 997 | </tr> |
paulb@145 | 998 | <tr> |
paulb@124 | 999 | <td>simple</td> |
paulb@124 | 1000 | <td>simple_create_map</td> |
paulb@124 | 1001 | <td>Channel, Map, create, exit</td> |
paulb@124 | 1002 | </tr> |
paulb@124 | 1003 | </tbody> |
paulb@124 | 1004 | </table> |
paulb@124 | 1005 | |
paulb@124 | 1006 | </body> |
paulb@124 | 1007 | </html> |