1Slices(3) User Contributed Perl Documentation Slices(3)
2
3
4
6 PDL::Slices -- Indexing, slicing, and dicing
7
9 use PDL;
10 $x = ones(3,3);
11 $y = $x->slice('-1:0,(1)');
12 $c = $x->dummy(2);
13
15 This package provides many of the powerful PerlDL core index
16 manipulation routines. These routines mostly allow two-way data flow,
17 so you can modify your data in the most convenient representation. For
18 example, you can make a 1000x1000 unit matrix with
19
20 $x = zeroes(1000,1000);
21 $x->diagonal(0,1) ++;
22
23 which is quite efficient. See PDL::Indexing and PDL::Tips for more
24 examples.
25
26 Slicing is so central to the PDL language that a special compile-time
27 syntax has been introduced to handle it compactly; see PDL::NiceSlice
28 for details.
29
30 PDL indexing and slicing functions usually include two-way data flow,
31 so that you can separate the actions of reshaping your data structures
32 and modifying the data themselves. Two special methods, "copy" and
33 "sever", help you control the data flow connection between related
34 variables.
35
36 $y = $x->slice("1:3"); # Slice maintains a link between $x and $y.
37 $y += 5; # $x is changed!
38
39 If you want to force a physical copy and no data flow, you can copy or
40 sever the slice expression:
41
42 $y = $x->slice("1:3")->copy;
43 $y += 5; # $x is not changed.
44
45 $y = $x->slice("1:3")->sever;
46 $y += 5; # $x is not changed.
47
48 The difference between "sever" and "copy" is that sever acts on (and
49 returns) its argument, while copy produces a disconnected copy. If you
50 say
51
52 $y = $x->slice("1:3");
53 $c = $y->sever;
54
55 then the variables $y and $c point to the same object but with "->copy"
56 they would not.
57
59 index
60 Signature: (a(n); indx ind(); [oca] c())
61
62 "index", "index1d", and "index2d" provide rudimentary index
63 indirection.
64
65 $c = index($source,$ind);
66 $c = index1d($source,$ind);
67 $c = index2d($source2,$ind1,$ind2);
68
69 use the $ind variables as indices to look up values in $source. The
70 three routines broadcast slightly differently.
71
72 • "index" uses direct broadcasting for 1-D indexing across the 0 dim
73 of $source. It can broadcast over source broadcast dims or index
74 broadcast dims, but not (easily) both: If $source has more than 1
75 dimension and $ind has more than 0 dimensions, they must agree in a
76 broadcasting sense.
77
78 • "index1d" uses a single active dim in $ind to produce a list of
79 indexed values in the 0 dim of the output - it is useful for
80 collapsing $source by indexing with a single row of values along
81 $source's 0 dimension. The output has the same number of dims as
82 $source. The 0 dim of the output has size 1 if $ind is a scalar,
83 and the same size as the 0 dim of $ind if it is not. If $ind and
84 $source both have more than 1 dim, then all dims higher than 0 must
85 agree in a broadcasting sense.
86
87 • "index2d" works like "index" but uses separate ndarrays for X and Y
88 coordinates. For more general N-dimensional indexing, see the
89 PDL::NiceSlice syntax or PDL::Slices (in particular "slice",
90 "indexND", and "range").
91
92 These functions are two-way, i.e. after
93
94 $c = $x->index(pdl[0,5,8]);
95 $c .= pdl [0,2,4];
96
97 the changes in $c will flow back to $x.
98
99 "index" provids simple broadcasting: multiple-dimensioned arrays are
100 treated as collections of 1-D arrays, so that
101
102 $x = xvals(10,10)+10*yvals(10,10);
103 $y = $x->index(3);
104 $c = $x->index(9-xvals(10));
105
106 puts a single column from $x into $y, and puts a single element from
107 each column of $x into $c. If you want to extract multiple columns
108 from an array in one operation, see "dice" or "indexND".
109
110 index barfs if any of the index values are bad.
111
112 index1d
113 Signature: (a(n); indx ind(m); [oca] c(m))
114
115 "index", "index1d", and "index2d" provide rudimentary index
116 indirection.
117
118 $c = index($source,$ind);
119 $c = index1d($source,$ind);
120 $c = index2d($source2,$ind1,$ind2);
121
122 use the $ind variables as indices to look up values in $source. The
123 three routines broadcast slightly differently.
124
125 • "index" uses direct broadcasting for 1-D indexing across the 0 dim
126 of $source. It can broadcast over source broadcast dims or index
127 broadcast dims, but not (easily) both: If $source has more than 1
128 dimension and $ind has more than 0 dimensions, they must agree in a
129 broadcasting sense.
130
131 • "index1d" uses a single active dim in $ind to produce a list of
132 indexed values in the 0 dim of the output - it is useful for
133 collapsing $source by indexing with a single row of values along
134 $source's 0 dimension. The output has the same number of dims as
135 $source. The 0 dim of the output has size 1 if $ind is a scalar,
136 and the same size as the 0 dim of $ind if it is not. If $ind and
137 $source both have more than 1 dim, then all dims higher than 0 must
138 agree in a broadcasting sense.
139
140 • "index2d" works like "index" but uses separate ndarrays for X and Y
141 coordinates. For more general N-dimensional indexing, see the
142 PDL::NiceSlice syntax or PDL::Slices (in particular "slice",
143 "indexND", and "range").
144
145 These functions are two-way, i.e. after
146
147 $c = $x->index(pdl[0,5,8]);
148 $c .= pdl [0,2,4];
149
150 the changes in $c will flow back to $x.
151
152 "index" provids simple broadcasting: multiple-dimensioned arrays are
153 treated as collections of 1-D arrays, so that
154
155 $x = xvals(10,10)+10*yvals(10,10);
156 $y = $x->index(3);
157 $c = $x->index(9-xvals(10));
158
159 puts a single column from $x into $y, and puts a single element from
160 each column of $x into $c. If you want to extract multiple columns
161 from an array in one operation, see "dice" or "indexND".
162
163 index1d propagates BAD index elements to the output variable.
164
165 index2d
166 Signature: (a(na,nb); indx inda(); indx indb(); [oca] c())
167
168 "index", "index1d", and "index2d" provide rudimentary index
169 indirection.
170
171 $c = index($source,$ind);
172 $c = index1d($source,$ind);
173 $c = index2d($source2,$ind1,$ind2);
174
175 use the $ind variables as indices to look up values in $source. The
176 three routines broadcast slightly differently.
177
178 • "index" uses direct broadcasting for 1-D indexing across the 0 dim
179 of $source. It can broadcast over source broadcast dims or index
180 broadcast dims, but not (easily) both: If $source has more than 1
181 dimension and $ind has more than 0 dimensions, they must agree in a
182 broadcasting sense.
183
184 • "index1d" uses a single active dim in $ind to produce a list of
185 indexed values in the 0 dim of the output - it is useful for
186 collapsing $source by indexing with a single row of values along
187 $source's 0 dimension. The output has the same number of dims as
188 $source. The 0 dim of the output has size 1 if $ind is a scalar,
189 and the same size as the 0 dim of $ind if it is not. If $ind and
190 $source both have more than 1 dim, then all dims higher than 0 must
191 agree in a broadcasting sense.
192
193 • "index2d" works like "index" but uses separate ndarrays for X and Y
194 coordinates. For more general N-dimensional indexing, see the
195 PDL::NiceSlice syntax or PDL::Slices (in particular "slice",
196 "indexND", and "range").
197
198 These functions are two-way, i.e. after
199
200 $c = $x->index(pdl[0,5,8]);
201 $c .= pdl [0,2,4];
202
203 the changes in $c will flow back to $x.
204
205 "index" provids simple broadcasting: multiple-dimensioned arrays are
206 treated as collections of 1-D arrays, so that
207
208 $x = xvals(10,10)+10*yvals(10,10);
209 $y = $x->index(3);
210 $c = $x->index(9-xvals(10));
211
212 puts a single column from $x into $y, and puts a single element from
213 each column of $x into $c. If you want to extract multiple columns
214 from an array in one operation, see "dice" or "indexND".
215
216 index2d barfs if either of the index values are bad.
217
218 indexNDb
219 Backwards-compatibility alias for indexND
220
221 indexND
222 Find selected elements in an N-D ndarray, with optional boundary handling
223
224 $out = $source->indexND( $index, [$method] )
225
226 $source = 10*xvals(10,10) + yvals(10,10);
227 $index = pdl([[2,3],[4,5]],[[6,7],[8,9]]);
228 print $source->indexND( $index );
229
230 [
231 [23 45]
232 [67 89]
233 ]
234
235 IndexND collapses $index by lookup into $source. The 0th dimension of
236 $index is treated as coordinates in $source, and the return value has
237 the same dimensions as the rest of $index. The returned elements are
238 looked up from $source. Dataflow works -- propagated assignment flows
239 back into $source.
240
241 IndexND and IndexNDb were originally separate routines but they are
242 both now implemented as a call to "range", and have identical syntax to
243 one another.
244
245 SEE ALSO:
246
247 "whichND" in PDL::Primitive returns N-D indices into a multidimensional
248 PDL, suitable for feeding to this.
249
250 rangeb
251 Signature: (P(); C(); pdl *ind_pdl; SV *size_sv; SV *boundary_sv)
252
253 Engine for "range"
254
255 Same calling convention as "range", but you must supply all parameters.
256 "rangeb" is marginally faster as it makes a direct PP call, avoiding
257 the perl argument-parsing step.
258
259 range
260 Extract selected chunks from a source ndarray, with boundary conditions
261
262 $out = $source->range($index,[$size,[$boundary]])
263
264 Returns elements or rectangular slices of the original ndarray, indexed
265 by the $index ndarray. $source is an N-dimensional ndarray, and $index
266 is an ndarray whose first dimension has size up to N. Each row of
267 $index is treated as coordinates of a single value or chunk from
268 $source, specifying the location(s) to extract.
269
270 If you specify a single index location, then range is essentially an
271 expensive slice, with controllable boundary conditions.
272
273 INPUTS
274
275 $index and $size can be ndarrays or array refs such as you would feed
276 to zeroes and its ilk. If $index's 0th dimension has size higher than
277 the number of dimensions in $source, then $source is treated as though
278 it had trivial dummy dimensions of size 1, up to the required size to
279 be indexed by $index -- so if your source array is 1-D and your index
280 array is a list of 3-vectors, you get two dummy dimensions of size 1 on
281 the end of your source array.
282
283 You can extract single elements or N-D rectangular ranges from $source,
284 by setting $size. If $size is undef or zero, then you get a single
285 sample for each row of $index. This behavior is similar to "indexNDb",
286 which is in fact implemented as a call to "range".
287
288 If $size is positive then you get a range of values from $source at
289 each location, and the output has extra dimensions allocated for them.
290 $size can be a scalar, in which case it applies to all dimensions, or
291 an N-vector, in which case each element is applied independently to the
292 corresponding dimension in $source. See below for details.
293
294 $boundary is a number, string, or list ref indicating the type of
295 boundary conditions to use when ranges reach the edge of $source. If
296 you specify no boundary conditions the default is to forbid boundary
297 violations on all axes. If you specify exactly one boundary condition,
298 it applies to all axes. If you specify more (as elements of a list
299 ref, or as a packed string, see below), then they apply to dimensions
300 in the order in which they appear, and the last one applies to all
301 subsequent dimensions. (This is less difficult than it sounds; see the
302 examples below).
303
304 0 (synonyms: 'f','forbid') (default)
305 Ranges are not allowed to cross the boundary of the original PDL.
306 Disallowed ranges throw an error. The errors are thrown at
307 evaluation time, not at the time of the range call (this is the same
308 behavior as "slice").
309
310 1 (synonyms: 't','truncate')
311 Values outside the original ndarray get BAD if you've got bad value
312 support compiled into your PDL and set the badflag for the source
313 PDL; or 0 if you haven't (you must set the badflag if you want BADs
314 for out of bound values, otherwise you get 0). Reverse dataflow
315 works OK for the portion of the child that is in-bounds. The out-
316 of-bounds part of the child is reset to (BAD|0) during each dataflow
317 operation, but execution continues.
318
319 2 (synonyms: 'e','x','extend')
320 Values that would be outside the original ndarray point instead to
321 the nearest allowed value within the ndarray. See the CAVEAT below
322 on mappings that are not single valued.
323
324 3 (synonyms: 'p','periodic')
325 Periodic boundary conditions apply: the numbers in $index are
326 applied, strict-modulo the corresponding dimensions of $source.
327 This is equivalent to duplicating the $source ndarray throughout N-D
328 space. See the CAVEAT below about mappings that are not single
329 valued.
330
331 4 (synonyms: 'm','mirror')
332 Mirror-reflection periodic boundary conditions apply. See the
333 CAVEAT below about mappings that are not single valued.
334
335 The boundary condition identifiers all begin with unique characters, so
336 you can feed in multiple boundary conditions as either a list ref or a
337 packed string. (The packed string is marginally faster to run). For
338 example, the four expressions [0,1], ['forbid','truncate'], ['f','t'],
339 and 'ft' all specify that violating the boundary in the 0th dimension
340 throws an error, and all other dimensions get truncated.
341
342 If you feed in a single string, it is interpreted as a packed boundary
343 array if all of its characters are valid boundary specifiers (e.g.
344 'pet'), but as a single word-style specifier if they are not (e.g.
345 'forbid').
346
347 OUTPUT
348
349 The output broadcasts over both $index and $source. Because implicit
350 broadcasting can happen in a couple of ways, a little thought is
351 needed. The returned dimension list is stacked up like this:
352
353 (index broadcast dims), (index dims (size)), (source broadcast dims)
354
355 The first few dims of the output correspond to the extra dims of $index
356 (beyond the 0 dim). They allow you to pick out individual ranges from a
357 large, broadcasted collection.
358
359 The middle few dims of the output correspond to the size dims specified
360 in $size, and contain the range of values that is extracted at each
361 location in $source. Every nonzero element of $size is copied to the
362 dimension list here, so that if you feed in (for example) "$size =
363 [2,0,1]" you get an index dim list of "(2,1)".
364
365 The last few dims of the output correspond to extra dims of $source
366 beyond the number of dims indexed by $index. These dims act like
367 ordinary broadcast dims, because adding more dims to $source just tacks
368 extra dims on the end of the output. Each source broadcast dim ranges
369 over the entire corresponding dim of $source.
370
371 Dataflow: Dataflow is bidirectional.
372
373 Examples: Here are basic examples of "range" operation, showing how to
374 get ranges out of a small matrix. The first few examples show
375 extraction and selection of individual chunks. The last example shows
376 how to mark loci in the original matrix (using dataflow).
377
378 pdl> $src = 10*xvals(10,5)+yvals(10,5)
379 pdl> print $src->range([2,3]) # Cut out a single element
380 23
381 pdl> print $src->range([2,3],1) # Cut out a single 1x1 block
382 [
383 [23]
384 ]
385 pdl> print $src->range([2,3], [2,1]) # Cut a 2x1 chunk
386 [
387 [23 33]
388 ]
389 pdl> print $src->range([[2,3]],[2,1]) # Trivial list of 1 chunk
390 [
391 [
392 [23]
393 [33]
394 ]
395 ]
396 pdl> print $src->range([[2,3],[0,1]], [2,1]) # two 2x1 chunks
397 [
398 [
399 [23 1]
400 [33 11]
401 ]
402 ]
403 pdl> # A 2x2 collection of 2x1 chunks
404 pdl> print $src->range([[[1,1],[2,2]],[[2,3],[0,1]]],[2,1])
405 [
406 [
407 [
408 [11 22]
409 [23 1]
410 ]
411 [
412 [21 32]
413 [33 11]
414 ]
415 ]
416 ]
417 pdl> $src = xvals(5,3)*10+yvals(5,3)
418 pdl> print $src->range(3,1) # Broadcast over y dimension in $src
419 [
420 [30]
421 [31]
422 [32]
423 ]
424
425 pdl> $src = zeroes(5,4);
426 pdl> $src->range(pdl([2,3],[0,1]),pdl(2,1)) .= xvals(2,2,1) + 1
427 pdl> print $src
428 [
429 [0 0 0 0 0]
430 [2 2 0 0 0]
431 [0 0 0 0 0]
432 [0 0 1 1 0]
433 ]
434
435 CAVEAT: It's quite possible to select multiple ranges that intersect.
436 In that case, modifying the ranges doesn't have a guaranteed result in
437 the original PDL -- the result is an arbitrary choice among the valid
438 values. For some things that's OK; but for others it's not. In
439 particular, this doesn't work:
440
441 pdl> $photon_list = PDL::RandVar->new->sample(500)->reshape(2,250)*10
442 pdl> $histogram = zeroes(10,10)
443 pdl> $histogram->range($photon_list,1)++; #not what you wanted
444
445 The reason is that if two photons land in the same bin, then that bin
446 doesn't get incremented twice. (That may get fixed in a later
447 version...)
448
449 PERMISSIVE RANGING: If $index has too many dimensions compared to
450 $source, then $source is treated as though it had dummy dimensions of
451 size 1, up to the required number of dimensions. These virtual dummy
452 dimensions have the usual boundary conditions applied to them.
453
454 If the 0 dimension of $index is ludicrously large (if its size is more
455 than 5 greater than the number of dims in the source PDL) then range
456 will insist that you specify a size in every dimension, to make sure
457 that you know what you're doing. That catches a common error with
458 range usage: confusing the initial dim (which is usually small) with
459 another index dim (perhaps of size 1000).
460
461 If the index variable is Empty, then range() always returns the Empty
462 PDL. If the index variable is not Empty, indexing it always yields a
463 boundary violation. All non-barfing conditions are treated as
464 truncation, since there are no actual data to return.
465
466 EFFICIENCY: Because "range" isn't an affine transformation (it involves
467 lookup into a list of N-D indices), it is somewhat memory-inefficient
468 for long lists of ranges, and keeping dataflow open is much slower than
469 for affine transformations (which don't have to copy data around).
470
471 Doing operations on small subfields of a large range is inefficient
472 because the engine must flow the entire range back into the original
473 PDL with every atomic perl operation, even if you only touch a single
474 element. One way to speed up such code is to sever your range, so that
475 PDL doesn't have to copy the data with each operation, then copy the
476 elements explicitly at the end of your loop. Here's an example that
477 labels each region in a range sequentially, using many small operations
478 rather than a single xvals assignment:
479
480 ### How to make a collection of small ops run fast with range...
481 $x = $data->range($index, $sizes, $bound)->sever;
482 $aa = $data->range($index, $sizes, $bound);
483 $x($_ - 1) .= $_ for 1..$x->nelem; # Lots of little ops
484 $aa .= $x;
485
486 "range" is a perl front-end to a PP function, "rangeb". Calling
487 "rangeb" is marginally faster but requires that you include all
488 arguments.
489
490 DEVEL NOTES
491
492 * index broadcast dimensions are effectively clumped internally. This
493 makes it easier to loop over the index array but a little more brain-
494 bending to tease out the algorithm.
495
496 rangeb processes bad values. It will set the bad-value flag of all
497 output ndarrays if the flag is set for any of the input ndarrays.
498
499 rld
500 Signature: (indx a(n); b(n); [o]c(m))
501
502 Run-length decode a vector
503
504 Given a vector $x of the numbers of instances of values $y, run-length
505 decode to $c.
506
507 rld($x,$y,$c=null);
508
509 rld does not process bad values. It will set the bad-value flag of all
510 output ndarrays if the flag is set for any of the input ndarrays.
511
512 rle
513 Signature: (c(n); indx [o]a(m); [o]b(m))
514
515 Run-length encode a vector
516
517 Given vector $c, generate a vector $x with the number of each element,
518 and a vector $y of the unique values. New in PDL 2.017, only the
519 elements up to the first instance of 0 in $x are returned, which makes
520 the common use case of a 1-dimensional $c simpler. For broadcast
521 operation, $x and $y will be large enough to hold the largest row of
522 $y, and only the elements up to the first instance of 0 in each row of
523 $x should be considered.
524
525 $c = floor(4*random(10));
526 rle($c,$x=null,$y=null);
527 #or
528 ($x,$y) = rle($c);
529
530 #for $c of shape [10, 4]:
531 $c = floor(4*random(10,4));
532 ($x,$y) = rle($c);
533
534 #to see the results of each row one at a time:
535 foreach (0..$c->dim(1)-1){
536 my ($as,$bs) = ($x(:,($_)),$y(:,($_)));
537 my ($ta,$tb) = where($as,$bs,$as!=0); #only the non-zero elements of $x
538 print $c(:,($_)) . " rle==> " , ($ta,$tb) , "\trld==> " . rld($ta,$tb) . "\n";
539 }
540
541 # the inverse of (chance of all 6 3d6 rolls being >= each possible sum)
542 ($nrolls, $ndice, $dmax) = (6, 3, 6);
543 ($x, $x1) = (allaxisvals(($dmax) x $ndice)+1)->sumover->flat->qsort->rle;
544 $y = $x->cumusumover;
545 $yprob1x = $y->slice('-1:0')->double / $y->slice('(-1)');
546 $z = cat($x1, 1 / $yprob1x**$nrolls)->transpose;
547
548 rle does not process bad values. It will set the bad-value flag of all
549 output ndarrays if the flag is set for any of the input ndarrays.
550
551 rlevec
552 Signature: (c(M,N); indx [o]a(N); [o]b(M,N))
553
554 Run-length encode a set of vectors.
555
556 Higher-order rle(), for use with qsortvec().
557
558 Given set of vectors $c, generate a vector $a with the number of
559 occurrences of each element (where an "element" is a vector of length
560 $M occurring in $c), and a set of vectors $b containing the unique
561 values. As for rle(), only the elements up to the first instance of 0
562 in $a should be considered.
563
564 Can be used together with clump() to run-length encode "values" of
565 arbitrary dimensions. Can be used together with rotate(), cat(),
566 append(), and qsortvec() to count N-grams over a 1d PDL.
567
568 See also: "rle", "qsortvec" in PDL::Ufunc, "uniqvec" in PDL::Primitive
569 Contributed by Bryan Jurish <moocow@cpan.org>.
570
571 rlevec does not process bad values. It will set the bad-value flag of
572 all output ndarrays if the flag is set for any of the input ndarrays.
573
574 rldvec
575 Signature: (indx a(N); b(M,N); [o]c(M,N))
576
577 Run-length decode a set of vectors, akin to a higher-order rld().
578
579 Given a vector $a() of the number of occurrences of each row, and a set
580 $c() of row-vectors each of length $M, run-length decode to $c().
581
582 Can be used together with clump() to run-length decode "values" of
583 arbitrary dimensions.
584
585 See also: "rld". Contributed by Bryan Jurish <moocow@cpan.org>.
586
587 rldvec does not process bad values. It will set the bad-value flag of
588 all output ndarrays if the flag is set for any of the input ndarrays.
589
590 rleseq
591 Signature: (c(N); indx [o]a(N); [o]b(N))
592
593 Run-length encode a vector of subsequences.
594
595 Given a vector of $c() of concatenated variable-length, variable-offset
596 subsequences, generate a vector $a containing the length of each
597 subsequence and a vector $b containing the subsequence offsets. As for
598 rle(), only the elements up to the first instance of 0 in $a should be
599 considered.
600
601 See also "rle". Contributed by Bryan Jurish <moocow@cpan.org>.
602
603 rleseq does not process bad values. It will set the bad-value flag of
604 all output ndarrays if the flag is set for any of the input ndarrays.
605
606 rldseq
607 Signature: (indx a(N); b(N); [o]c(M))
608
609 Run-length decode a subsequence vector.
610
611 Given a vector $a() of sequence lengths and a vector $b() of
612 corresponding offsets, decode concatenation of subsequences to $c(), as
613 for:
614
615 $c = null;
616 $c = $c->append($b($_)+sequence($a->type,$a($_))) foreach (0..($N-1));
617
618 See also: "rld". Contributed by Bryan Jurish <moocow@cpan.org>.
619
620 rldseq does not process bad values. It will set the bad-value flag of
621 all output ndarrays if the flag is set for any of the input ndarrays.
622
623 rleND
624 Signature: (data(@vdims,N); int [o]counts(N); [o]elts(@vdims,N))
625
626 Run-length encode a set of (sorted) n-dimensional values.
627
628 Generalization of rle() and vv_rlevec(): given set of values $data,
629 generate a vector $counts with the number of occurrences of each
630 element (where an "element" is a matrix of dimensions @vdims occurring
631 as a sequential run over the final dimension in $data), and a set of
632 vectors $elts containing the elements which begin a run. Really just a
633 wrapper for clump() and rlevec().
634
635 See also: "rle", "rlevec". Contributed by Bryan Jurish
636 <moocow@cpan.org>.
637
638 rldND
639 Signature: (int counts(N); elts(@vdims,N); [o]data(@vdims,N);)
640
641 Run-length decode a set of (sorted) n-dimensional values.
642
643 Generalization of rld() and rldvec(): given a vector $counts() of the
644 number of occurrences of each @vdims-dimensioned element, and a set
645 $elts() of @vdims-dimensioned elements, run-length decode to $data().
646
647 Really just a wrapper for clump() and rldvec().
648
649 See also: "rld", "rldvec". Contributed by Bryan Jurish
650 <moocow@cpan.org>.
651
652 xchg
653 Signature: (P(); C(); PDL_Indx n1; PDL_Indx n2)
654
655 exchange two dimensions
656
657 Negative dimension indices count from the end.
658
659 The command
660
661 $y = $x->xchg(2,3);
662
663 creates $y to be like $x except that the dimensions 2 and 3 are
664 exchanged with each other i.e.
665
666 $y->at(5,3,2,8) == $x->at(5,3,8,2)
667
668 xchg does not process bad values. It will set the bad-value flag of
669 all output ndarrays if the flag is set for any of the input ndarrays.
670
671 reorder
672 Re-orders the dimensions of a PDL based on the supplied list.
673
674 Similar to the "xchg" method, this method re-orders the dimensions of a
675 PDL. While the "xchg" method swaps the position of two dimensions, the
676 reorder method can change the positions of many dimensions at once.
677
678 # Completely reverse the dimension order of a 6-Dim array.
679 $reOrderedPDL = $pdl->reorder(5,4,3,2,1,0);
680
681 The argument to reorder is an array representing where the current
682 dimensions should go in the new array. In the above usage, the argument
683 to reorder "(5,4,3,2,1,0)" indicates that the old dimensions ($pdl's
684 dims) should be re-arranged to make the new pdl ($reOrderPDL) according
685 to the following:
686
687 Old Position New Position
688 ------------ ------------
689 5 0
690 4 1
691 3 2
692 2 3
693 1 4
694 0 5
695
696 You do not need to specify all dimensions, only a complete set starting
697 at position 0. (Extra dimensions are left where they are). This
698 means, for example, that you can reorder() the X and Y dimensions of an
699 image, and not care whether it is an RGB image with a third dimension
700 running across color plane.
701
702 Example:
703
704 pdl> $x = sequence(5,3,2); # Create a 3-d Array
705 pdl> p $x
706 [
707 [
708 [ 0 1 2 3 4]
709 [ 5 6 7 8 9]
710 [10 11 12 13 14]
711 ]
712 [
713 [15 16 17 18 19]
714 [20 21 22 23 24]
715 [25 26 27 28 29]
716 ]
717 ]
718 pdl> p $x->reorder(2,1,0); # Reverse the order of the 3-D PDL
719 [
720 [
721 [ 0 15]
722 [ 5 20]
723 [10 25]
724 ]
725 [
726 [ 1 16]
727 [ 6 21]
728 [11 26]
729 ]
730 [
731 [ 2 17]
732 [ 7 22]
733 [12 27]
734 ]
735 [
736 [ 3 18]
737 [ 8 23]
738 [13 28]
739 ]
740 [
741 [ 4 19]
742 [ 9 24]
743 [14 29]
744 ]
745 ]
746
747 The above is a simple example that could be duplicated by calling
748 "$x->xchg(0,2)", but it demonstrates the basic functionality of
749 reorder.
750
751 As this is an index function, any modifications to the result PDL will
752 change the parent.
753
754 mv
755 Signature: (P(); C(); PDL_Indx n1; PDL_Indx n2)
756
757 move a dimension to another position
758
759 The command
760
761 $y = $x->mv(4,1);
762
763 creates $y to be like $x except that the dimension 4 is moved to the
764 place 1, so:
765
766 $y->at(1,2,3,4,5,6) == $x->at(1,5,2,3,4,6);
767
768 The other dimensions are moved accordingly. Negative dimension indices
769 count from the end.
770
771 mv does not process bad values. It will set the bad-value flag of all
772 output ndarrays if the flag is set for any of the input ndarrays.
773
774 using
775 Returns array of column numbers requested
776
777 line $pdl->using(1,2);
778
779 Plot, as a line, column 1 of $pdl vs. column 2
780
781 pdl> $pdl = rcols("file");
782 pdl> line $pdl->using(1,2);
783
784 diagonal
785 Signature: (P(); C(); PDL_Indx whichdims[])
786
787 Returns the multidimensional diagonal over the specified dimensions.
788
789 The diagonal is placed at the first (by number) dimension that is
790 diagonalized. The other diagonalized dimensions are removed. So if $x
791 has dimensions "(5,3,5,4,6,5)" then after
792
793 $d = $x->diagonal(dim1, dim2,...)
794
795 $y = $x->diagonal(0,2,5);
796
797 the ndarray $y has dimensions "(5,3,4,6)" and "$y->at(2,1,0,1)" refers
798 to "$x->at(2,1,2,0,1,2)".
799
800 NOTE: diagonal doesn't handle broadcastids correctly. XXX FIX
801
802 pdl> $x = zeroes(3,3,3);
803 pdl> ($y = $x->diagonal(0,1))++;
804 pdl> p $x
805 [
806 [
807 [1 0 0]
808 [0 1 0]
809 [0 0 1]
810 ]
811 [
812 [1 0 0]
813 [0 1 0]
814 [0 0 1]
815 ]
816 [
817 [1 0 0]
818 [0 1 0]
819 [0 0 1]
820 ]
821 ]
822
823 diagonal does not process bad values. It will set the bad-value flag
824 of all output ndarrays if the flag is set for any of the input
825 ndarrays.
826
827 lags
828 Signature: (P(); C(); PDL_Indx nthdim;PDL_Indx step;PDL_Indx n)
829
830 Returns an ndarray of lags to parent.
831
832 Usage:
833
834 $lags = $x->lags($nthdim,$step,$nlags);
835
836 I.e. if $x contains
837
838 [0,1,2,3,4,5,6,7]
839
840 then
841
842 $y = $x->lags(0,2,2);
843
844 is a (5,2) matrix
845
846 [2,3,4,5,6,7]
847 [0,1,2,3,4,5]
848
849 This order of returned indices is kept because the function is called
850 "lags" i.e. the nth lag is n steps behind the original.
851
852 $step and $nlags must be positive. $nthdim can be negative and will
853 then be counted from the last dim backwards in the usual way (-1 = last
854 dim).
855
856 lags does not process bad values. It will set the bad-value flag of
857 all output ndarrays if the flag is set for any of the input ndarrays.
858
859 splitdim
860 Signature: (P(); C(); PDL_Indx nthdim;PDL_Indx nsp)
861
862 Splits a dimension in the parent ndarray (opposite of clump). As of
863 2.076, throws exception if non-divisible "nsp" given, and can give
864 negative "nthdim" which then counts backwards.
865
866 After
867
868 $y = $x->splitdim(2,3);
869
870 the expression
871
872 $y->at(6,4,m,n,3,6) == $x->at(6,4,m+3*n)
873
874 is always true ("m" has to be less than 3).
875
876 splitdim does not process bad values. It will set the bad-value flag
877 of all output ndarrays if the flag is set for any of the input
878 ndarrays.
879
880 rotate
881 Signature: (x(n); indx shift(); [oca]y(n))
882
883 Shift vector elements along with wrap. Flows data back&forth.
884
885 rotate does not process bad values. It will set the bad-value flag of
886 all output ndarrays if the flag is set for any of the input ndarrays.
887
888 broadcastI
889 Signature: (P(); C(); PDL_Indx id; PDL_Indx whichdims[])
890
891 internal
892
893 Put some dimensions to a broadcastid.
894
895 $y = $x->broadcastI(0,1,5); # broadcast over dims 1,5 in id 1
896
897 broadcastI does not process bad values. It will set the bad-value flag
898 of all output ndarrays if the flag is set for any of the input
899 ndarrays.
900
901 unbroadcast
902 Signature: (P(); C(); PDL_Indx atind)
903
904 All broadcasted dimensions are made real again.
905
906 See [TBD Doc] for details and examples.
907
908 unbroadcast does not process bad values. It will set the bad-value
909 flag of all output ndarrays if the flag is set for any of the input
910 ndarrays.
911
912 dice
913 Dice rows/columns/planes out of a PDL using indexes for each dimension.
914
915 This function can be used to extract irregular subsets along many
916 dimension of a PDL, e.g. only certain rows in an image, or planes in a
917 cube. This can of course be done with the usual dimension tricks but
918 this saves having to figure it out each time!
919
920 This method is similar in functionality to the "slice" method, but
921 "slice" requires that contiguous ranges or ranges with constant offset
922 be extracted. ( i.e. "slice" requires ranges of the form "1,2,3,4,5" or
923 "2,4,6,8,10"). Because of this restriction, "slice" is more memory
924 efficient and slightly faster than dice
925
926 $slice = $data->dice([0,2,6],[2,1,6]); # Dicing a 2-D array
927
928 The arguments to dice are arrays (or 1D PDLs) for each dimension in the
929 PDL. These arrays are used as indexes to which rows/columns/cubes,etc
930 to dice-out (or extract) from the $data PDL.
931
932 Use "X" to select all indices along a given dimension (compare also
933 mslice). As usual (in slicing methods) trailing dimensions can be
934 omitted implying "X"'es for those.
935
936 pdl> $x = sequence(10,4)
937 pdl> p $x
938 [
939 [ 0 1 2 3 4 5 6 7 8 9]
940 [10 11 12 13 14 15 16 17 18 19]
941 [20 21 22 23 24 25 26 27 28 29]
942 [30 31 32 33 34 35 36 37 38 39]
943 ]
944 pdl> p $x->dice([1,2],[0,3]) # Select columns 1,2 and rows 0,3
945 [
946 [ 1 2]
947 [31 32]
948 ]
949 pdl> p $x->dice(X,[0,3])
950 [
951 [ 0 1 2 3 4 5 6 7 8 9]
952 [30 31 32 33 34 35 36 37 38 39]
953 ]
954 pdl> p $x->dice([0,2,5])
955 [
956 [ 0 2 5]
957 [10 12 15]
958 [20 22 25]
959 [30 32 35]
960 ]
961
962 As this is an index function, any modifications to the slice will
963 change the parent (use the ".=" operator).
964
965 dice_axis
966 Dice rows/columns/planes from a single PDL axis (dimension) using index
967 along a specified axis
968
969 This function can be used to extract irregular subsets along any
970 dimension, e.g. only certain rows in an image, or planes in a cube.
971 This can of course be done with the usual dimension tricks but this
972 saves having to figure it out each time!
973
974 $slice = $data->dice_axis($axis,$index);
975
976 pdl> $x = sequence(10,4)
977 pdl> $idx = pdl(1,2)
978 pdl> p $x->dice_axis(0,$idx) # Select columns
979 [
980 [ 1 2]
981 [11 12]
982 [21 22]
983 [31 32]
984 ]
985 pdl> $t = $x->dice_axis(1,$idx) # Select rows
986 pdl> $t.=0
987 pdl> p $x
988 [
989 [ 0 1 2 3 4 5 6 7 8 9]
990 [ 0 0 0 0 0 0 0 0 0 0]
991 [ 0 0 0 0 0 0 0 0 0 0]
992 [30 31 32 33 34 35 36 37 38 39]
993 ]
994
995 The trick to using this is that the index selects elements along the
996 dimensions specified, so if you have a 2D image "axis=0" will select
997 certain "X" values - i.e. extract columns
998
999 As this is an index function, any modifications to the slice will
1000 change the parent.
1001
1002 slice
1003 Signature: (P(); C(); pdl_slice_args *arglist)
1004
1005 $slice = $data->slice([2,3],'x',[2,2,0],"-1:1:-1", "*3");
1006
1007 Extract rectangular slices of an ndarray, from a string specifier, an
1008 array ref specifier, or a combination.
1009
1010 "slice" is the main method for extracting regions of PDLs and
1011 manipulating their dimensionality. You can call it directly or via the
1012 NiceSlice source prefilter that extends Perl syntax to include array
1013 slicing.
1014
1015 "slice" can extract regions along each dimension of a source PDL,
1016 subsample or reverse those regions, dice each dimension by selecting a
1017 list of locations along it, or basic PDL indexing routine. The
1018 selected subfield remains connected to the original PDL via dataflow.
1019 In most cases this neither allocates more memory nor slows down
1020 subsequent operations on either of the two connected PDLs.
1021
1022 You pass in a list of arguments. Each term in the list controls the
1023 disposition of one axis of the source PDL and/or returned PDL. Each
1024 term can be a string-format cut specifier, a list ref that gives the
1025 same information without recourse to string manipulation, or a PDL with
1026 up to 1 dimension giving indices along that axis that should be
1027 selected.
1028
1029 If you want to pass in a single string specifier for the entire
1030 operation, you can pass in a comma-delimited list as the first
1031 argument. "slice" detects this condition and splits the string into a
1032 regular argument list. This calling style is fully backwards
1033 compatible with "slice" calls from before PDL 2.006.
1034
1035 STRING SYNTAX
1036
1037 If a particular argument to "slice" is a string, it is parsed as a
1038 selection, an affine slice, or a dummy dimension depending on the form.
1039 Leading or trailing whitespace in any part of each specifier is ignored
1040 (though it is not ignored within numbers).
1041
1042 '', ":", or "X" -- keep
1043 The empty string, ":", or "X" cause the entire corresponding
1044 dimension to be kept unchanged.
1045
1046 "<n>" -- selection
1047 A single number alone causes a single index to be selected from the
1048 corresponding dimension. The dimension is kept (and reduced to size
1049 1) in the output.
1050
1051 "(<n>)" -- selection and collapse
1052 A single number in parenthesis causes a single index to be selected
1053 from the corresponding dimension. The dimension is discarded
1054 (completely eliminated) in the output.
1055
1056 "<n>:<m>" -- select an inclusive range
1057 Two numbers separated by a colon selects a range of values from the
1058 corresponding axis, e.g. "3:4" selects elements 3 and 4 along the
1059 corresponding axis, and reduces that axis to size 2 in the output.
1060 Both numbers are regularized so that you can address the last
1061 element of the axis with an index of -1 . If, after
1062 regularization, the two numbers are the same, then exactly one
1063 element gets selected (just like the "<n>" case). If, after
1064 regulariation, the second number is lower than the first, then the
1065 resulting slice counts down rather than up -- e.g. "-1:0" will
1066 return the entire axis, in reversed order.
1067
1068 "<n>:<m>:<s>" -- select a range with explicit step
1069 If you include a third parameter, it is the stride of the extracted
1070 range. For example, "0:-1:2" will sample every other element across
1071 the complete dimension. Specifying a stride of 1 prevents
1072 autoreversal -- so to ensure that your slice is *always* forward you
1073 can specify, e.g., "2:$n:1". In that case, an "impossible" slice
1074 gets an Empty PDL (with 0 elements along the corresponding
1075 dimension), so you can generate an Empty PDL with a slice of the
1076 form "2:1:1".
1077
1078 "*<n>" -- insert a dummy dimension
1079 Dummy dimensions aren't present in the original source and are
1080 "mocked up" to match dimensional slots, by repeating the data in the
1081 original PDL some number of times. An asterisk followed by a number
1082 produces a dummy dimension in the output, for example *2 will
1083 generate a dimension of size 2 at the corresponding location in the
1084 output dim list. Omitting the number (and using just an asterisk)
1085 inserts a dummy dimension of size 1.
1086
1087 ARRAY REF SYNTAX
1088
1089 If you feed in an ARRAY ref as a slice term, then it can have 0-3
1090 elements. The first element is the start of the slice along the
1091 corresponding dim; the second is the end; and the third is the
1092 stepsize. Different combinations of inputs give the same flexibility
1093 as the string syntax.
1094
1095 "[]" - keep dim intact
1096 An empty ARRAY ref keeps the entire corresponding dim
1097
1098 "[ 'X' ]" - keep dim intact
1099 "[ '*',$n ]" - generate a dummy dim of size $n
1100 If $n is missing, you get a dummy dim of size 1.
1101
1102 "[ $dex, , 0 ]" - collapse and discard dim
1103 $dex must be a single value. It is used to index the source, and
1104 the corresponding dimension is discarded.
1105
1106 "[ $start, $end ]" - collect inclusive slice
1107 In the simple two-number case, you get a slice that runs up or down
1108 (as appropriate) to connect $start and $end.
1109
1110 "[ $start, $end, $inc ]" - collect inclusive slice
1111 The three-number case works exactly like the three-number string
1112 case above.
1113
1114 PDL args for dicing
1115
1116 If you pass in a 0- or 1-D PDL as a slicing argument, the corresponding
1117 dimension is "diced" -- you get one position along the corresponding
1118 dim, per element of the indexing PDL, e.g. "$x->slice( pdl(3,4,9))"
1119 gives you elements 3, 4, and 9 along the 0 dim of $x.
1120
1121 Because dicing is not an affine transformation, it is slower than
1122 direct slicing even though the syntax is convenient.
1123
1124 $x->slice('1:3'); # return the second to fourth elements of $x
1125 $x->slice('3:1'); # reverse the above
1126 $x->slice('-2:1'); # return last-but-one to second elements of $x
1127
1128 $x->slice([1,3]); # Same as above three calls, but using array ref syntax
1129 $x->slice([3,1]);
1130 $x->slice([-2,1]);
1131
1132 slice does not process bad values. It will set the bad-value flag of
1133 all output ndarrays if the flag is set for any of the input ndarrays.
1134
1136 For the moment, you can't slice one of the zero-length dims of an empty
1137 ndarray. It is not clear how to implement this in a way that makes
1138 sense.
1139
1140 Many types of index errors are reported far from the indexing operation
1141 that caused them. This is caused by the underlying architecture:
1142 slice() sets up a mapping between variables, but that mapping isn't
1143 tested for correctness until it is used (potentially much later).
1144
1146 Copyright (C) 1997 Tuomas J. Lukka. Contributions by Craig DeForest,
1147 deforest@boulder.swri.edu. Documentation contributions by David
1148 Mertens. All rights reserved. There is no warranty. You are allowed to
1149 redistribute this software / documentation under certain conditions.
1150 For details, see the file COPYING in the PDL distribution. If this file
1151 is separated from the PDL distribution, the copyright notice should be
1152 included in the file.
1153
1154
1155
1156perl v5.38.0 2023-07-21 Slices(3)