1LaTeXML::Package(3)   User Contributed Perl Documentation  LaTeXML::Package(3)
2
3
4

NAME

6       "LaTeXML::Package" - Support for package implementations and document
7       customization.
8

SYNOPSIS

10       This package defines and exports most of the procedures users will need
11       to customize or extend LaTeXML. The LaTeXML implementation of some
12       package might look something like the following, but see the installed
13       "LaTeXML/Package" directory for realistic examples.
14
15         package LaTeXML::Package::pool;  # to put new subs & variables in common pool
16         use LaTeXML::Package;            # to load these definitions
17         use strict;                      # good style
18         use warnings;
19         #
20         # Load "anotherpackage"
21         RequirePackage('anotherpackage');
22         #
23         # A simple macro, just like in TeX
24         DefMacro('\thesection', '\thechapter.\roman{section}');
25         #
26         # A constructor defines how a control sequence generates XML:
27         DefConstructor('\thanks{}', "<ltx:thanks>#1</ltx:thanks>");
28         #
29         # And a simple environment ...
30         DefEnvironment('{abstract}','<abstract>#body</abstract>');
31         #
32         # A math  symbol \Real to stand for the Reals:
33         DefMath('\Real', "\x{211D}", role=>'ID');
34         #
35         # Or a semantic floor:
36         DefMath('\floor{}','\left\lfloor#1\right\rfloor');
37         #
38         # More esoteric ...
39         # Use a RelaxNG schema
40         RelaxNGSchema("MySchema");
41         # Or use a special DocType if you have to:
42         # DocType("rootelement",
43         #         "-//Your Site//Your DocType",'your.dtd',
44         #          prefix=>"http://whatever/");
45         #
46         # Allow sometag elements to be automatically closed if needed
47         Tag('prefix:sometag', autoClose=>1);
48         #
49         # Don't forget this, so perl knows the package loaded.
50         1;
51

DESCRIPTION

53       This module provides a large set of utilities and declarations that are
54       useful for writing `bindings': LaTeXML-specific implementations of a
55       set of control sequences such as would be defined in a LaTeX style or
56       class file. They are also useful for controlling and customization of
57       LaTeXML's processing.  See the "See also" section, below, for
58       additional lower-level modules imported & re-exported.
59
60       To a limited extent (and currently only when explicitly enabled),
61       LaTeXML can process the raw TeX code found in style files.  However, to
62       preserve document structure and semantics, as well as for efficiency,
63       it is usually necessary to supply a LaTeXML-specific `binding' for
64       style and class files. For example, a binding "mypackage.sty.ltxml"
65       would encode LaTeXML-specific implementations of all the control
66       sequences in "mypackage.sty" so that "\usepackage{mypackage}" would
67       work.  Similarly for "myclass.cls.ltxml".  Additionally, document-
68       specific bindings can be supplied: before processing a TeX source file,
69       eg "mydoc.tex", LaTeXML will automatically include the definitions and
70       settings in "mydoc.latexml".  These ".ltxml" and ".latexml" files
71       should be placed LaTeXML's searchpaths, where will find them: either in
72       the current directory or in a directory given to the --path option, or
73       possibly added to the variable SEARCHPATHS).
74
75       Since LaTeXML mimics TeX, a familiarity with TeX's processing model is
76       critical.  LaTeXML models: catcodes and tokens (See
77       LaTeXML::Core::Token,  LaTeXML::Core::Tokens) which are extracted from
78       the plain source text characters by the LaTeXML::Core::Mouth; "Macros",
79       which are expanded within the LaTeXML::Core::Gullet; and "Primitives",
80       which are digested within the LaTeXML::Core::Stomach to produce
81       LaTeXML::Core::Box, LaTeXML::Core::List.  A key additional feature is
82       the "Constructors": when digested they generate a
83       LaTeXML::Core::Whatsit which, upon absorption by
84       LaTeXML::Core::Document, inserts text or XML fragments in the final
85       document tree.
86
87       Notation: Many of the following forms take code references as arguments
88       or options.  That is, either a reference to a defined sub, eg.
89       "\&somesub", or an anonymous function "sub { ... }".  To document these
90       cases, and the arguments that are passed in each case, we'll use a
91       notation like "code($stomach,...)".
92
93   Control Sequences
94       Many of the following forms define the behaviour of control sequences.
95       While in TeX you'll typically only define macros, LaTeXML is
96       effectively redefining TeX itself, so we define "Macros" as well as
97       "Primitives", "Registers", "Constructors" and "Environments".  These
98       define the behaviour of these control sequences when processed during
99       the various phases of LaTeX's imitation of TeX's digestive tract.
100
101       Prototypes
102
103       LaTeXML uses a more convenient method of specifying parameter patterns
104       for control sequences. The first argument to each of these defining
105       forms ("DefMacro", "DefPrimive", etc) is a prototype consisting of the
106       control sequence being defined along with the specification of
107       parameters required by the control sequence.  Each parameter describes
108       how to parse tokens following the control sequence into arguments or
109       how to delimit them.  To simplify coding and capture common idioms in
110       TeX/LaTeX programming, latexml's parameter specifications are more
111       expressive than TeX's  "\def" or LaTeX's "\newcommand".  Examples of
112       the prototypes for familiar TeX or LaTeX control sequences are:
113
114          DefConstructor('\usepackage[]{}',...
115          DefPrimitive('\multiply Variable SkipKeyword:by Number',..
116          DefPrimitive('\newcommand OptionalMatch:* DefToken[]{}', ...
117
118       The general syntax for parameter specification is
119
120       "{spec}"
121           reads a regular TeX argument.  spec can be omitted (ie. "{}").
122           Otherwise spec is itself a parameter specification and the argument
123           is reparsed to accordingly.  ("{}" is a shorthand for "Plain".)
124
125       "[spec]"
126           reads an LaTeX-style optional argument.  spec can be omitted (ie.
127           "{}").  Otherwise, if spec is of the form Default:stuff, then stuff
128           would be the default value.  Otherwise spec is itself a parameter
129           specification and the argument, if supplied, is reparsed according
130           to that specification.  ("[]" is a shorthand for "Optional".)
131
132       Type
133           Reads an argument of the given type, where either Type has been
134           declared, or there exists a ReadType function accessible from
135           LaTeXML::Package::Pool.  See the available types, below.
136
137       "Type:value | Type:value1:value2..."
138           These forms invoke the parser for Type but pass additional Tokens
139           to the reader function.  Typically this would supply defaults or
140           parameters to a match.
141
142       "OptionalType"
143           Similar to Type, but it is not considered an error if the reader
144           returns undef.
145
146       "SkipType"
147           Similar to "Optional"Type, but the value returned from the reader
148           is ignored, and does not occupy a position in the arguments list.
149
150       The predefined argument Types are as follows.
151
152       "Plain, Semiverbatim"
153
154           Reads a standard TeX argument being either the next token, or if
155           the next token is an {, the balanced token list.  In the case of
156           "Semiverbatim", many catcodes are disabled, which is handy for
157           URL's, labels and similar.
158
159       "Token, XToken"
160
161           Read a single TeX Token.  For "XToken", if the next token is
162           expandable, it is repeatedly expanded until an unexpandable token
163           remains, which is returned.
164
165       "Number, Dimension, Glue | MuGlue"
166
167           Read an Object corresponding to Number, Dimension, Glue or MuGlue,
168           using TeX's rules for parsing these objects.
169
170       "Until:match | XUntil:"match>
171
172           Reads tokens until a match to the tokens match is found, returning
173           the tokens preceding the match. This corresponds to TeX delimited
174           arguments.  For "XUntil", tokens are expanded as they are matched
175           and accumulated (but a brace reads and accumulates till a matching
176           close brace, without expanding).
177
178       "UntilBrace"
179
180           Reads tokens until the next open brace "{".  This corresponds to
181           the peculiar TeX construct "\def\foo#{...".
182
183       "Match:match(|match)* | Keyword:"match(|match)*>
184
185           Reads tokens expecting a match to one of the token lists match,
186           returning the one that matches, or undef.  For "Keyword", case and
187           catcode of the matches are ignored.  Additionally, any leading
188           spaces are skipped.
189
190       "Balanced"
191
192           Read tokens until a closing }, but respecting nested {} pairs.
193
194       "BalancedParen"
195
196           Read a parenthesis delimited tokens, but does not balance any
197           nested parentheses.
198
199       "Undigested, Digested, DigestUntil:match"
200
201           These types alter the usual sequence of tokenization and digestion
202           in separate stages (like TeX).  A "Undigested" parameter inhibits
203           digestion completely and remains in token form.  A "Digested"
204           parameter gets digested until the (required) opening { is balanced;
205           this is useful when the content would usually need to have been
206           protected in order to correctly deal with catcodes.  "DigestUntil"
207           digests tokens until a token matching match is found.
208
209       "Variable"
210
211           Reads a token, expanding if necessary, and expects a control
212           sequence naming a writable register.  If such is found, it returns
213           an array of the corresponding definition object, and any arguments
214           required by that definition.
215
216       "SkipSpaces, Skip1Space"
217
218           Skips one, or any number of, space tokens, if present, but
219           contributes nothing to the argument list.
220
221       Common Options
222
223       "scope=>'local' | 'global' | scope"
224           Most defining commands accept an option to control how the
225           definition is stored, for global or local definitions, or using a
226           named scope A named scope saves a set of definitions and values
227           that can be activated at a later time.
228
229           Particularly interesting forms of scope are those that get
230           automatically activated upon changes of counter and label.  For
231           example, definitions that have "scope=>'section:1.1'"  will be
232           activated when the section number is "1.1", and will be deactivated
233           when that section ends.
234
235       "locked=>boolean"
236           This option controls whether this definition is locked from further
237           changes in the TeX sources; this keeps local 'customizations' by an
238           author from overriding important LaTeXML definitions and breaking
239           the conversion.
240
241       "protected=>boolean"
242           Makes a definition "protected", in the sense of eTeX's "\protected"
243           directive.  This inhibits expansion under certain circumstances.
244
245       "robust=>boolean"
246           Makes a definition "robust", in the sense of LaTeX's
247           "\DeclareRobustCommand".  This essentially creates an indirect
248           macro definition which is preceded by "\protect".  This inhibits
249           expansion (and argument processing!) under certain circumstances.
250           It usually only makes sense for macros, but may be useful for
251           Primitives, Constructors and DefMath in cases where LaTeX would
252           normally have created a macro that needs protection.
253
254       Macros
255
256       "DefMacro(prototype, expansion, %options);"
257
258           Defines the macro expansion for prototype; a macro control sequence
259           that is expanded during macro expansion time in the
260           LaTeXML::Core::Gullet.  The expansion should be one of tokens |
261           string | code($gullet,@args)>: a string will be tokenized upon
262           first usage.  Any macro arguments will be substituted for parameter
263           indicators (eg #1) in the tokens or tokenized string and the result
264           is used as the expansion of the control sequence. If code is used,
265           it is called at expansion time and should return a list of tokens
266           as its result.
267
268           DefMacro options are
269
270           "scope=>scope",
271           "locked=>boolean"
272               See "Common Options".
273
274           "mathactive=>boolean"
275               specifies a definition that will only be expanded in math mode;
276               the control sequence must be a single character.
277
278           Examples:
279
280             DefMacro('\thefootnote','\arabic{footnote}');
281             DefMacro('\today',sub { ExplodeText(today()); });
282
283       "DefMacroI(cs, paramlist, expansion, %options);"
284
285           Internal form of "DefMacro" where the control sequence and
286           parameter list have already been separated; useful for definitions
287           from within code.  Also, slightly more efficient for macros with no
288           arguments (use "undef" for paramlist), and useful for obscure cases
289           like defining "\begin{something*}" as a Macro.
290
291       Conditionals
292
293       "DefConditional(prototype, test, %options);"
294
295           Defines a conditional for prototype; a control sequence that is
296           processed during macro expansion time (in the
297           LaTeXML::Core::Gullet).  A conditional corresponds to a TeX "\if".
298           If the test is "undef", a "\newif" type of conditional is defined,
299           which is controlled with control sequences like "\footrue" and
300           "\foofalse".  Otherwise the test should be "code($gullet,@args)"
301           (with the control sequence's arguments) that is called at expand
302           time to determine the condition.  Depending on whether the result
303           of that evaluation returns a true or false value (in the usual Perl
304           sense), the result of the expansion is either the first or else
305           code following, in the usual TeX sense.
306
307           DefConditional options are
308
309           "scope=>scope",
310           "locked=>boolean"
311               See "Common Options".
312
313           "skipper=>code($gullet)"
314               This option is only used to define "\ifcase".
315
316           Example:
317
318             DefConditional('\ifmmode',sub {
319                LookupValue('IN_MATH'); });
320
321       "DefConditionalI(cs, paramlist, test, %options);"
322
323           Internal form of "DefConditional" where the control sequence and
324           parameter list have already been parsed; useful for definitions
325           from within code.  Also, slightly more efficient for conditinal
326           with no arguments (use "undef" for "paramlist").
327
328       "IfCondition($ifcs,@args)"
329
330           "IfCondition" allows you to test a conditional from within perl.
331           Thus something like "if(IfCondition('\ifmmode')){ domath } else {
332           dotext }" might be equivalent to TeX's "\ifmmode domath \else
333           dotext \fi".
334
335       Primitives
336
337       "DefPrimitive(prototype, replacement, %options);"
338
339           Defines a primitive control sequence; a primitive is processed
340           during digestion (in the  LaTeXML::Core::Stomach), after macro
341           expansion but before Construction time.  Primitive control
342           sequences generate Boxes or Lists, generally containing basic
343           Unicode content, rather than structured XML.  Primitive control
344           sequences are also executed for side effect during digestion,
345           effecting changes to the LaTeXML::Core::State.
346
347           The replacement can be a string used as the text content of a Box
348           to be created (using the current font).  Alternatively replacement
349           can be "code($stomach,@args)" (with the control sequence's
350           arguments) which is invoked at digestion time, probably for side-
351           effect, but returning Boxes or Lists or nothing.  replacement may
352           also be undef, which contributes nothing to the document, but does
353           record the TeX code that created it.
354
355           DefPrimitive options are
356
357           "scope=>scope",
358           "locked=>boolean"
359               See "Common Options".
360
361           "mode=> ('text' | 'display_math' | 'inline_math')"
362               Changes to this mode during digestion.
363
364           "font=>{%fontspec}"
365               Specifies the font to use (see "Fonts").  If the font change is
366               to only apply to material generated within this command, you
367               would also use "<bounded="1>>; otherwise, the font will remain
368               in effect afterwards as for a font switching command.
369
370           "bounded=>boolean"
371               If true, TeX grouping (ie. "{}") is enforced around this
372               invocation.
373
374           "requireMath=>boolean",
375           "forbidMath=>boolean"
376               specifies whether the given constructor can only appear, or
377               cannot appear, in math mode.
378
379           "beforeDigest=>code($stomach)"
380               supplies a hook to execute during digestion just before the
381               main part of the primitive is executed (and before any
382               arguments have been read).  The code should either return
383               nothing (return;) or a list of digested items
384               (Box's,List,Whatsit).  It can thus change the State and/or add
385               to the digested output.
386
387           "afterDigest=>code($stomach)"
388               supplies a hook to execute during digestion just after the main
389               part of the primitive ie executed.  it should either return
390               nothing (return;) or digested items.  It can thus change the
391               State and/or add to the digested output.
392
393           "isPrefix=>boolean"
394               indicates whether this is a prefix type of command; This is
395               only used for the special TeX assignment prefixes, like
396               "\global".
397
398           Example:
399
400              DefPrimitive('\begingroup',sub { $_[0]->begingroup; });
401
402       "DefPrimitiveI(cs, paramlist, code($stomach,@args), %options);"
403
404           Internal form of "DefPrimitive" where the control sequence and
405           parameter list have already been separated; useful for definitions
406           from within code.
407
408       Registers
409
410       "DefRegister(prototype, value, %options);"
411
412           Defines a register with value as the initial value (a Number,
413           Dimension, Glue, MuGlue or Tokens --- I haven't handled Box's yet).
414           Usually, the prototype is just the control sequence, but registers
415           are also handled by prototypes like "\count{Number}". "DefRegister"
416           arranges that the register value can be accessed when a numeric,
417           dimension, ... value is being read, and also defines the control
418           sequence for assignment.
419
420           Options are
421
422           "readonly=>boolean"
423               specifies if it is not allowed to change this value.
424
425           "getter=>code(@args)",
426           "setter=>code($value,@args)"
427               By default value is stored in the State's Value table under a
428               name concatenating the control sequence and argument values.
429               These options allow other means of fetching and storing the
430               value.
431
432           Example:
433
434             DefRegister('\pretolerance',Number(100));
435
436       "DefRegisterI(cs, paramlist, value, %options);"
437
438           Internal form of "DefRegister" where the control sequence and
439           parameter list have already been parsed; useful for definitions
440           from within code.
441
442       Constructors
443
444       "DefConstructor(prototype, $replacement, %options);"
445
446           The Constructor is where LaTeXML really starts getting interesting;
447           invoking the control sequence will generate an arbitrary XML
448           fragment in the document tree.  More specifically: during
449           digestion, the arguments will be read and digested, creating a
450           LaTeXML::Core::Whatsit to represent the object. During absorption
451           by the LaTeXML::Core::Document, the "Whatsit" will generate the XML
452           fragment according to replacement. The replacement can be
453           "code($document,@args,%properties)" which is called during document
454           absorption to create the appropriate XML (See the methods of
455           LaTeXML::Core::Document).
456
457           More conveniently, replacement can be an pattern: simply a bit of
458           XML as a string with certain substitutions to be made. The
459           substitutions are of the following forms:
460
461           "#1, #2 ... #name"
462               These are replaced by the corresponding argument (for #1) or
463               property (for #name) stored with the Whatsit. Each are turned
464               into a string when it appears as in an attribute position, or
465               recursively processed when it appears as content.
466
467           "&function(@args)"
468               Another form of substituted value is prefixed with "&" which
469               invokes a function.  For example, " &func(#1) " would invoke
470               the function "func" on the first argument to the control
471               sequence; what it returns will be inserted into the document.
472
473           "?test(pattern)"  or "?test(ifpattern)(elsepattern)"
474               Patterns can be conditionallized using this form.  The test is
475               any of the above expressions (eg. "#1"), considered true if the
476               result is non-empty.  Thus "?#1(<foo/>)" would add the empty
477               element "foo" if the first argument were given.
478
479           "^" If the constructor begins with "^", the XML fragment is allowed
480               to float up to a parent node that is allowed to contain it,
481               according to the Document Type.
482
483           The Whatsit property "font" is defined by default.  Additional
484           properties "body" and "trailer" are defined when "captureBody" is
485           true, or for environments.  By using
486           "$whatsit->setProperty(key=>$value);" within "afterDigest", or by
487           using the "properties" option, other properties can be added.
488
489           DefConstructor options are
490
491           "scope=>scope",
492           "locked=>boolean"
493               See "Common Options".
494
495           "mode=>mode",
496           "font=>{%fontspec}",
497           "bounded=>boolean",
498           "requireMath=>boolean",
499           "forbidMath=>boolean"
500               These options are the same as for "Primitives"
501
502           "reversion=>texstring | code($whatsit,#1,#2,...)"
503               specifies the reversion of the invocation back into TeX tokens
504               (if the default reversion is not appropriate).  The textstring
505               string can include "#1", "#2"...  The code is called with the
506               $whatsit and digested arguments and must return a list of
507               Token's.
508
509           "alias=>control_sequence"
510               provides a control sequence to be used in the "reversion"
511               instead of the one defined in the "prototype".  This is a
512               convenient alternative for reversion when a 'public' command
513               conditionally expands into an internal one, but the reversion
514               should be for the public command.
515
516           "sizer=>string | code($whatsit)"
517               specifies how to compute (approximate) the displayed size of
518               the object, if that size is ever needed (typically needed for
519               graphics generation).  If a string is given, it should contain
520               only a sequence of "#1" or "#name" to access arguments and
521               properties of the Whatsit: the size is computed from these
522               items layed out side-by-side.  If code is given, it should
523               return the three Dimensions (width, height and depth).  If
524               neither is given, and the "reversion" specification is of
525               suitible format, it will be used for the sizer.
526
527           "properties=>{%properties} | code($stomach,#1,#2...)"
528               supplies additional properties to be set on the generated
529               Whatsit.  In the first form, the values can be of any type, but
530               if a value is a code references, it takes the same args
531               ($stomach,#1,#2,...) and should return the value; it is
532               executed before creating the Whatsit.  In the second form, the
533               code should return a hash of properties.
534
535           "beforeDigest=>code($stomach)"
536               supplies a hook to execute during digestion just before the
537               Whatsit is created.  The code should either return nothing
538               (return;) or a list of digested items (Box's,List,Whatsit).  It
539               can thus change the State and/or add to the digested output.
540
541           "afterDigest=>code($stomach,$whatsit)"
542               supplies a hook to execute during digestion just after the
543               Whatsit is created (and so the Whatsit already has its
544               arguments and properties). It should either return nothing
545               (return;) or digested items.  It can thus change the State,
546               modify the Whatsit, and/or add to the digested output.
547
548           "beforeConstruct=>code($document,$whatsit)"
549               supplies a hook to execute before constructing the XML
550               (generated by replacement).
551
552           "afterConstruct=>code($document,$whatsit)"
553               Supplies code to execute after constructing the XML.
554
555           "captureBody=>boolean | Token"
556               if true, arbitrary following material will be accumulated into
557               a `body' until the current grouping level is reverted, or till
558               the "Token" is encountered if the option is a "Token".  This
559               body is available as the "body" property of the Whatsit.  This
560               is used by environments and math.
561
562           "nargs=>nargs"
563               This gives a number of args for cases where it can't be
564               inferred directly from the prototype (eg. when more args are
565               explicitly read by hooks).
566
567       "DefConstructorI(cs, paramlist, replacement, %options);"
568
569           Internal form of "DefConstructor" where the control sequence and
570           parameter list have already been separated; useful for definitions
571           from within code.
572
573       "DefMath(prototype, tex, %options);"
574
575           A common shorthand constructor; it defines a control sequence that
576           creates a mathematical object, such as a symbol, function or
577           operator application.  The options given can effectively create
578           semantic macros that contribute to the eventual parsing of
579           mathematical content.  In particular, it generates an XMDual using
580           the replacement tex for the presentation.  The content information
581           is drawn from the name and options
582
583           "DefMath" accepts the options:
584
585           "scope=>scope",
586           "locked=>boolean"
587               See "Common Options".
588
589           "font=>{%fontspec}",
590           "reversion=>reversion",
591           "alias=>cs",
592           "sizer=>sizer",
593           "properties=>properties",
594           "beforeDigest=>code($stomach)",
595           "afterDigest=>code($stomach,$whatsit)",
596               These options are the same as for "Constructors"
597
598           "name=>name"
599               gives a name attribute for the object
600
601           "omcd=>cdname"
602               gives the OpenMath content dictionary that name is from.
603
604           "role=>grammatical_role"
605               adds a grammatical role attribute to the object; this specifies
606               the grammatical role that the object plays in surrounding
607               expressions.  This direly needs documentation!
608
609           "mathstyle=>('display' | 'text' | 'script' | 'scriptscript')"
610               Controls whether the this object will be presented in a
611               specific mathstyle, or according to the current setting of
612               "mathstyle".
613
614           "scriptpos=>('mid' | 'post')"
615               Controls the positioning of any sub and super-scripts relative
616               to this object; whether they be stacked over or under it, or
617               whether they will appear in the usual position.  TeX.pool
618               defines a function "doScriptpos()" which is useful for
619               operators like "\sum" in that it sets to "mid" position when in
620               displaystyle, otherwise "post".
621
622           "stretchy=>boolean"
623               Whether or not the object is stretchy when displayed.
624
625           "operator_role=>grammatical_role",
626           "operator_scriptpos=>boolean",
627           "operator_stretchy=>boolean"
628               These three are similar to "role", "scriptpos" and "stretchy",
629               but are used in unusual cases.  These apply to the given
630               attributes to the operator token in the content branch.
631
632           "nogroup=>boolean"
633               Normally, these commands are digested with an implicit grouping
634               around them, localizing changes to fonts, etc; "noggroup=>1"
635               inhibits this.
636
637           Example:
638
639             DefMath('\infty',"\x{221E}",
640                role=>'ID', meaning=>'infinity');
641
642       "DefMathI(cs, paramlist, tex, %options);"
643
644           Internal form of "DefMath" where the control sequence and parameter
645           list have already been separated; useful for definitions from
646           within code.
647
648       Environments
649
650       "DefEnvironment(prototype, replacement, %options);"
651
652           Defines an Environment that generates a specific XML fragment.
653           "replacement" is of the same form as for DefConstructor, but will
654           generally include reference to the "#body" property. Upon
655           encountering a "\begin{env}":  the mode is switched, if needed,
656           else a new group is opened; then the environment name is noted; the
657           beforeDigest hook is run.  Then the Whatsit representing the begin
658           command (but ultimately the whole environment) is created and the
659           afterDigestBegin hook is run.  Next, the body will be digested and
660           collected until the balancing "\end{env}".   Then, any afterDigest
661           hook is run, the environment is ended, finally the mode is ended or
662           the group is closed.  The body and "\end{env}" whatsit are added to
663           the "\begin{env}"'s whatsit as body and trailer, respectively.
664
665           "DefEnvironment" takes the following options:
666
667           "scope=>scope",
668           "locked=>boolean"
669               See "Common Options".
670
671           "mode=>mode",
672           "font=>{%fontspec}"
673           "requireMath=>boolean",
674           "forbidMath=>boolean",
675               These options are the same as for "Primitives"
676
677           "reversion=>reversion",
678           "alias=>cs",
679           "sizer=>sizer",
680           "properties=>properties",
681           "nargs=>nargs"
682               These options are the same as for "Constructors"
683
684           "beforeDigest=>code($stomach)"
685               This hook is similar to that for "DefConstructor", but it
686               applies to the "\begin{environment}" control sequence.
687
688           "afterDigestBegin=>code($stomach,$whatsit)"
689               This hook is similar to "DefConstructor"'s "afterDigest" but it
690               applies to the "\begin{environment}" control sequence.  The
691               Whatsit is the one for the beginning control sequence, but
692               represents the environment as a whole.  Note that although the
693               arguments and properties are present in the Whatsit, the body
694               of the environment is not yet available!
695
696           "beforeDigestEnd=>code($stomach)"
697               This hook is similar to "DefConstructor"'s "beforeDigest" but
698               it applies to the "\end{environment}" control sequence.
699
700           "afterDigest=>code($stomach,$whatsit)"
701               This hook is similar to "DefConstructor"'s "afterDigest" but it
702               applies to the "\end{environment}" control sequence.  Note,
703               however that the Whatsit is only for the ending control
704               sequence, not the Whatsit for the environment as a whole.
705
706           "afterDigestBody=>code($stomach,$whatsit)"
707               This option supplies a hook to be executed during digestion
708               after the ending control sequence has been digested (and all
709               the 4 other digestion hook have executed) and after the body of
710               the environment has been obtained.  The Whatsit is the (useful)
711               one representing the whole environment, and it now does have
712               the body and trailer available, stored as a properties.
713
714           Example:
715
716             DefConstructor('\emph{}',
717                "<ltx:emph>#1</ltx:emph", mode=>'text');
718
719       "DefEnvironmentI(name, paramlist, replacement, %options);"
720
721           Internal form of "DefEnvironment" where the control sequence and
722           parameter list have already been separated; useful for definitions
723           from within code.
724
725   Inputing Content and Definitions
726       "FindFile(name, %options);"
727
728           Find an appropriate file with the given name in the current
729           directories in "SEARCHPATHS".  If a file ending with ".ltxml" is
730           found, it will be preferred.
731
732           Note that if the "name" starts with a recognized protocol
733           (currently one of "(literal|http|https|ftp)") followed by a colon,
734           the name is returned, as is, and no search for files is carried
735           out.
736
737           The options are:
738
739           "type=>type"
740               specifies the file type.  If not set, it will search for both
741               "name.tex" and name.
742
743           "noltxml=>1"
744               inhibits searching for a LaTeXML binding ("name.type.ltxml") to
745               use instead of the file itself.
746
747           "notex=>1"
748               inhibits searching for raw tex version of the file.  That is,
749               it will only search for the LaTeXML binding.
750
751       "InputContent(request, %options);"
752
753           "InputContent" is used for cases when the file (or data) is plain
754           TeX material that is expected to contribute content to the document
755           (as opposed to pure definitions).  A Mouth is opened onto the file,
756           and subsequent reading and/or digestion will pull Tokens from that
757           Mouth until it is exhausted, or closed.
758
759           In some circumstances it may be useful to provide a string
760           containing the TeX material explicitly, rather than referencing a
761           file.  In this case, the "literal" pseudo-protocal may be used:
762
763             InputContent('literal:\textit{Hey}');
764
765           If a file named "$request.latexml" exists, it will be read in as if
766           it were a latexml binding file, before processing.  This can be
767           used for adhoc customization of the conversion of specific files,
768           without modifying the source, or creating more elaborate bindings.
769
770           The only option to "InputContent" is:
771
772           "noerror=>boolean"
773               Inhibits signalling an error if no appropriate file is found.
774
775       "Input(request);"
776
777           "Input" is analogous to LaTeX's "\input", and is used in cases
778           where it isn't completely clear whether content or definitions is
779           expected.  Once a file is found, the approach specified by
780           "InputContent" or "InputDefinitions" is used, depending on which
781           type of file is found.
782
783       "InputDefinitions(request, %options);"
784
785           "InputDefinitions" is used for loading definitions, ie. various
786           macros, settings, etc, rather than document content; it can be used
787           to load LaTeXML's binding files, or for reading in raw TeX
788           definitions or style files.  It reads and processes the material
789           completely before returning, even in the case of TeX definitions.
790           This procedure optionally supports the conventions used for
791           standard LaTeX packages and classes (see "RequirePackage" and
792           "LoadClass").
793
794           Options for "InputDefinitions" are:
795
796           "type=>type"
797               the file type to search for.
798
799           "noltxml=>boolean"
800               inhibits searching for a LaTeXML binding; only raw TeX files
801               will be sought and loaded.
802
803           "notex=>boolean"
804               inhibits searching for raw TeX files, only a LaTeXML binding
805               will be sought and loaded.
806
807           "noerror=>boolean"
808               inhibits reporting an error if no appropriate file is found.
809
810           The following options are primarily useful when "InputDefinitions"
811           is supporting standard LaTeX package and class loading.
812
813           "withoptions=>boolean"
814               indicates whether to pass in any options from the calling class
815               or package.
816
817           "handleoptions=>boolean"
818               indicates whether options processing should be handled.
819
820           "options=>[...]"
821               specifies a list of options (in the 'package options' sense) to
822               be passed (possibly in addition to any provided by the calling
823               class or package).
824
825           "after=>tokens | code($gullet)"
826               provides tokens or code to be processed by a "name.type-h@@k"
827               macro.
828
829           "as_class=>boolean"
830               fishy option that indicates that this definitions file should
831               be treated as if it were defining a class; typically shows up
832               in latex compatibility mode, or AMSTeX.
833
834           A handy method to use most of the TeX distribution's raw TeX
835           definitions for a package, but override only a few with LaTeXML
836           bindings is by defining a binding file, say "tikz.sty.ltxml", to
837           contain
838
839             InputDefinitions('tikz', type => 'sty', noltxml => 1);
840
841           which would find and read in "tizk.sty", and then follow it by a
842           couple of strategic LaTeXML definitions, "DefMacro", etc.
843
844   Class and Packages
845       "RequirePackage(package, %options);"
846
847           Finds and loads a package implementation (usually
848           "package.sty.ltxml", unless "noltxml" is specified)for the
849           requested package.  It returns the pathname of the loaded package.
850           The options are:
851
852           "type=>type"
853               specifies the file type (default "sty".
854
855           "options=>[...]"
856               specifies a list of package options.
857
858           "noltxml=>boolean"
859               inhibits searching for the LaTeXML binding for the file (ie.
860               "name.type.ltxml"
861
862           "notex=>1"
863               inhibits searching for raw tex version of the file.  That is,
864               it will only search for the LaTeXML binding.
865
866       "LoadClass(class, %options);"
867
868           Finds and loads a class definition (usually "class.cls.ltxml").  It
869           returns the pathname of the loaded class.  The only option is
870
871           "options=>[...]"
872               specifies a list of class options.
873
874       "LoadPool(pool, %options);"
875
876           Loads a pool file (usually "pool.pool.ltxml"), one of the top-level
877           definition files, such as TeX, LaTeX or AMSTeX.  It returns the
878           pathname of the loaded file.
879
880       "DeclareOption(option, tokens | string | code($stomach));"
881
882           Declares an option for the current package or class.  The 2nd
883           argument can be a string (which will be tokenized and expanded) or
884           tokens (which will be macro expanded), to provide the value for the
885           option, or it can be a code reference which is treated as a
886           primitive for side-effect.
887
888           If a package or class wants to accommodate options, it should start
889           with one or more "DeclareOptions", followed by "ProcessOptions()".
890
891       "PassOptions(name, ext, @options); "
892
893           Causes the given @options (strings) to be passed to the package (if
894           ext is "sty") or class (if ext is "cls") named by name.
895
896       "ProcessOptions(%options);"
897
898           Processes the options that have been passed to the current package
899           or class in a fashion similar to LaTeX.  The only option (to
900           "ProcessOptions" is "inorder=>boolean" indicating whehter the
901           (package) options are processed in the order they were used, like
902           "ProcessOptions*".
903
904       "ExecuteOptions(@options);"
905
906           Process the options given explicitly in @options.
907
908       "AtBeginDocument(@stuff); "
909
910           Arranges for @stuff to be carried out after the preamble, at the
911           beginning of the document.  @stuff should typically be macro-level
912           stuff, but carried out for side effect; it should be tokens, tokens
913           lists, strings (which will be tokenized), or "code($gullet)" which
914           would yield tokens to be expanded.
915
916           This operation is useful for style files loaded with "--preload" or
917           document specific customization files (ie. ending with ".latexml");
918           normally the contents would be executed before LaTeX and other
919           style files are loaded and thus can be overridden by them.  By
920           deferring the evaluation to begin-document time, these contents can
921           override those style files.  This is likely to only be meaningful
922           for LaTeX documents.
923
924       "AtEndDocument(@stuff)"
925           Arranges for @stuff to be carried out just before
926           "\\end{document}".  These tokens can be used for side effect, or
927           any content they generate will appear as the last children of the
928           document.
929
930   Counters and IDs
931       "NewCounter(ctr, within, %options);"
932
933           Defines a new counter, like LaTeX's \newcounter, but extended.  It
934           defines a counter that can be used to generate reference numbers,
935           and defines "\thectr", etc. It also defines an "uncounter" which
936           can be used to generate ID's (xml:id) for unnumbered objects.  ctr
937           is the name of the counter.  If defined, within is the name of
938           another counter which, when incremented, will cause this counter to
939           be reset.  The options are
940
941           "idprefix=>string"
942               Specifies a prefix to be used to generate ID's when using this
943               counter
944
945           "nested"
946               Not sure that this is even sane.
947
948       "$num = CounterValue($ctr);"
949
950           Fetches the value associated with the counter $ctr.
951
952       "$tokens = StepCounter($ctr);"
953
954           Analog of "\stepcounter", steps the counter and returns the
955           expansion of "\the$ctr".  Usually you should use
956           "RefStepCounter($ctr)" instead.
957
958       "$keys = RefStepCounter($ctr);"
959
960           Analog of "\refstepcounter", steps the counter and returns a hash
961           containing the keys "refnum="$refnum, id=>$id>.  This makes it
962           suitable for use in a "properties" option to constructors.  The
963           "id" is generated in parallel with the reference number to assist
964           debugging.
965
966       "$keys = RefStepID($ctr);"
967
968           Like to "RefStepCounter", but only steps the "uncounter", and
969           returns only the id;  This is useful for unnumbered cases of
970           objects that normally get both a refnum and id.
971
972       "ResetCounter($ctr);"
973
974           Resets the counter $ctr to zero.
975
976       "GenerateID($document,$node,$whatsit,$prefix);"
977
978           Generates an ID for nodes during the construction phase, useful for
979           cases where the counter based scheme is inappropriate.  The calling
980           pattern makes it appropriate for use in Tag, as in
981
982              Tag('ltx:para',afterClose=>sub { GenerateID(@_,'p'); })
983
984           If $node doesn't already have an xml:id set, it computes an
985           appropriate id by concatenating the xml:id of the closest ancestor
986           with an id (if any), the prefix (if any) and a unique counter.
987
988   Document Model
989       Constructors define how TeX markup will generate XML fragments, but the
990       Document Model is used to control exactly how those fragments are
991       assembled.
992
993       "Tag(tag, %properties);"
994
995           Declares properties of elements with the name tag.  Note that "Tag"
996           can set or add properties to any element from any binding file,
997           unlike the properties set on control by  "DefPrimtive",
998           "DefConstructor", etc..  And, since the properties are recorded in
999           the current Model, they are not subject to TeX grouping; once set,
1000           they remain in effect until changed or the end of the document.
1001
1002           The tag can be specified in one of three forms:
1003
1004              prefix:name matches specific name in specific namespace
1005              prefix:*    matches any tag in the specific namespace;
1006              *           matches any tag in any namespace.
1007
1008           There are two kinds of properties:
1009
1010           Scalar properties
1011               For scalar properties, only a single value is returned for a
1012               given element.  When the property is looked up, each of the
1013               above forms is considered (the specific element name, the
1014               namespace, and all elements); the first defined value is
1015               returned.
1016
1017               The recognized scalar properties are:
1018
1019               "autoOpen=>boolean"
1020                   Specifies whether tag can be automatically opened if needed
1021                   to insert an element that can only be contained by tag.
1022                   This property can help match the more  SGML-like LaTeX to
1023                   XML.
1024
1025               "autoClose=>boolean"
1026                   Specifies whether this tag can be automatically closed if
1027                   needed to close an ancestor node, or insert an element into
1028                   an ancestor.  This property can help match the more  SGML-
1029                   like LaTeX to XML.
1030
1031           Code properties
1032               These properties provide a bit of code to be run at the times
1033               of certain events associated with an element.  All the code
1034               bits that match a given element will be run, and since they can
1035               be added by any binding file, and be specified in a random
1036               orders, a little bit of extra control is desirable.
1037
1038               Firstly, any early codes are run (eg "afterOpen:early"), then
1039               any normal codes (without modifier) are run, and finally any
1040               late codes are run (eg. "afterOpen:late").
1041
1042               Within each of those groups, the codes assigned for an
1043               element's specific name are run first, then those assigned for
1044               its package and finally the generic one ("*"); that is, the
1045               most specific codes are run first.
1046
1047               When code properties are accumulated by "Tag" for normal or
1048               late events, the code is appended to the end of the current
1049               list (if there were any previous codes added); for early event,
1050               the code is prepended.
1051
1052               The recognized code properties are:
1053
1054               "afterOpen=>code($document,$box)"
1055                   Provides code to be run whenever a node with this tag is
1056                   opened.  It is called with the document being constructed,
1057                   and the initiating digested object as arguments.  It is
1058                   called after the node has been created, and after any
1059                   initial attributes due to the constructor (passed to
1060                   openElement) are added.
1061
1062                   "afterOpen:early" or "afterOpen:late" can be used in place
1063                   of "afterOpen"; these will be run as a group before, or
1064                   after (respectively) the unmodified blocks.
1065
1066               "afterClose=>code($document,$box)"
1067                   Provides code to be run whenever a node with this tag is
1068                   closed.  It is called with the document being constructed,
1069                   and the initiating digested object as arguments.
1070
1071                   "afterClose:early" or "afterClose:late" can be used in
1072                   place of "afterClose"; these will be run as a group bfore,
1073                   or after (respectively) the unmodified blocks.
1074
1075       "RelaxNGSchema(schemaname);"
1076
1077           Specifies the schema to use for determining document model.  You
1078           can leave off the extension; it will look for "schemaname.rng" (and
1079           maybe eventually, ".rnc" if that is ever implemented).
1080
1081       "RegisterNamespace(prefix, URL);"
1082
1083           Declares the prefix to be associated with the given URL.  These
1084           prefixes may be used in ltxml files, particularly for constructors,
1085           xpath expressions, etc.  They are not necessarily the same as the
1086           prefixes that will be used in the generated document Use the prefix
1087           "#default" for the default, non-prefixed, namespace.  (See
1088           RegisterDocumentNamespace, as well as DocType or RelaxNGSchema).
1089
1090       "RegisterDocumentNamespace(prefix, URL);"
1091
1092           Declares the prefix to be associated with the given URL used within
1093           the generated XML. They are not necessarily the same as the
1094           prefixes used in code (RegisterNamespace).  This function is less
1095           rarely needed, as the namespace declarations are generally obtained
1096           from the DTD or Schema themselves Use the prefix "#default" for the
1097           default, non-prefixed, namespace.  (See DocType or RelaxNGSchema).
1098
1099       "DocType(rootelement, publicid, systemid, %namespaces);"
1100
1101           Declares the expected rootelement, the public and system ID's of
1102           the document type to be used in the final document.  The hash
1103           %namespaces specifies the namespaces prefixes that are expected to
1104           be found in the DTD, along with each associated namespace URI.  Use
1105           the prefix "#default" for the default namespace (ie. the namespace
1106           of non-prefixed elements in the DTD).
1107
1108           The prefixes defined for the DTD may be different from the prefixes
1109           used in implementation CODE (eg. in ltxml files; see
1110           RegisterNamespace).  The generated document will use the namespaces
1111           and prefixes defined for the DTD.
1112
1113   Document Rewriting
1114       During document construction, as each node gets closed, the text
1115       content gets simplfied.  We'll call it applying ligatures, for lack of
1116       a better name.
1117
1118       "DefLigature(regexp, %options);"
1119
1120           Apply the regular expression (given as a string: "/fa/fa/" since it
1121           will be converted internally to a true regexp), to the text
1122           content.  The only option is "fontTest=>code($font)"; if given,
1123           then the substitution is applied only when "fontTest" returns true.
1124
1125           Predefined Ligatures combine sequences of "." or single-quotes into
1126           appropriate Unicode characters.
1127
1128       "DefMathLigature($string"=""$replacment,%options);>
1129
1130           A Math Ligature typically combines a sequence of math tokens
1131           (XMTok) into a single one.  A simple example is
1132
1133              DefMathLigature(":=" => ":=", role => 'RELOP', meaning => 'assign');
1134
1135           replaces the two tokens for colon and equals by a token
1136           representing assignment.  The options are those characterising an
1137           XMTok, namely: "role", "meaning" and "name".
1138
1139           For more complex cases (recognizing numbers, for example), you may
1140           supply a function "matcher="CODE($document,$node)>, which is passed
1141           the current document and the last math node in the sequence.  It
1142           should examine $node and any preceding nodes (using
1143           "previousSibling") and return a list of "($n,$string,%attributes)"
1144           to replace the $n nodes by a new one with text content being
1145           $string content and the given attributes.  If no replacement is
1146           called for, CODE should return undef.
1147
1148       After document construction, various rewriting and augmenting of the
1149       document can take place.
1150
1151       "DefRewrite(%specification);"
1152       "DefMathRewrite(%specification);"
1153
1154           These two declarations define document rewrite rules that are
1155           applied to the document tree after it has been constructed, but
1156           before math parsing, or any other postprocessing, is done.  The
1157           %specification consists of a sequence of key/value pairs with the
1158           initial specs successively narrowing the selection of document
1159           nodes, and the remaining specs indicating how to modify or replace
1160           the selected nodes.
1161
1162           The following select portions of the document:
1163
1164           "label=>label"
1165               Selects the part of the document with label=$label
1166
1167           "scope=>scope"
1168               The scope could be "label:foo" or "section:1.2.3" or something
1169               similar. These select a subtree labelled 'foo', or a section
1170               with reference number "1.2.3"
1171
1172           "xpath=>xpath"
1173               Select those nodes matching an explicit xpath expression.
1174
1175           "match=>tex"
1176               Selects nodes that look like what the processing of tex would
1177               produce.
1178
1179           "regexp=>regexp"
1180               Selects text nodes that match the regular expression.
1181
1182           The following act upon the selected node:
1183
1184           "attributes=>hashref"
1185               Adds the attributes given in the hash reference to the node.
1186
1187           "replace=>replacement"
1188               Interprets replacement as TeX code to generate nodes that will
1189               replace the selected nodes.
1190
1191   Mid-Level support
1192       "$tokens = Expand($tokens);"
1193
1194           Expands the given $tokens according to current definitions.
1195
1196       "$boxes = Digest($tokens);"
1197
1198           Processes and digestes the $tokens.  Any arguments needed by
1199           control sequences in $tokens must be contained within the $tokens
1200           itself.
1201
1202       "@tokens = Invocation($cs,@args);"
1203
1204           Constructs a sequence of tokens that would invoke the token $cs on
1205           the arguments.
1206
1207       "RawTeX('... tex code ...');"
1208
1209           RawTeX is a convenience function for including chunks of raw TeX
1210           (or LaTeX) code in a Package implementation.  It is useful for
1211           copying portions of the normal implementation that can be handled
1212           simply using macros and primitives.
1213
1214       "Let($token1,$token2);"
1215
1216           Gives $token1 the same `meaning' (definition) as $token2; like
1217           TeX's \let.
1218
1219       "StartSemiVerbatim(); ... ; EndSemiVerbatim();"
1220           Disable disable most TeX catcodes.
1221
1222       "$tokens = Tokenize($string);"
1223           Tokenizes the $string using the standard catcodes, returning a
1224           LaTeXML::Core::Tokens.
1225
1226       "$tokens = TokenizeInternal($string);"
1227           Tokenizes the $string according to the internal cattable (where @
1228           is a letter), returning a LaTeXML::Core::Tokens.
1229
1230   Argument Readers
1231       "ReadParameters($gullet,$spec);"
1232
1233           Reads from $gullet the tokens corresponding to $spec (a Parameters
1234           object).
1235
1236       "DefParameterType(type, code($gullet,@values), %options);"
1237
1238           Defines a new Parameter type, type, with code for its reader.
1239
1240           Options are:
1241
1242           "reversion=>code($arg,@values);"
1243               This code is responsible for converting a previously parsed
1244               argument back into a sequence of Token's.
1245
1246           "optional=>boolean"
1247               whether it is an error if no matching input is found.
1248
1249           "novalue=>boolean"
1250               whether the value returned should contribute to argument lists,
1251               or simply be passed over.
1252
1253           "semiverbatim=>boolean"
1254               whether the catcode table should be modified before reading
1255               tokens.
1256
1257       "<DefColumnType(proto, expansion);"
1258
1259           Defines a new column type for tabular and arrays.  proto is the
1260           prototype for the pattern, analogous to the pattern used for other
1261           definitions, except that macro being defined is a single character.
1262           The expansion is a string specifying what it should expand into,
1263           typically more verbose column specification.
1264
1265   Access to State
1266       "$value = LookupValue($name);"
1267
1268           Lookup the current value associated with the the string $name.
1269
1270       "AssignValue($name,$value,$scope);"
1271
1272           Assign $value to be associated with the the string $name, according
1273           to the given scoping rule.
1274
1275           Values are also used to specify most configuration parameters
1276           (which can therefore also be scoped).  The recognized configuration
1277           parameters are:
1278
1279            STRICT            : whether errors (eg. undefined macros)
1280                                are fatal.
1281            INCLUDE_COMMENTS  : whether to preserve comments in the
1282                                source, and to add occasional line
1283                                number comments. (Default true).
1284            PRESERVE_NEWLINES : whether newlines in the source should
1285                                be preserved (not 100% TeX-like).
1286                                By default this is true.
1287            SEARCHPATHS       : a list of directories to search for
1288                                sources, implementations, etc.
1289
1290       "PushValue($name,@values);"
1291
1292           This function, along with the next three are like "AssignValue",
1293           but maintain a global list of values.  "PushValue" pushes the
1294           provided values onto the end of a list.  The data stored for $name
1295           is global and must be a LIST reference; it is created if needed.
1296
1297       "UnshiftValue($name,@values);"
1298
1299           Similar to  "PushValue", but pushes a value onto the front of the
1300           list.  The data stored for $name is global and must be a LIST
1301           reference; it is created if needed.
1302
1303       "PopValue($name);"
1304
1305           Removes and returns the value on the end of the list named by
1306           $name.  The data stored for $name is global and must be a LIST
1307           reference.  Returns "undef" if there is no data in the list.
1308
1309       "ShiftValue($name);"
1310
1311           Removes and returns the first value in the list named by $name.
1312           The data stored for $name is global and must be a LIST reference.
1313           Returns "undef" if there is no data in the list.
1314
1315       "LookupMapping($name,$key);"
1316
1317           This function maintains a hash association named by $name.  It
1318           returns the value associated with $key within that mapping.  The
1319           data stored for $name is global and must be a HASH reference.
1320           Returns "undef" if there is no data associated with $key in the
1321           mapping, or the mapping is not (yet) defined.
1322
1323       "AssignMapping($name,$key,$value);"
1324
1325           This function associates $value with $key within the mapping named
1326           by $name.  The data stored for $name is global and must be a HASH
1327           reference; it is created if needed.
1328
1329       "$value = LookupCatcode($char);"
1330
1331           Lookup the current catcode associated with the the character $char.
1332
1333       "AssignCatcode($char,$catcode,$scope);"
1334
1335           Set $char to have the given $catcode, with the assignment made
1336           according to the given scoping rule.
1337
1338           This method is also used to specify whether a given character is
1339           active in math mode, by using "math:$char" for the character, and
1340           using a value of 1 to specify that it is active.
1341
1342       "$meaning = LookupMeaning($token);"
1343
1344           Looks up the current meaning of the given $token which may be a
1345           Definition, another token, or the token itself if it has not
1346           otherwise been defined.
1347
1348       "$defn = LookupDefinition($token);"
1349
1350           Looks up the current definition, if any, of the $token.
1351
1352       "InstallDefinition($defn);"
1353
1354           Install the Definition $defn into $STATE under its control
1355           sequence.
1356
1357       "XEquals($token1,$token2)"
1358           Tests whether the two tokens are equal in the sense that they are
1359           either equal tokens, or if defined, have the same definition.
1360
1361   Fonts
1362       "MergeFont(%fontspec); "
1363
1364           Set the current font by merging the font style attributes with the
1365           current font.  The %fontspec specifies the properties of the
1366           desired font.  Likely values include (the values aren't required to
1367           be in this set):
1368
1369            family : serif, sansserif, typewriter, caligraphic,
1370                     fraktur, script
1371            series : medium, bold
1372            shape  : upright, italic, slanted, smallcaps
1373            size   : tiny, footnote, small, normal, large,
1374                     Large, LARGE, huge, Huge
1375            color  : any named color, default is black
1376
1377           Some families will only be used in math.  This function returns
1378           nothing so it can be easily used in beforeDigest, afterDigest.
1379
1380       "DeclareFontMap($name,$map,%options);"
1381           Declares a font map for the encoding $name. The map $map is an
1382           array of 128 or 256 entries, each element is either a unicode
1383           string for the representation of that codepoint, or undef if that
1384           codepoint is not supported  by this encoding.  The only option
1385           currently is "family" used because some fonts (notably cmr!)  have
1386           different glyphs in some font families, such as
1387           "family="'typewriter'>.
1388
1389       "FontDecode($code,$encoding,$implicit);"
1390           Returns the unicode string representing the given codepoint $code
1391           (an integer) in the given font encoding $encoding.  If $encoding is
1392           undefined, the usual case, the current font encoding and font
1393           family is used for the lookup.  Explicit decoding is used when
1394           "\\char" or similar are invoked ($implicit is false), and the
1395           codepoint must be represented in the fontmap, otherwise undef is
1396           returned.  Implicit decoding (ie. $implicit is true) occurs within
1397           the Stomach when a Token's content is being digested and converted
1398           to a Box; in that case only the lower 128 codepoints are converted;
1399           all codepoints above 128 are assumed to already be Unicode.
1400
1401           The font map for $encoding is automatically loaded if it has not
1402           already been loaded.
1403
1404       "FontDecodeString($string,$encoding,$implicit);"
1405           Returns the unicode string resulting from decoding the individual
1406           characters in $string according to FontDecode, above.
1407
1408       "LoadFontMap($encoding);"
1409           Finds and loads the font map for the encoding named $encoding, if
1410           it hasn't been loaded before.  It looks for
1411           "encoding.fontmap.ltxml", which would typically define the font map
1412           using "DeclareFontMap", possibly including extra maps for families
1413           like "typewriter".
1414
1415   Color
1416       "$color=LookupColor($name);"
1417           Lookup the color object associated with $name.
1418
1419       "DefColor($name,$color,$scope);"
1420           Associates the $name with the given $color (a color object), with
1421           the given scoping.
1422
1423       "DefColorModel($model,$coremodel,$tocore,$fromcore);"
1424           Defines a color model $model that is derived from the core color
1425           model $coremodel.  The two functions $tocore and $fromcore convert
1426           a color object in that model to the core model, or from the core
1427           model to the derived model.  Core models are rgb, cmy, cmyk, hsb
1428           and gray.
1429
1430   Low-level Functions
1431       "CleanID($id);"
1432
1433           Cleans an $id of disallowed characters, trimming space.
1434
1435       "CleanLabel($label,$prefix);"
1436
1437           Cleans a $label of disallowed characters, trimming space.  The
1438           prefix $prefix is prepended (or "LABEL", if none given).
1439
1440       "CleanIndexKey($key);"
1441
1442           Cleans an index key, so it can be used as an ID.
1443
1444       "CleanBibKey($key);"
1445           Cleans a bibliographic citation key, so it can be used as an ID.
1446
1447       "CleanURL($url);"
1448
1449           Cleans a url.
1450
1451       "UTF($code);"
1452
1453           Generates a UTF character, handy for the the 8 bit characters.  For
1454           example, "UTF(0xA0)" generates the non-breaking space.
1455
1456       "@tokens = roman($number);"
1457
1458           Formats the $number in (lowercase) roman numerals, returning a list
1459           of the tokens.
1460
1461       "@tokens = Roman($number);"
1462
1463           Formats the $number in (uppercase) roman numerals, returning a list
1464           of the tokens.
1465

SEE ALSO

1467       See also LaTeXML::Global, LaTeXML::Common::Object,
1468       LaTeXML::Common::Error, LaTeXML::Core::Token, LaTeXML::Core::Tokens,
1469       LaTeXML::Core::Box, LaTeXML::Core::List, LaTeXML::Common::Number,
1470       LaTeXML::Common::Float, LaTeXML::Common::Dimension,
1471       LaTeXML::Common::Glue, LaTeXML::Core::MuDimension,
1472       LaTeXML::Core::MuGlue, LaTeXML::Core::Pair, LaTeXML::Core::PairList,
1473       LaTeXML::Common::Color, LaTeXML::Core::Alignment, LaTeXML::Common::XML,
1474       LaTeXML::Util::Radix.
1475

AUTHOR

1477       Bruce Miller <bruce.miller@nist.gov>
1478
1480       Public domain software, produced as part of work done by the United
1481       States Government & not subject to copyright in the US.
1482
1483
1484
1485perl v5.36.0                      2022-07-20               LaTeXML::Package(3)
Impressum