1pt::pgen(n)                      Parser Tools                      pt::pgen(n)
2
3
4
5______________________________________________________________________________
6

NAME

8       pt::pgen - Parser Generator
9

SYNOPSIS

11       package require Tcl  8.5
12
13       package require pt::pgen  ?1.1?
14
15       ::pt::pgen inputformat text resultformat ?options...?
16
17______________________________________________________________________________
18

DESCRIPTION

20       Are  you  lost ?  Do you have trouble understanding this document ?  In
21       that case please read the overview  provided  by  the  Introduction  to
22       Parser  Tools.  This document is the entrypoint to the whole system the
23       current package is a part of.
24
25       This package provides a command implementing a parser generator  taking
26       parsing expression grammars as input.
27
28       It is the implementation of method generate of pt, the Parser Tools Ap‐
29       plication.
30
31       As such the intended audience of this document are  people  wishing  to
32       modify  and/or  extend  this part of pt's functionality. Users of pt on
33       the other hand are hereby refered to the  applications'  manpage,  i.e.
34       Parser Tools Application.
35
36       It resides in the User Package Layer of Parser Tools.
37
38       IMAGE: arch_user_pkg
39

API

41       ::pt::pgen inputformat text resultformat ?options...?
42              This  command  takes  the parsing expression grammar in text (in
43              the format specified by inputformat), and returns the same gram‐
44              mar in the format resultformat as the result of the command.
45
46              The  two known input formats are peg and json.  Introductions to
47              them, including their formal specifications, can be found in the
48              PEG  Language Tutorial and The JSON Grammar Exchange Format. The
49              packages used to parse these formats are
50
51              peg    pt::peg::from::peg
52
53              json   pt::peg::from::json
54
55       On the output side the known formats, and the packages used to generate
56       them are
57
58              c      pt::peg::to::cparam
59
60              container
61                     pt::peg::to::container
62
63              critcl pt::peg::to::cparam + pt::cparam::configuration::critcl
64
65              json   pt::peg::to::json
66
67              oo     pt::peg::to::tclparam      +     pt::tclparam::configura‐
68                     tion::tcloo
69
70              peg    pt::peg::to::peg
71
72              snit   pt::peg::to::tclparam + pt::tclparam::configuration::snit
73
74              The options supported by each of these  formats  are  documented
75              with their respective packages.
76

EXAMPLE

78       In  this section we are working a complete example, starting with a PEG
79       grammar and ending with running the parser generated from it over  some
80       input, following the outline shown in the figure below:
81
82       IMAGE: flow
83
84       Our grammar, assumed to the stored in the file "calculator.peg" is
85
86
87              PEG calculator (Expression)
88                  Digit      <- '0'/'1'/'2'/'3'/'4'/'5'/'6'/'7'/'8'/'9'       ;
89                  Sign       <- '-' / '+'                                     ;
90                  Number     <- Sign? Digit+                                  ;
91                  Expression <- Term (AddOp Term)*                            ;
92                  MulOp      <- '*' / '/'                                     ;
93                  Term       <- Factor (MulOp Factor)*                        ;
94                  AddOp      <- '+'/'-'                                       ;
95                  Factor     <- '(' Expression ')' / Number                   ;
96              END;
97
98       From this we create a snit-based parser using the script "gen"
99
100
101              package require Tcl 8.5
102              package require fileutil
103              package require pt::pgen
104
105              lassign $argv name
106              set grammar [fileutil::cat $name.peg]
107              set pclass  [pt::pgen peg $gr snit -class $name -file  $name.peg -name  $name]
108              fileutil::writeFile $name.tcl $pclass
109              exit 0
110
111       calling it like
112
113               tclsh8.5 gen calculator
114       which  leaves  us with the parser package and class written to the file
115       "calculator.tcl".  Assuming that this  package  is  then  properly  in‐
116       stalled  in a place where Tcl can find it we can now use this class via
117       a script like
118
119
120                  package require calculator
121
122                  lassign $argv input
123                  set channel [open $input r]
124
125                  set parser [calculator]
126                  set ast [$parser parse $channel]
127                  $parser destroy
128                  close $channel
129
130                  ... now process the returned abstract syntax tree ...
131
132       where the abstract syntax tree stored in the variable will look like
133
134              set ast {Expression 0 4
135                  {Factor 0 4
136                      {Term 0 2
137                          {Number 0 2
138                              {Digit 0 0}
139                              {Digit 1 1}
140                              {Digit 2 2}
141                          }
142                      }
143                      {AddOp 3 3}
144                      {Term 4 4
145                          {Number 4 4
146                              {Digit 4 4}
147                          }
148                      }
149                  }
150              }
151
152
153       assuming that the input file and channel contained the text
154
155               120+5
156       A more graphical representation of the tree would be
157
158       .nf +- Digit 0 0 | 1 |            | +- Term 0 2  ---  Number  0  2  -+-
159       Digit   1   1   |   2   |                            |             |  |
160       +- Digit 2 2 | 0 |                                        |  Expression
161       0  4  ---  Factor  0  4 -+----------------------------- AddOp 3 3 | + |
162       | +- Term 4 4 --- Number 4 4 --- Digit 4 4 | 5 .fi
163
164       Regardless, at this point it is the user's responsibility to work  with
165       the tree to reach whatever goal she desires. I.e. analyze it, transform
166       it, etc. The package pt::ast should be of help here, providing commands
167       to walk such ASTs structures in various ways.
168
169       One important thing to note is that the parsers used here return a data
170       structure representing the structure of the input per the  grammar  un‐
171       derlying the parser. There are no callbacks during the parsing process,
172       i.e. no parsing actions, as most other parsers will have.
173
174       Going back to the last snippet of code, the execution of the parser for
175       some  input,  note how the parser instance follows the specified Parser
176       API.
177

BUGS, IDEAS, FEEDBACK

179       This document, and the package it describes, will  undoubtedly  contain
180       bugs  and other problems.  Please report such in the category pt of the
181       Tcllib Trackers  [http://core.tcl.tk/tcllib/reportlist].   Please  also
182       report  any  ideas  for  enhancements  you  may have for either package
183       and/or documentation.
184
185       When proposing code changes, please provide unified diffs, i.e the out‐
186       put of diff -u.
187
188       Note  further  that  attachments  are  strongly  preferred over inlined
189       patches. Attachments can be made by going  to  the  Edit  form  of  the
190       ticket  immediately  after  its  creation, and then using the left-most
191       button in the secondary navigation bar.
192

KEYWORDS

194       EBNF, LL(k), PEG, TDPL, context-free  languages,  expression,  grammar,
195       matching,  parser, parsing expression, parsing expression grammar, push
196       down automaton, recursive descent, state, top-down  parsing  languages,
197       transducer
198

CATEGORY

200       Parsing and Grammars
201
203       Copyright (c) 2009 Andreas Kupries <andreas_kupries@users.sourceforge.net>
204
205
206
207
208tcllib                                1.1                          pt::pgen(n)
Impressum