1grammar::me::cpu(n) Grammar operations and usage grammar::me::cpu(n)
2
3
4
5______________________________________________________________________________
6
8 grammar::me::cpu - Virtual machine implementation II for parsing token
9 streams
10
12 package require Tcl 8.4
13
14 package require grammar::me::cpu ?0.2?
15
16 ::grammar::me::cpu meName matchcode
17
18 meName option ?arg arg ...?
19
20 meName lc location
21
22 meName tok ?from ?to??
23
24 meName pc state
25
26 meName iseof state
27
28 meName at state
29
30 meName cc state
31
32 meName sv
33
34 meName ok
35
36 meName error
37
38 meName lstk state
39
40 meName astk state
41
42 meName mstk state
43
44 meName estk state
45
46 meName rstk state
47
48 meName nc state
49
50 meName ast
51
52 meName halted
53
54 meName code
55
56 meName eof
57
58 meName put tok lex line col
59
60 meName putstring string lvar cvar
61
62 meName run ?n?
63
64 meName pull nextcmd
65
66 meName reset
67
68 meName destroy
69
70_________________________________________________________________
71
73 This package provides an implementation of the ME virtual machine.
74 Please go and read the document grammar::me_intro first if you do not
75 know what a ME virtual machine is.
76
77 This implementation provides an object-based API and the machines are
78 not truly tied to Tcl. A C implementation of the same API is quite pos‐
79 sible.
80
81 Internally the package actually uses the value-based machine manipula‐
82 tion commands as provided by the package grammar::me::cpu::core to per‐
83 form its duties.
84
86 CLASS API
87 The package directly provides only a single command for the construc‐
88 tion of ME virtual machines.
89
90 ::grammar::me::cpu meName matchcode
91 The command creates a new ME machine object with an associated
92 global Tcl command whose name is meName. This command may be
93 used to invoke various operations on the machine. It has the
94 following general form:
95
96 meName option ?arg arg ...?
97 Option and the args determine the exact behavior of the
98 command.
99
100 The argument matchcode contains the match instructions the machine has
101 to execute while parsing the input stream. Please read section MATCH
102 CODE REPRESENTATION of the documentation for the package gram‐
103 mar::me::cpu::core for the specification of the structure of this
104 value.
105
106 The tokmap argument taken by the implementation provided by the package
107 grammar::me::tcl is here hidden inside of the match instructions and
108 therefore not needed.
109
110 OBJECT API
111 All ME virtual machine objects created by the class command specified
112 in section CLASS API support the methods listed below.
113
114 The machines provided by this package provide methods for operation in
115 both push- and pull-styles. Push-style means that tokens are pushed
116 into the machine state when they arrive, triggering further execution
117 until they are consumed. In other words, this allows the machine to be
118 suspended and resumed at will and an arbitrary number of times, the
119 quasi-parallel operation of several machines, and the operation as part
120 of the event loop.
121
122 meName lc location
123 This method converts the location of a token given as offset in
124 the input stream into the associated line number and column
125 index. The result of the command is a 2-element list containing
126 the two values, in the order mentioned in the previous sentence.
127 This allows higher levels to convert the location information
128 found in the error status and the generated AST into more human
129 readable data.
130
131 Note that the command is not able to convert locations which
132 have not been reached by the machine yet. In other words, if the
133 machine has read 7 tokens the command is able to convert the
134 offsets 0 to 6, but nothing beyond that. This also shows that it
135 is not possible to convert offsets which refer to locations
136 before the beginning of the stream.
137
138 meName tok ?from ?to??
139 This method returns a Tcl list containing the part of the input
140 stream between the locations from and to (both inclusive). If to
141 is not specified it will default to the value of from. If from
142 is not specified either the whole input stream is returned.
143
144 Each element of the returned list is a list of four elements,
145 the token, its associated lexeme, line number, and column index,
146 in this order. This command places the same restrictions on its
147 location arguments as the method lc.
148
149 meName pc state
150 This method takes the state value of a ME virtual machine and
151 returns the current value of the stored program counter.
152
153 meName iseof state
154 This method takes the state value of a ME virtual machine and
155 returns the current value of the stored eof flag.
156
157 meName at state
158 This method takes the state value of a ME virtual machine and
159 returns the current location in the input stream.
160
161 meName cc state
162 This method takes the state value of a ME virtual machine and
163 returns the current token.
164
165 meName sv
166 This command returns the current semantic value SV stored in the
167 machine. This is an abstract syntax tree as specified in the
168 document grammar::me_ast, section AST VALUES.
169
170 meName ok
171 This method returns the current match status OK.
172
173 meName error
174 This method returns the current error status ER.
175
176 meName lstk state
177 This method takes the state value of a ME virtual machine and
178 returns the location stack.
179
180 meName astk state
181 This method takes the state value of a ME virtual machine and
182 returns the AST stack.
183
184 meName mstk state
185 This method takes the state value of a ME virtual machine and
186 returns the AST marker stack.
187
188 meName estk state
189 This method takes the state value of a ME virtual machine and
190 returns the error stack.
191
192 meName rstk state
193 This method takes the state value of a ME virtual machine and
194 returns the subroutine return stack.
195
196 meName nc state
197 This method takes the state value of a ME virtual machine and
198 returns the nonterminal match cache as a dictionary.
199
200 meName ast
201 This method returns the current top entry of the AST stack AS.
202 This is an abstract syntax tree as specified in the document
203 grammar::me_ast, section AST VALUES.
204
205 meName halted
206 This method returns a boolean value telling the caller whether
207 the engine has halted execution or not. Halt means that no fur‐
208 ther matching is possible, and the information retrieved via the
209 other method is final. Attempts to run the engine will be
210 ignored, until a reset is made.
211
212 meName code
213 This method returns the code information used to construct the
214 object. In other words, the match program executed by the
215 machine.
216
217 meName eof
218 This method adds an end of file marker to the end of the input
219 stream. This signals the machine that the current contents of
220 the input queue are the final parts of the input and nothing
221 will come after. Attempts to put more characters into the queue
222 will fail.
223
224 meName put tok lex line col
225 This method adds the token tok to the end of the input stream,
226 with associated lexeme data lex and line/column information.
227
228 meName putstring string lvar cvar
229 This method adds each individual character in the string as a
230 token to the end of the input stream, from first to last. The
231 lexemes will be empty and the line/col information is computed
232 based on the characters encountered and the data in the vari‐
233 ables lvar and cvar.
234
235 meName run ?n?
236 This methods causes the engine to execute match instructions
237 until either
238
239 · n instructions have been executed, or
240
241 · a halt instruction was executed, or
242
243 · the input queue is empty and the code is asking for more
244 tokens to process.
245
246 If no limit n was set only the last two conditions are checked for.
247
248 meName pull nextcmd
249 This method implements pull-style operation of the machine. It
250 causes it to execute match instructions until either a halt
251 instruction is reached, or the command prefix nextcmd ceases to
252 deliver more tokens.
253
254 The command prefix nextcmd represents the input stream of char‐
255 acters and is invoked by the machine whenever the a new charac‐
256 ter from the stream is required. The instruction for handling
257 this is ict_advance. The callback has to return either the
258 empty list, or a list of 4 elements containing the token, its
259 lexeme attribute, and its location as line number and column
260 index, in this order. The empty list is the signal that the end
261 of the input stream has been reached. The lexeme attribute is
262 stored in the terminal cache, but otherwise not used by the
263 machine.
264
265 The end of the input stream for this method does not imply that
266 method eof is called for the machine as a whole. By avoiding
267 this and still asking for an explicit call of the method it is
268 possible to mix push- and pull-style operation during the life‐
269 time of the machine.
270
271 meName reset
272 This method resets the machine to its initial state, discarding
273 any state it may have.
274
275 meName destroy
276 This method deletes the object and releases all resurces it
277 claimed.
278
280 grammar, parsing, virtual machine
281
283 Copyright (c) 2005-2006 Andreas Kupries <andreas_kupries@users.sourceforge.net>
284
285
286
287
288grammar_me 0.2 grammar::me::cpu(n)