| author | aarne <aarne@cs.chalmers.se> | 2005-11-17 23:17:42 +0000 |
|---|---|---|
| committer | aarne <aarne@cs.chalmers.se> | 2005-11-17 23:17:42 +0000 |
| commit | 524c4829f9cc5720c18b8d43bd430d0627edcb89 | |
| tree | c10cc4dbb4b6f0bb5464369b1ed3d028c29fec18 /doc | |
| parent | e29a1430bf76b00c3714b72b7763190df6716081 | |
nondeterministic lexer, e.g. subseqs
Diffstat (limited to 'doc')
| -rw-r--r-- | doc/gf-history.html | 13 |
1 files changed, 13 insertions, 0 deletions
diff --git a/doc/gf-history.html b/doc/gf-history.html
index a6b3b970b..81ca8106a 100644
--- a/doc/gf-history.html
+++ b/doc/gf-history.html
@@ -13,6 +13,19 @@ Changes in functionality since May 17, 2005, release of GF Version 2.2
 </center> <p>
+17/11 (AR) Made it possible for lexers to be nondeterministic.
+Now with a simple-minded implementation that the parser is sent
+each lexing result in turn. The option <tt>-cut</tt> is used for
+breaking after first lexing leading to successful parse. The only
+nondeterministic lexer right now is <tt>-lexer=subseqs</tt>, which
+first filters with <tt>-lexer=ignore</tt> (dropping words neither in
+the grammar nor literals) and then starts ignoring other words from
+longest to shortest subsequence. This is usable for parser tasks
+of keyword spotting type, but expensive (2<sup>n</sup>) in long input.
+A smarter implementation is therefore desirable.
+
+<p>
+
 14/11 (AR) Functions can be made unparsable (or "internal" as in BNFC). This is done by <tt>i -noparse=file</tt>, where the nonparsable functions are given in <tt>file</tt> using the
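The behaviour described in the added changelog entry can be illustrated with a small Python sketch. This is not GF's implementation, only a model of the idea under stated assumptions: the names `subseq_lexings`, `parse_all`, `toy_parse` and the `known` word set are hypothetical, the filtering step keeps only words in `known` (literals are not modelled), and the `cut` flag stands in for the `-cut` option. It shows why the cost grows as 2<sup>n</sup>: n surviving words have 2<sup>n</sup> - 1 non-empty subsequences, and each one may be handed to the parser.

```python
from itertools import combinations


def subseq_lexings(tokens, known):
    """Yield candidate lexings, from the longest subsequence to the shortest.

    Assumption: `known` approximates "words in the grammar"; literals,
    which the real ignore filter also keeps, are not modelled here.
    """
    kept = [t for t in tokens if t in known]  # analogue of the ignore filter
    for size in range(len(kept), 0, -1):
        # combinations() preserves the original word order in each choice
        for idxs in combinations(range(len(kept)), size):
            yield [kept[i] for i in idxs]


def parse_all(tokens, known, parse, cut=False):
    """Send each lexing to `parse` in turn and collect the results.

    With cut=True, stop after the first lexing that parses successfully.
    """
    results = []
    for lexing in subseq_lexings(tokens, known):
        trees = parse(lexing)
        if trees:
            results.extend(trees)
            if cut:
                break
    return results


def toy_parse(lexing):
    """Hypothetical parser that accepts exactly the phrase 'open door'."""
    return ["OpenDoor"] if lexing == ["open", "door"] else []


if __name__ == "__main__":
    tokens = "please open the red door now".split()
    known = {"open", "the", "red", "door"}
    print(parse_all(tokens, known, toy_parse, cut=True))
```

In this sketch the four surviving words produce 15 candidate lexings, and a real parser call per candidate is what makes the approach expensive on long input.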
