summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Collapse)Author
2013-03-26minibar: show parse trees without or with function nameshallgren
Click once to show the parse tree without function names. Click again to add function names.
2013-03-26PGF service: support all parse tree visualziation options in the web APIhallgren
The new options are * noleaves, nofun, nocat (booleans, false by default) * nodefont, leaffont, nodecolor, leafcolor, nodeedgestyle, leafedgestyle (strings, empty by default)
2013-03-26haddock bug workaroundhallgren
2013-03-25minibar: initial support for selecting an arbitrary subset of target languageshallgren
This affects which translations are shown and which languages are included in the word alignment diagrams.
2013-03-20the calculation of lexical_prob in the statistical parser doesn't work ↵kr.angelov
properly. It should be fixed but for now I just disabled the optimization
2013-03-20replace #if with #ifdef when checking for the optional bottom up filtering ↵kr.angelov
in the C runtime
2013-03-19the bottom up filtering in the C runtime is temporary disabled. It takes too ↵kr.angelov
much memory and even makes it impossible to load the Finnish and the German parsing grammars.
2013-03-16Fix a problem with pattern macros in pre { } expressionshallgren
The old partial evaluator has special rules to convert pattern macros in pre { } expressions. These rules were missing in the new partial evaluator.
2013-03-14bug fix in the linearizer in the C runtimekr.angelov
2013-03-14update the pgf-service tool from the C runtime after the changes in the APIkr.angelov
2013-03-13PGF web API: add 'to' parameter to the 'alignment' commandhallgren
Word alignment diagrams can now be restricted to a subset of the languages supported by the grammar.
2013-03-13PGF web API: generalize the 'to' parameter to accept a list of languageshallgren
Some commands (linearize, linearizeAll, random, generate, translate and translategroup) by default produce output in all languages supported by the grammar and the 'to' parameter could be used to restrict output to a single language. Now you can restrict the output to a list of languages. Languages should be separated by spaces. Also removed an unnecessary LANGUAGE pragma and reduced code verbosity.
2013-03-13pt -nub to remove duplicate trees from a list returned e.g. by a parseraarne
2013-03-12partial evaluator: push predefined functions inside variantshallgren
This should prevent errors like Internal error in Compute.ConcreteNew: Applying Predef.drop: Expected a value of type String, got VFV [VString "gewandt",VString "gewendet"]
2013-03-12command pt -subtrees that analyses a tree into the set of subtrees. Using pt ↵aarne
-subtrees <bigtree> | l -treebank for debugging the lin of a big tree
2013-03-11bugfix in the statistical parserkr.angelov
2013-03-11Additional changes for GHC 7.4 & 7.6 compatibilityhallgren
2013-03-09ghc-7.6: allow directory-1.2Sergei Trofimovich
Get rid of old-time depend (and ClockTime in favour of UTCTime). time-compat helps to retain backward compatibility with directory-1.1 and lower.
2013-03-09ghc-7.6: fix src/server/PGFService.hs (use catch from base-4 Control.Exception)Sergei Trofimovich
2013-03-09ghc-7.6: add missing Num instance for BitsSergei Trofimovich
Fixes the following build failure: src/runtime/haskell/Data/Binary/IEEE754.lhs:256:17: Could not deduce (Num a) arising from a use of `mask' from the context (Bits a) bound by the type signature for clamp :: Bits a => BitCount -> a -> a
2013-02-28Fix a bug that could cause "Prelude.head: empty list"hallgren
In Data.Operations, the function topoTest2 assumed too much about the form of the input, compared to the older function topoTest.
2013-02-28pattern match length estimation code simplicationhallgren
2013-02-27Faster regular expression pattern matching in the grammar compiler.hallgren
The sequence operator (x+y) was implemented by splitting the string to be matched at all positions and trying to match the parts against the two subpatterns. To reduce the number of splits, we now estimate the minimum and maximum length of the string that the subpatterns could match. For common cases, where one of the subpatterns is a string of known length, like in (x+"y") or (x + ("a"|"o"|"u"|"e")+"y"), only one split will be tried.
2013-02-27GF grammar pretty printer improvementshallgren
Allow line breaks in more places to make large terms more readable.
2013-02-26added gu_buf_flush in seq.c which removes all elements from a bufferkr.angelov
2013-02-19the parser in the C runtime should not crash if the start category is not ↵kr.angelov
defined
2013-02-19bugfix in the grammar reader in the C runtimekr.angelov
2013-02-13bugfix in the linearizer for the C runtimekr.angelov
2013-02-13Fix for a PGF portability problemhallgren
GF produced slightly different PGF files on 64-bit systems and 32-bit systems. This could cause problems when a PGF was produced on a 32-bit system and used on a 64-bit system. To fix this, the GF compiler and the Haskell PGF run-time library now reads and writes PGF files like the 32-bit version even when compiled on a 64-bit system. Note: the Haskell type Int is still used internally in GF, which could be 32 bits or 64 bits...
2013-02-12now the beam size for the statistical parser can be configured by using the ↵kr.angelov
flag beam_size in the top-level concrete module
2013-02-12the statistical parser now uses a baseline lexical estimation of the beam sizekr.angelov
2013-02-11translator: add some menu padding for touch deviceshallgren
2013-02-11translator: small style changehallgren
2013-02-11translator: remove debug codehallgren
2013-02-11translator: "Open..." doesn't discard the current document anymorehallgren
It shows available documents and a Cancel button on top of the current document.
2013-02-11the class PgfConcr from the Python binding now has a property name which ↵kr.angelov
returns the name of the concrete syntax
2013-02-11refactoring: now all named objects in the C runtime have an explicit name fieldkr.angelov
2013-02-11remove the pgf2yaml tool which was both broken and redundant. The ↵kr.angelov
declarations for generic programming from data.c are removed as well
2013-02-11the grammar reader in the C runtime is completely rewritten and it doesn't ↵kr.angelov
use the generic programming API
2013-02-03pg -lexc now writes a list of multichar symbols and a title ("Root") for the ↵aarne
lexicon, as required by Xerox lexc
2013-02-01This patch removes Gregoire's parse_tokens function in the python binding ↵kr.angelov
and adds another implementation which builds on the existing API for lexers in the C runtime. Now it is possible to write incremental Lexers in Python
2013-02-01implement gu_exn_caught in gu/exn.c. It was missingkr.angelov
2013-01-29Better error message for unsupported token gluinghallgren
Instead of "Internal error in ...", you now get a proper error message with a source location and a function name.
2013-01-29Fix a bug with record extensionhallgren
Add a conversion rule for ({ l1 = e } ** x).l2 in PMCFG generation. (A rule for the symmetric case (x ** { l1 = e }).l2 was added some time ago.)
2013-01-29Avoid crash in random generation with probabilitieshallgren
2013-01-29bugfix in the reference counting for Pythonkr.angelov
2013-01-29the Python binding is in pure C againkr.angelov
2013-01-29added an API for composing and decomposing abstract trees from Pythonkr.angelov
2013-01-29fixed typos in the python binding: in a few places pgf_ExprType was used ↵kr.angelov
instead of pgf_ExprIterType
2013-01-28Quick fix to render some parser error messages from UTF-8-encoded source ↵hallgren
files correctly. The parser works on raw byte sequences read from source files. If parsing succeeds the raw byte sequences are converted to proper Unicode characters in a later phase. But the parser calls the function buildAnyTree, which can fail and generate error messages containing source code fragments, which might then containing raw byte sequences. To render these error messages correctly, they need to be converted in accordance with the coding flag in the source file. This is now done for UTF-8-encoded source files, but should ideally also be done for other character encodings. (Latin-1-encoded files never suffered from this problem, since raw bytes are proper Unicode characters in this case.)