summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2013-12-10bugfix in the grammar splitterkr.angelov
2013-12-10option --split-pgf replaces option --mk-index. This splits the PGF into one ↵kr.angelov
file for the abstract and one more for each concrete syntax. This is a preparation for being able to load only specific languages from the whole grammar.
2013-12-09new inflection_fi.xml, for stemmed and refactored ParseFinaarne
2013-12-07make ParseFin compile with Constructionaarne
2013-12-06testsuite: reduced the number of failing tests from 25 to 13hallgren
Some tests may fail on Windows because of \ instead of / in paths.
2013-12-06Show relative file paths in error messageshallgren
This is to avoid one trivial reason for failures in the test suite.
2013-12-06partial reconstruction for sentence structures in Susannekr.angelov
2013-12-06GF.Compile.CheckGrammar: add source path to error message for circular ↵hallgren
definitions
2013-12-06GF shell: fix help text for generate_treeshallgren
Trees are not generated with increasing depth.
2013-12-05more on the Susanne treebankkr.angelov
2013-12-04added script for reading the Susanne treebankkr.angelov
2013-12-03testsuite: Use Cabal's new test suite interfacehallgren
* The old way: a user hook in Setup.hs * The new way: specify it in gf.cabal * The test suite is now called gf-tests, and it runs testsuite/run.hs. * You can run it manually with 'runhaskell testsuite/run.hs'. It also runs, together with rgl-tests, when you do 'cabal test' * Currently only 9 of 34 tests pass. Many failures have silly causes: - Error messages that look slightly different - Same output but in a different order - Absolute paths in output
2013-12-03testsuite: use forward slashes in pathshallgren
2013-12-03removed the unlines-lines wrapper from Lexing.unlexer to prevent empty lines ↵aarne
when an unlexer (such as -bind or -unchars) is used as an option in linearization. Don't know really why the input had been broken into lines in the first place. You can see the effect by importing LangEng and running "gr -cat=Cl | l -table -bind" before and after recompiling GF.
2013-12-03fix in inflection_en.xmlkr.angelov
2013-12-02ui/android/.../Translate.java uses Parse8 (change the name and language list ↵aarne
if you need something else)
2013-11-29examples/phrasebook/Makefile2: Romanian needs a larger stackhallgren
2013-11-29Commment code and options relating to the old partial evaluatorhallgren
This means that the -old-comp and -new-comp flags are not recognized anymore. The only functional difference is that printnames were still normalized with the old partial evaluator. Now that is done with the new partial evaluator.
2013-11-29Minor release/download/developer doc changeshallgren
2013-11-29Move typePredefined from GF.Compile.Compute.AppPredefined to ↵hallgren
GF.Compile.TypeCheck.Primitives Also move the list of primitives
2013-11-29Move checkPredefError from GF.Compile.Compute.ConcreteLazy to GF.Grammar.Macroshallgren
Also simplified its type.
2013-11-29-optimize-pgf should also apply to the linrefskr.angelov
2013-11-28added caching to be able to quickly generate the inflection table for Finnishkr.angelov
2013-11-29examples/foods: remove all -path flagshallgren
All languages n the Foods grammar compile fine with alltenses.
2013-11-29added forms to Finnish xml tableaarne
2013-11-28html templates for Finnish noun and verb inflection startedaarne
2013-11-28bugfix in the android appkr.angelov
2013-11-28now the TTS is used only if there is actually a language model. this is ↵kr.angelov
better than using English for speaking Swedish
2013-11-28a bit of cleanup in the Android appkr.angelov
2013-11-28extension in the template format for inflection tables. This makes it ↵kr.angelov
possible to use the templates with Finish
2013-11-28GFI: adapt to character encoding changeshallgren
GF.Compile.Coding is not used any more.
2013-11-28SimpleEditor.Convert: adapt to recent character encoding changeshallgren
Things got simpler!
2013-11-28the Android App now show the valency patterns for verbs in English, ↵kr.angelov
Bulgarian and Swedish
2013-11-28Translate grammar for Ger and Hin added. Some functions in Ger phrasebook ↵aarne
need to be commented out to compile. Now a system for 7 languages can be built, but including Fre as well is too heavy on my laptop although it can be compiled separately.
2013-11-27better dictionary visualization in the Android Appkr.angelov
2013-11-27Adding an alternate style for the GF home pagehallgren
2013-11-27minor tweaks in the Android UIkr.angelov
2013-11-27fix the character encoding problem in the web view in the Android UIkr.angelov
2013-11-27added API in the C runtime and the Java binding for checking whether a given ↵kr.angelov
function is linearizable in a given language. This is used in the Android UI for better vizualizations
2013-11-27update the project files for the Java bindingkr.angelov
2013-11-27space leak in the Java bindingkr.angelov
2013-11-26Represent identifiers as UTF-8-encoded ByteStringshallgren
This was a fairly simple change thanks to previous work on making the Ident type abstract and the fact that PGF.CId already uses UTF-8-encoded ByteStrings. One potential pitfall is that Data.ByteString.UTF8 uses the same type for ByteStrings as Data.ByteString. I renamed ident2bs to ident2utf8 and bsCId to utf8CId, to make it clearer that they work with UTF-8-encoded ByteStrings. Since both the compiler input and identifiers are now UTF-8-encoded ByteStrings, the lexer now creates identifiers without copying any characters. **END OF DESCRIPTION*** Place the long patch description above the ***END OF DESCRIPTION*** marker. The first line of this file will be the patch name. This patch contains the following changes: M ./src/compiler/GF/Compile/CheckGrammar.hs -3 +3 M ./src/compiler/GF/Compile/GrammarToPGF.hs -2 +2 M ./src/compiler/GF/Grammar/Binary.hs -5 +1 M ./src/compiler/GF/Grammar/Lexer.x -11 +13 M ./src/compiler/GF/Infra/Ident.hs -19 +36 M ./src/runtime/haskell/PGF.hs -1 +1 M ./src/runtime/haskell/PGF/CId.hs -2 +3
2013-11-25Change how GF deals with character encodings in grammar fileshallgren
1. The default encoding is changed from Latin-1 to UTF-8. 2. Alternate encodings should be specified as "--# -coding=enc", the old "flags coding=enc" declarations have no effect but are still checked for consistency. 3. A transitional warning is generated for files that contain non-ASCII characters without specifying a character encoding: "Warning: default encoding has changed from Latin-1 to UTF-8" 4. Conversion to Unicode is now done *before* lexing. This makes it possible to allow arbitrary Unicode characters in identifiers. But identifiers are still stored as ByteStrings, so they are limited to Latin-1 characters for now. 5. Lexer.hs is no longer part of the repository. We now generate the lexer from Lexer.x with alex>=3. Some workarounds for bugs in alex-3.0 were needed. These bugs might already be fixed in newer versions of alex, but we should be compatible with what is shipped in the Haskell Platform.
2013-11-26examples/phrasebook: add Makefile2 for parallel grammar compilationhallgren
Makefile2 specificies the dependencies accurately enough that you can use make's support for parallel compilation. Run make -f Makefile2 -j to utilize all the cores in your computer to significantly speed up the creation of Phrasebook.pgf.
2013-11-26examples/letter: convert grammar files to UTF-8hallgren
2013-11-25GF home page: mention upcoming default character encoding changehallgren
2013-11-25Setup.hs: avoid a problem with Cabal-1.18hallgren
In Cabal-1.18, the build command takes some new arguments. The Setup.hs script should not die if these are present.
2013-11-25Document the upcoming default character encoding change in the release noteshallgren
2013-11-22examples/phrasebook: adding --# -coding=latin1 to 18 moduleshallgren
Adding coding pragmas --# -coding=latin1 so that grammars will continue to work when we change the default character encoding to UTF-8.
2013-11-22examples/foods: add --# -coding=latin1hallgren
Adding coding pragmas --# -coding=latin1 so that grammars will continue to work when we change the default character encoding to UTF-8.