summaryrefslogtreecommitdiff
path: root/src/compiler/GF/Text
AgeCommit message (Collapse)Author
2013-11-25Change how GF deals with character encodings in grammar fileshallgren
1. The default encoding is changed from Latin-1 to UTF-8. 2. Alternate encodings should be specified as "--# -coding=enc", the old "flags coding=enc" declarations have no effect but are still checked for consistency. 3. A transitional warning is generated for files that contain non-ASCII characters without specifying a character encoding: "Warning: default encoding has changed from Latin-1 to UTF-8" 4. Conversion to Unicode is now done *before* lexing. This makes it possible to allow arbitrary Unicode characters in identifiers. But identifiers are still stored as ByteStrings, so they are limited to Latin-1 characters for now. 5. Lexer.hs is no longer part of the repository. We now generate the lexer from Lexer.x with alex>=3. Some workarounds for bugs in alex-3.0 were needed. These bugs might already be fixed in newer versions of alex, but we should be compatible with what is shipped in the Haskell Platform.
2013-06-15Improvements In Sindhi RGvirk.shafqat
2013-06-02GF.Text.Transliterations: avoid error prone function Data.Map.fromAscListhallgren
2013-05-31Prasad's sanskrit transliteration ; MiniresourceSan now compiles but is ↵aarne
mostly incorrect due to missing paradigms
2012-11-05unicode4k-changedvirk.shafqat
2012-03-26compiler/GF/Text/Coding.hs: fix build failure against ghc-7.2Sergei Trofimovich
2012-02-23hindi-resource-grammarvirk.shafqat
2012-02-21sindhipatchvirk.shafqat
2011-09-15made ps -from_TRANSLIT symmetric to -to_TRANSLIT in the sense that unknown ↵aarne
characters are returned as themselves and not as question marks
2011-06-20refinementNepali-11-06-20virk.shafqat
2011-06-14allow empty lines in transliteration filesaarne
2011-05-19refinementsTextUrd-11-05-19virk.shafqat
2011-05-06fixed problems in persian transliteration pointed out by Elnazaarne
2011-05-02transliteration via configuration file: ps -to=file or ps -from=fileaarne
2011-02-06a simple clitic analysis command 'ca'aarne
2011-01-31corrections to ancientgreek encoding by Hans Leissaarne
2010-11-25DiffUrd and Hin; updated Transliteration.hsaarne
2010-05-07Amharic transliteration by Markosaarne
2010-04-19use the native unicode support from GHC 6.12krasimir
2010-04-01Urdu transliteration fixed (by Shafqat)aarne
2010-03-23added codepage for Turkishkrasimir
2010-03-23added comment to every GF.Text.CPxxxx module about the purpose of the codepagekrasimir
2010-03-22transliteration for Urdukrasimir
2009-12-17correct capitalization in unlexmixed; unlextext and unlexmixed now remove ↵aarne
string literal quotes
2009-12-13reorganize the directories under src, and rescue the JavaScript interpreter ↵krasimir
from deprecated