path: root/treebanks
| Age | Commit message | Author |
|---|---|---|
| 2015-11-18 | some more conversion patterns for Susanne | krasimir |
| 2015-11-13 | a new version of the conversion script for the Susanne corpus which covers a bit less than half of the data | krasimir |
| 2014-10-10 | a refactoring in the Susanne converter which for some reason was not pushed before | kr.angelov |
| 2013-12-06 | partial reconstruction for sentence structures in Susanne | kr.angelov |
| 2013-12-05 | more on the Susanne treebank | kr.angelov |
| 2013-12-04 | added script for reading the Susanne treebank | kr.angelov |
| 2013-11-06 | the content of ParseEngAbs3.probs is now merged with ParseEngAbs.probs. The later is now retrained. Once the grammar is compiled with the .probs file now it doesn't need anything more to do robust parsing. The robustness itself is controlled by the flags 'heuristic_search_factor', 'meta_prob' and 'meta_token_prob' in ParseEngAbs.gf | kr.angelov |
| 2013-09-20 | work on the translation dictionaries and the statistical model | kr.angelov |
| 2013-03-28 | added simple script for estimating the coverage on the PennTreebank | kr.angelov |
| 2013-03-27 | added a few more multiword expressions in DictEng and a few words in the abstract syntax are not tagged with their senses. There is a new statistical model too | kr.angelov |
| 2013-03-25 | added the file treebanks/PennTreebank/ParseEngAbs3.probs which is used by the statistical parser for robust chunking | kr.angelov |
| 2013-03-25 | a long list of prepositions from Wikipedia is now imported in DictEng in addition there are a number of small other changes in the dictionary. The statistical model is updated and is now moved to treebanks/PennTreebank/ParseEngAbs.probs | kr.angelov |
| 2013-03-21 | added configuration file which defines the heads for all syntactic functions in ParseEng | kr.angelov |
| 2012-10-01 | move examples/PennTreebank to /treebanks/PennTreebank | kr.angelov |
| 2012-10-01 | added the GF version of Talbanken which was imported by Malin | kr.angelov |