2020-04-25 |
Reimar Döffinger | WiktionarySplitter: implement parallel processing |
2020-04-15 |
Reimar Döffinger | Minor automated code simplifications. |
2020-04-15 |
Reimar Döffinger | Avoid creating the same Matchers over and over. |
2020-04-13 |
Reimar Döffinger | Get rid of xerces dependency. |
2020-04-13 |
Reimar Döffinger | Import cleanup/changes for Eclipse compatibility. |
2020-04-11 |
Reimar Döffinger | Exclude some more special titles not relevant for us. |
2019-01-09 |
Reimar Döffinger | Refine fix for Spanish wiktionary. |
2019-01-09 |
Reimar Döffinger | Improve wiktionary splitter for Spanish and Portuguese |
2018-12-04 |
Reimar Döffinger | Add french-greek dictionary support. |
2017-10-15 |
Reimar Döffinger | Reduce progress prints and optimize title check. |
2017-10-15 |
Reimar Döffinger | Minor optimizations for endPage function. |
2017-10-15 |
Reimar Döffinger | Move code out of loop that had no reason to be in it. |
2017-10-15 |
Reimar Döffinger | Compress WiktionarySplitter output files. |
2017-10-15 |
Reimar Döffinger | Add a write buffer to wiktionary splitter outputs. |
2017-10-15 |
Reimar Döffinger | Cache compiled patterns. |
2017-10-14 |
Reimar Döffinger | Add read-ahead buffer to decompress in parallel. |
2017-10-07 |
Reimar Döffinger | WiktionarySplitter: Support compressed inputs. |
2016-11-08 |
Reimar Döffinger | Apply astyle code formatting. |
2015-12-16 |
Reimar Döffinger | Fix for splitting Mandarin/Cantonese/... |
2015-12-16 |
Reimar Döffinger | Fix splitting of Greek/Ancient Greek. |
2015-12-08 |
Reimar Döffinger | Improvements to wikisplit code. |
2015-09-05 |
Reimar Döffinger | Fix WiktionarySplitter breakage. |
2015-08-28 |
Reimar Döffinger | Hacks to support Spanish wiktionary. |
2012-12-30 |
Thad Hughes | Update URL format and parsing, fix FR handling. |
2012-12-16 |
Thad Hughes | Update to latest wiktionaries, update unit tests, der... |
2012-09-10 |
thadh | Update goldens. |
2012-09-10 |
thadh | Add some langs (Ancient Greek, Cantonese, Burmese(MY... |
2012-07-21 |
thadh | Added WholeSection entries and parser. |
2012-02-10 |
Thad Hughes | Rename enwiktionary package to wiktionary. |
2012-02-08 |
Thad Hughes | Split EN, DE, IT, FR wiktionaries! Fix splitting to... |
2012-01-16 |
Thad Hughes | 2 types of TokenRow. |
2012-01-16 |
Thad Hughes | Changing the way dictionaries are indexed (listed)... |
2011-12-30 |
Thad Hughes | Refactoring wiki parsing, bigtime. Underway, so lots... |
2011-12-30 |
Thad Hughes | More languages, simpler splitter. |
2011-12-28 |
Thad Hughes | Upgrading wiktionary version.... |
2011-12-26 |
Thad Hughes | Add script to download data. |
2011-12-23 |
Thad Hughes | Better {{form of}} handling, remove "lang=..." |
2011-12-19 |
Thad Hughes | Fix enIndex=1, not 2. |
2011-12-18 |
Thad Hughes | Move test data, fix DictFileParser, fix splitter, fix... |
2011-12-17 |
Thad Hughes | Redo splitter language codes. |
2011-12-16 |
Thad Hughes | Stoplists, fix location of wikisplits. |
2011-12-13 |
Thad Hughes | Apache license. |
2011-12-13 |
Thad Hughes | go |