2016-10-05 |
Reimar Döffinger | Partial progress to fix frwiktionary parsing. |
tree | commitdiff |
2016-04-15 |
Reimar Döffinger | Support generation of FR-AR dictionary. |
tree | commitdiff |
2016-03-19 |
Reimar Döffinger | Add missing | in pattern. |
tree | commitdiff |
2016-01-06 |
Reimar Döffinger | Fix compilation. |
tree | commitdiff |
2015-12-17 |
Reimar Döffinger | Fix filtering out translation from French HTML. |
tree | commitdiff |
2015-12-16 |
Reimar Döffinger | Fix for splitting Mandarin/Cantonese/... |
tree | commitdiff |
2015-12-16 |
Reimar Döffinger | Fix splitting of Greek/Ancient Greek. |
tree | commitdiff |
2015-12-14 |
Reimar Döffinger | Improve tokenizer speed. |
tree | commitdiff |
2015-12-13 |
Reimar Döffinger | Use default Java Collator. |
tree | commitdiff |
2015-12-13 |
Reimar Döffinger | Fix parsing of examples with multiline foreign part. |
tree | commitdiff |
2015-12-13 |
Reimar Döffinger | Minor code cleanup. |
tree | commitdiff |
2015-12-13 |
Reimar Döffinger | Avoid replaceAll. |
tree | commitdiff |
2015-12-13 |
Reimar Döffinger | Free some memory as early as possible. |
tree | commitdiff |
2015-12-12 |
Reimar Döffinger | Fix German name for Latin. |
tree | commitdiff |
2015-12-12 |
Reimar Döffinger | Fix compilation against latest newformat branch. |
tree | commitdiff |
2015-12-09 |
Reimar Döffinger | Encode URLs as ASCII, avoid UTF-8. |
tree | commitdiff |
2015-12-08 |
Reimar Döffinger | Improvements to wikisplit code. |
tree | commitdiff |
2015-12-07 |
Reimar Döffinger | Support v006 and v007 dictionary formats. |
tree | commitdiff |
2015-09-12 |
Reimar Döffinger | Update for new dictionary version URL. |
tree | commitdiff |
2015-09-05 |
Reimar Döffinger | Fix WiktionarySplitter breakage. |
tree | commitdiff |
2015-09-05 |
Reimar Döffinger | Remove dummy code that makes no sense/does not work. |
tree | commitdiff |
2015-08-28 |
Reimar Döffinger | Try filtering out anagrams from FR dictionary. |
tree | commitdiff |
2015-08-28 |
Reimar Döffinger | Partial support for Spanish Wiktionary. |
tree | commitdiff |
2015-08-28 |
Reimar Döffinger | Also accept language variants as Spanish. |
tree | commitdiff |
2015-08-28 |
Reimar Döffinger | Hacks to support Spanish wiktionary. |
tree | commitdiff |
2015-08-28 |
Reimar Döffinger | Report error when hitting end when searching token. |
tree | commitdiff |
2015-08-28 |
Reimar Döffinger | Small updates to dictionary generation. |
tree | commitdiff |
2015-08-27 |
Reimar Döffinger | Update for new dictionary release URL. |
tree | commitdiff |
2015-08-27 |
Reimar Döffinger | Add script to help with dictionary generation. |
tree | commitdiff |
2015-08-24 |
Reimar Döffinger | Update file location URL. |
tree | commitdiff |
2015-08-24 |
Reimar Döffinger | Replace com.sun.xml.internal.rngom.util.Uri. |
tree | commitdiff |
2015-08-24 |
Reimar Döffinger | Disable some debug code to allow compilation. |
tree | commitdiff |
2013-12-26 |
Thad Hughes | Fixes for Malay$ and reorderings due to new ICU4J. |
tree | commitdiff |
2013-12-03 |
Thad Hughes | Update WiktionaryLangs. |
tree | commitdiff |
2013-04-07 |
Thad Hughes | go |
tree | commitdiff |
2013-01-09 |
Thad Hughes | Fix Malay/Malayalam, add test for "buon g". |
tree | commitdiff |
2013-01-05 |
Thad Hughes | Using new Chemnitz dictionary. |
tree | commitdiff |
2013-01-05 |
Thad Hughes | Fix AF-EN test. |
tree | commitdiff |
2013-01-05 |
Thad Hughes | Fixed comment for German dictionary. |
tree | commitdiff |
2013-01-03 |
Thad Hughes | Eliminated <ref>s. |
tree | commitdiff |
2013-01-03 |
Thad Hughes | Skip Italian references. |
tree | commitdiff |
2013-01-03 |
Thad Hughes | Split ZH into yue and cmn, fixed German heading. |
tree | commitdiff |
2012-12-30 |
Thad Hughes | Update URL format and parsing, fix FR handling. |
tree | commitdiff |
2012-12-23 |
Thad Hughes | Multi word search now looks for exact matches of TokenRows. |
tree | commitdiff |
2012-12-23 |
Thad Hughes | Building dictionaries. |
tree | commitdiff |
2012-12-16 |
Thad Hughes | Update to latest wiktionaries, update unit tests, der... |
tree | commitdiff |
2012-12-03 |
Thad Hughes | go |
tree | commitdiff |
2012-10-07 |
thadh | Added simple parsing logic for DE and IT wiktionaries. |
tree | commitdiff |
2012-10-04 |
thadh | Updated input locations. Moved pairs in builder. |
tree | commitdiff |
2012-10-03 |
thadh | Fixed trailing ,s in Italian verb tenses. |
tree | commitdiff |
2012-10-01 |
thadh | Format links properly. |
tree | commitdiff |
2012-09-30 |
thadh | Synonyms, antonyms. |
tree | commitdiff |
2012-09-25 |
thadh | Don't handle it-conj in EnParser. |
tree | commitdiff |
2012-09-25 |
thadh | it-noun. |
tree | commitdiff |
2012-09-25 |
thadh | Link forms, page limit Arabic, change HTML. |
tree | commitdiff |
2012-09-25 |
thadh | Put links into HtmlEntry. |
tree | commitdiff |
2012-09-23 |
thadh | Italian verb conjugations! |
tree | commitdiff |
2012-09-22 |
thadh | it-conj (most of the way), Unicode handling in strings. |
tree | commitdiff |
2012-09-18 |
thadh | Expand Italian test to get verb conjugations. |
tree | commitdiff |
2012-09-18 |
thadh | Basic general functions in WholeSectionParser. |
tree | commitdiff |
2012-09-18 |
thadh | Skip lang=XX for the lang we care about. |
tree | commitdiff |
2012-09-18 |
thadh | Skip w: and Image: wikiLinks. |
tree | commitdiff |
2012-09-18 |
thadh | Delete Anagrams and References sections. |
tree | commitdiff |
2012-09-18 |
thadh | Got rid of Category:. |
tree | commitdiff |
2012-09-18 |
thadh | Get rid of trailing "en:word" crap. |
tree | commitdiff |
2012-09-18 |
thadh | Reformat. |
tree | commitdiff |
2012-09-18 |
thadh | Update unit tests for parsing function name. |
tree | commitdiff |
2012-09-18 |
thadh | Fixed Builder, and escaping arg names. |
tree | commitdiff |
2012-09-11 |
thadh | HtmlEntries don't count as main entries. |
tree | commitdiff |
2012-09-10 |
thadh | Whitespace. |
tree | commitdiff |
2012-09-10 |
thadh | Whitespace. |
tree | commitdiff |
2012-09-10 |
thadh | Update goldens. |
tree | commitdiff |
2012-09-10 |
thadh | Add some langs (Ancient Greek, Cantonese, Burmese(MY... |
tree | commitdiff |
2012-09-10 |
thadh | First decent implementation of HtmlEntry attached to... |
tree | commitdiff |
2012-08-20 |
thadh | Add TA=Tamil language. |
tree | commitdiff |
2012-07-28 |
thadh | Escape HTML. Test special ISO coding. |
tree | commitdiff |
2012-07-24 |
thadh | Baseline HTML parsing done, goldens updated! |
tree | commitdiff |
2012-07-22 |
thadh | Refactor code to generate dictionaries to make it all... |
tree | commitdiff |
2012-07-21 |
thadh | Added WholeSection entries and parser. |
tree | commitdiff |
2012-07-17 |
thadh | Updated unit tests, added WholeSectionToHtmlParser. |
tree | commitdiff |
2012-05-20 |
Thad Hughes | DictionaryBuilder prints sortable langs, JP->JA fix. |
tree | commitdiff |
2012-05-11 |
Thad Hughes | Build fr_de dictionary from enwiktionary, yeah! |
tree | commitdiff |
2012-05-09 |
Thad Hughes | Unit tests working, looks like I'd been revamping the... |
tree | commitdiff |
2012-03-09 |
Thad Hughes | Update version to v004. |
tree | commitdiff |
2012-03-08 |
Thad Hughes | Fixes to tr= and head= make Arabic, Thai look much better. |
tree | commitdiff |
2012-03-08 |
Thad Hughes | Bug-fixes to WikiTokenizer (handle weird line-feed... |
tree | commitdiff |
2012-03-06 |
Thad Hughes | EnTranslationToTranslationParser |
tree | commitdiff |
2012-03-06 |
Thad Hughes | Fixed combining marks on Unicode regexes. |
tree | commitdiff |
2012-02-10 |
Thad Hughes | Unit tests working again after refactoring!!! |
tree | commitdiff |
2012-02-10 |
Thad Hughes | Major en refactoring underway. |
tree | commitdiff |
2012-02-10 |
Thad Hughes | Rename enwiktionary package to wiktionary. |
tree | commitdiff |
2012-02-08 |
Thad Hughes | Point unit tests at new wikiSplit/en/. |
tree | commitdiff |
2012-02-08 |
Thad Hughes | Split EN, DE, IT, FR wiktionaries! Fix splitting to... |
tree | commitdiff |
2012-01-31 |
Thad Hughes | Fix test. |
tree | commitdiff |
2012-01-31 |
Thad Hughes | Moved normalization, more tests. |
tree | commitdiff |
2012-01-30 |
Thad Hughes | Stoplist, more languages... |
tree | commitdiff |
2012-01-26 |
Thad Hughes | zipSize, overrideStoplist-> special isMainEntry, tagalo... |
tree | commitdiff |
2012-01-24 |
Thad Hughes | Added Urdu! |
tree | commitdiff |
2012-01-17 |
Thad Hughes | Better DictionaryInfo, IndexBuilder counts main TokenRows. |
tree | commitdiff |
2012-01-16 |
Thad Hughes | Wiktionary upgrade! |
tree | commitdiff |