X-Git-Url: http://gitweb.fperrin.net/?a=blobdiff_plain;f=README.md;h=600ba04efc13d25820c3fbd87f9892d06878470f;hb=HEAD;hp=1c3e87be734f4a831b1324c1f6a1d23ec920f291;hpb=7717deec013e2df5a8e38d26f3a7a252eac6f11b;p=Dictionary.git diff --git a/README.md b/README.md index 1c3e87b..600ba04 100644 --- a/README.md +++ b/README.md @@ -17,10 +17,11 @@ It is possible to disable the smallicu/makesmallicu step if it causes issues, though help to debug any such issue would be welcome. ## Generating dictionaries -And approximation of the steps for generating dictionaries: +An approximation of the steps for generating dictionaries: 1. Go into the DictionaryPC repository 2. Run `data/downloadInputs.sh` to get the data 3. Run `./compile.sh` (probably needs quite a few things to be installed or paths in it configured) + Alternatively, download a release binary: https://github.com/rdoeffinger/DictionaryPC/releases 4. Run `./WiktionarySplitter.sh` 5. Run `./generate_dictionaries.sh` You might want to edit the settings at the start of the file @@ -29,6 +30,6 @@ And approximation of the steps for generating dictionaries: For a more manual approach, the commands it runs are something like `./run.sh --lang1=ES --lang2=PT --lang1Stoplist=data/inputs/stoplists/es.txt \ --dictOut=data/outputs/ES-PT.quickdic \ - --dictInfo="(EN)Wiktionary-based ES-PT dictionary. --input1=data/inputs/wikiSplit/en/EN.data" \ + --dictInfo="(EN)Wiktionary-based ES-PT dictionary." --input1=data/inputs/wikiSplit/en/EN.data \ --input1Name=enwikitionary --input1Format=EnTranslationToTranslation \ --input1LangPattern1=es --input1LangPattern2=pt`