Make sure you use the `--recursive` option to clone, or otherwise
make sure the `Util` submodule is correctly checked out.
Make sure you use the `--recursive` option to clone, or otherwise
make sure the `Util` submodule is correctly checked out.
Note that compilation is only tested on Linux.
It is possible to disable the smallicu/makesmallicu step if it causes
issues, though help to debug any such issue would be welcome.
Note that compilation is only tested on Linux.
It is possible to disable the smallicu/makesmallicu step if it causes
issues, though help to debug any such issue would be welcome.
-1. go into the DictionaryPC repository
-2. run `data/downloadInputs.sh` to get the data
-3. run `./compile.sh` (probably needs quite a few things to be installed or paths in it configured)
-4. run `./WiktionarySplitter.sh`
-5. run `./generate_dictionaries.sh`
+1. Go into the DictionaryPC repository
+2. Run `data/downloadInputs.sh` to get the data
+3. Run `./compile.sh` (probably needs quite a few things to be installed or paths in it configured)
+4. Run `./WiktionarySplitter.sh`
+5. Run `./generate_dictionaries.sh`
You might want to edit the settings at the start of the file
so it does not generate all dictionaries, and edit its input files
like EN-trans-dictlist.txt to add new dictionaries.
For a more manual approach, the commands it runs are something like
`./run.sh --lang1=ES --lang2=PT --lang1Stoplist=data/inputs/stoplists/es.txt \
--dictOut=data/outputs/ES-PT.quickdic \
You might want to edit the settings at the start of the file
so it does not generate all dictionaries, and edit its input files
like EN-trans-dictlist.txt to add new dictionaries.
For a more manual approach, the commands it runs are something like
`./run.sh --lang1=ES --lang2=PT --lang1Stoplist=data/inputs/stoplists/es.txt \
--dictOut=data/outputs/ES-PT.quickdic \
--input1Name=enwikitionary --input1Format=EnTranslationToTranslation \
--input1LangPattern1=es --input1LangPattern2=pt`
--input1Name=enwikitionary --input1Format=EnTranslationToTranslation \
--input1LangPattern1=es --input1LangPattern2=pt`