I used this script to turn a DE-ES dict.cc file into a quickdic compatible with my Tolino. From the original 45k entries more than 20k were dropped because they had a subject label:
> WARNING: Malformed line: Atomphysik {f} física {f} atómica noun [phys.]
This change allows lines to have 4 fields/columns: `language1`, `language2`, `word class`, `subject labels`.
see also https://github.com/natowi/quickdic-dictionary.dictionarypc/issues/1
return;
}
final String[] fields = fieldSplit.split(line);
- // dictcc now has a part of speech field as field #3.
- if (fields.length < 2 || fields.length > 3) {
- logger.warning("Malformed line: " + line);
+ if (fields.length < 2 || fields.length > 4) {
+ logger.warning("Malformed line, expected 3 or 4 fields, got " + fields.length + ": " + line);
return;
}