Other neat stuff

O Goireasan Akerbeltz
Am mùthadh mar a bha e 21:46, 16 dhen Ghearran 2015 le Akerbeltz (deasbaireachd | mùthaidhean) (Created page with "Here are some other neat tools and scripts that might come in handy for people working on l10n tools in under-resourced languages. ==Alphabetical words== As in, words in whic...")
(diofar) ← Mùthadh nas sine | Am mùthadh mu dheireadh (diofar) | Mùthadh nas ùire → (diofar)
Jump to navigation Jump to search

Here are some other neat tools and scripts that might come in handy for people working on l10n tools in under-resourced languages.

Alphabetical words

As in, words in which all letters are in the order of the alphabet (like beefily in English). You need a text file with one word on each line, then run the following command:

$ cat FILENAME.txt | while read x; do echo $x `echo $x | sed 's/./\n&/g' | sort -u | tr -d "\n"`; done | egrep '^(.+) \1$'

If you want to allow double letters such as nn, chance | sort -u | to | sort |. If there's a lot of output, paste it into a spreadsheet like LibreOffice Calc or Microsoft Excel and use a sort-by-length function.

l10n for Humans
Basics - Projects - Gear - Terminology - Other neat stuff