NAME
apertium-deshtml - This application is part of (apertium)
This tool is part of the apertium open-source machine translation toolbox: www.apertium.org.
SYNOPSIS
apertium-deshtml [ -h ] [ -i ] [ -n ] [ <input file> [ <output file> ] ]
DESCRIPTION
apertium-deshtml is an HTML format processor. Data should be passed through this processor before being piped to lt-proc. The program takes input in the form of an HTML document and produces output suitable for processing with lt-proc. HTML tags and other format information are enclosed in brackets so that lt-proc treats them as whitespace between words.
- -h, --help: Display this help.
- -i: Makes the addition of trailing sentence terminator (".") unconditional, often leading to duplicates.
- -n: Suppresses the addition of a trailing sentence terminator.
You could write the following to show how the word "gener" is analysed:

    echo "<b>gener</b>" | apertium-deshtml | lt-proc ca-es.automorf.bin
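
The bracket-wrapping idea from the description can be approximated with a short shell sketch. This is only an illustration of the concept, not the actual apertium-deshtml algorithm: the helper name wrap_tags_sketch is hypothetical, and the sketch assumes square brackets as delimiters and does no escaping, no entity handling, and no sentence-terminator insertion, so its output will likely differ from what the real tool prints.

```shell
#!/bin/sh
# Hypothetical helper (not part of apertium): wraps every HTML tag in
# square brackets, roughly mimicking how format information is set
# apart from translatable text. Assumes tags contain no '>' characters
# inside attribute values.
wrap_tags_sketch() {
  # '&' in the sed replacement stands for the whole matched tag.
  printf '%s\n' "$1" | sed -E 's/<[^>]+>/[&]/g'
}

# Only the word "gener" is left outside brackets.
wrap_tags_sketch '<b>gener</b>'
```

A morphological analyser reading such a stream could then skip the bracketed spans and analyse only the remaining words; the real tool should be used in any actual pipeline.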