TagText

TagText is a file conversion to text format.

The conversion deletes all layout and structural tags and keeps the textual content. The conversion deletes the HTML headings.

A typical usage of TagText is to get the textual content in order to process it automatically by the means of other tools.

TagText operates according to two modes:

  1. A batch mode thru command line,
  2. An interactive mode thru a Swing GUI.

The two modes offer the same functionalities.


Return to products page

 

The interactive mode displays the following window:

TagText accepts all formats but processes only the ones it is able to deal with.

The algorithm is as follows:

If the format is recognized (c.f. TagTools for the format list)

Then if is possible to convert this format (i.e. Text, HTML, SGML, XML)

The file is converted

Else a message is displayed and the conversion is not done.

Else a message is displayed and the conversion is not done

To know the batch mode syntax, run the program with the "-help" option.