Jaf
Posts: 80
Joined: 2/1/2006 Status: offline
|
The first "Early Adopter" release of Detagger has just been uploaded to the web site. This is available to any registered user who contacts us to express an interest in early releases of Detagger. The current version is 2.4.0.25 and contains a number of improvements and enhancements since the official 2.4 release. These are described in the documentation included with the windows version. A summary is listed at the end of this message. The new versions can be downloaded from the same URLs previously notified via email to the registered Early Adopters Please note, Early Adopters are under no obligation to either download these files, or to supply feedback if you do, but obviously any and all feedback about these releases would be appreciated, together with any comments and suggestions for further improving the software. Summary of changes New features: - The policy "Input text encoding" can now be set on the main screen to allow Unicode files to be detected. Auto-detection of character encoding has been improved - The new policy "Table delimiter character" allows you to specify your choice of delimiter when extracting table data to delimited data format. - The new policies "Enclose delimited data in quotes" and "Enclose nested tables in extra levels of quotes" offer greater control over how extracted table data is encased in quotes - The new policies "Italic markers" and "Bold markers" allow you to specify how bold and italic text should be represented when converted to text. - Added new "white space policies" to allow some control over the white space in an HTML file during markup removal. New options include - New policy "Remove all line feed (LF) and carriage return (CR) characters" - The new policy "Ignore local (relative) links" allows you to determine whether hyperlinks to local resources should be ignored or not - The new policies "Reference table Link template" and "Reference table URL template" allow control of the formatting of hyperlink information in the URL reference table added at the end of the file - The new policy "In-line URL template" allows control of the formatting of in-situ URLs - The new policy "In-line IMG tag template" allows control of any markers used to replace images. - The new policy "Remove <NOSCRIPT> sections from output" allows you to decide whether or not text in <NOSCRIPT> sections should be included in the output. - The new policy "Remove all 8-bit characters" allows you eliminate 8-bit characters from the output. - Added support for the OUTPUT fragment tag, specifically to allow information from tables to be included in text commands. - Renamed the /POLICY qualifier to /SAMPLE_POLICY to hopefully reduce confusion - Added IN_PATH as an attribute value for the DATA fragment tag Bug fixes and enhancements: - Various performance enhancements have been made, especially when dealing with large multi-Mb) files with heavily nested tables - The program now starts faster when converting many (1000's) of files at once. - Fixed a bug that caused crashes when a table contained over 100 columns with some cells having ROWSPAN values. - Changes to a text commands file are now picked up in a more timely manner. Previously you'd have to stop and restart the program. - Many more I've forgotten :-)
< Message edited by Jaf -- 5/27/2006 12:41:28 PM >
|