JafSoft Support Forums  
Can't stop detagger from removing deprecated tag attributes

 
Can't stop detagger from removing deprecated tag attrib... - 8/7/2007 8:57:07 PM   
vaze

 

Posts: 1
Joined: 8/7/2007
Status: offline
Detagger is removing deprecated attributes even when I tell it not to.  This happens anytime a policy file is specified that contains at least one setting.  Telling it to keep deprecated attributes makes no difference.  

I would like to keep the "width" attribute of <td> tags...but nothing I do with the policy file causes Detagger to keep them.

I have tried both the GUI version and console version of Detagger with the same result. 

_____________________________

Dave Weber
TerpSys
Post #: 1
RE: Can't stop detagger from removing deprecated tag at... - 8/7/2007 9:36:03 PM   
Jaf

 

Posts: 70
Joined: 2/1/2006
Status: offline
TD WIDTH attributes shouldn't be removed as deprecated (if they were, you'd get a message telling you how many were removed).  It's more likely that you've somehow checked the "remove all size attributes" setting on the Detag Table Options tab, as this will remove the WIDTH attributes.

Check under the Conversion options -> markup removal -> Tables menu option.

Failing that, it may be that the policy file you're using already has some policies set in it which lead to this result.  You can open a policy file in a text editor to see if there are any candidates for this.
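If the policy file is long, a small script can shortlist the lines worth reading.  This is only a sketch: it treats the policy file as plain text, and the keyword list is a guess at what a relevant policy line might mention, not Detagger's actual policy wording.

```python
# Guessed keywords only; real Detagger policy lines may be worded differently.
KEYWORDS = ("width", "size", "deprecated", "xhtml", "attribute")


def find_candidate_policies(policy_text, keywords=KEYWORDS):
    """Return (line_number, line) pairs for lines mentioning any keyword."""
    hits = []
    for number, line in enumerate(policy_text.splitlines(), start=1):
        lowered = line.lower()  # case-insensitive comparison
        if any(word in lowered for word in keywords):
            hits.append((number, line.strip()))
    return hits
```

Read the policy file into a string (for example with open(...).read()) and pass it in.  Each hit is only a candidate to check by eye, not proof that the line is responsible.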

If all else fails, start again from scratch using the following steps:

1) Restart Detagger.
2) Clear any policy file name, and click the Reset button to the right of the policy filename field.
3) Change the conversion type to "selectively remove markup".
4) Go to the Conversion Options -> Markup Removal menu option and start selecting options, but
5) leave the "remove non-standard attributes" and "keep deprecated attributes" options checked.

If you continue to have problems, please email me a sample HTML file and policy file that demonstrates the problem, and point to where in the HTML the widths are being removed (if it's a particular location).  A copy of the status messages generated during conversion would also be helpful.


(in reply to vaze)
Post #: 2
RE: Can't stop detagger from removing deprecated tag at... - 8/16/2007 10:29:27 PM   
Jaf

 

Posts: 70
Joined: 2/1/2006
Status: offline
(In subsequent emails it became clear that the OP was using the "Convert tags to be XHTML compatible" option.)

The problem is your attempt to convert tags to be more XHTML-like.

Although the width attribute isn't deprecated in HTML 4.0 Transitional, it is deprecated thereafter.  In particular, it and almost all other non-CSS attributes are deprecated in XHTML.

In XHTML all style, sizing and positioning tags and attributes are deprecated.  All of these aspects should be managed via stylesheets and the ID and STYLE attributes.  The intention with XHTML is to totally separate document style and structure.

So selecting this policy is the cause of your problem, since making the HTML more like XHTML specifically requires the removal of all these tags and attributes.  As such, this option overrides any of the other settings.

If you disable this setting you should find that your WIDTH attributes are left intact and you can select to remove other groups of tags.

I hope this explains what is happening.

Note: this "XHTML" option is only intended to lay the groundwork for a conversion to XHTML, not to do the conversion itself.  It will convert tag names to lower case, remove all deprecated tag attributes, and place all attribute values in quotes.  It will also add the trailing "/" in singleton tags such as <br>.  However, it won't ensure that your tagging structure is valid; in particular, it won't resolve any missing or incorrectly matched or nested tags.
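To make those four changes concrete, here is a rough regex-based sketch of the same kind of tag rewriting.  It is not Detagger's code: the function name, the DEPRECATED_ATTRS and SINGLETON_TAGS lists, and the choice to expand valueless attributes to name="name" are all assumptions made for the illustration.  Like the option described above, it makes no attempt to repair missing or badly nested tags.

```python
import re

# Illustrative lists only; not Detagger's actual tables.
DEPRECATED_ATTRS = {"width", "height", "align", "bgcolor"}
SINGLETON_TAGS = {"br", "hr", "img", "input", "meta", "link"}

# Groups: closing slash, tag name, raw attribute text, self-closing slash.
TAG_RE = re.compile(r"<(/?)([A-Za-z][A-Za-z0-9]*)([^<>]*?)(/?)>")
# Groups: attribute name, optional value (double-quoted, single-quoted or bare).
ATTR_RE = re.compile(
    r"""([A-Za-z_:][-A-Za-z0-9_:.]*)(?:\s*=\s*("[^"]*"|'[^']*'|[^\s"'>]+))?"""
)


def _rewrite_tag(match):
    closing, name, attr_text, slash = match.groups()
    name = name.lower()                      # tag names to lower case
    if closing:
        return f"</{name}>"
    parts = []
    for attr, raw in ATTR_RE.findall(attr_text):
        attr = attr.lower()
        if attr in DEPRECATED_ATTRS:         # drop deprecated attributes
            continue
        if raw == "":                        # valueless attribute (assumed expansion)
            value = attr
        elif raw[0] in "\"'" and raw[-1] == raw[0]:
            value = raw[1:-1]                # strip the existing quotes
        else:
            value = raw
        value = value.replace('"', "&quot;")
        parts.append(f' {attr}="{value}"')   # always double-quote the value
    end = " /" if (name in SINGLETON_TAGS or slash) else ""
    return f"<{name}{''.join(parts)}{end}>"


def xhtml_groundwork(html):
    """Apply the tag-level rewrites to every tag found in html."""
    return TAG_RE.sub(_rewrite_tag, html)
```

For example, xhtml_groundwork('<TD WIDTH=50 Class=cell>Hi<BR></TD>') loses the WIDTH attribute, which is the effect reported at the top of this thread.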

< Message edited by Jaf -- 8/16/2007 10:34:00 PM >

(in reply to Jaf)
Post #: 3