JafSoft Support Forums  
Junk Character Inserted into Plain Text Body

 
Junk Character Inserted into Plain Text Body - 2/26/2009 3:59:04 PM   
ChrisA

 

When using Detagger, a junk character is inserted as the first character of the plain text body.

The detagged text is as follows:

 Text Converted: ´╗┐BODY1   ┬á   BODY2
We are unsure how, or why, this is happening.

I have the entire command-line processing captured in a Word document, but I am unable to attach such files. Please let me know if additional information is needed.

Thanks,
Chris
Post #: 1
RE: Junk Character Inserted into Plain Text Body - 2/26/2009 7:40:05 PM   
Jaf

 

This is almost certainly due to the output being generated as Unicode.  Detagger can do this if there is Unicode in the input, or if the HTML contains HTML entities that require Unicode to represent them.

Normally the output file is marked as Unicode, and programs reading the file should interpret the multi-byte character sequences as the correct characters (e.g. a copyright symbol).
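
The symptom in the first post can be reproduced with a short sketch. It assumes (this is an inference from the junk characters, not something confirmed from the original file) that the output is UTF-8 with a leading byte order mark and a non-breaking space between the two body fragments, and that the program displaying it decodes the bytes as code page 850. The function names are made up for illustration.

```python
# Assumption: the converter wrote UTF-8 starting with a byte order mark (BOM),
# and the viewer decoded those bytes as code page 850 instead of UTF-8.
BOM = "\ufeff"   # the "this file is Unicode" marker at the start of the file
NBSP = "\u00a0"  # non-breaking space, e.g. from an &nbsp; entity in the HTML

def as_seen_in_cp850(text: str) -> str:
    """Show how UTF-8 output looks in a viewer that assumes code page 850."""
    return text.encode("utf-8").decode("cp850")

def read_utf8_text(raw: bytes) -> str:
    """Decode UTF-8 bytes, dropping a leading BOM if one is present."""
    return raw.decode("utf-8-sig")

intended = BOM + "BODY1 " + NBSP + " BODY2"
output_bytes = intended.encode("utf-8")

garbled = as_seen_in_cp850(intended)   # "´╗┐BODY1 ┬á BODY2"
clean = read_utf8_text(output_bytes)   # "BODY1 \xa0 BODY2", no junk at the start
```

If this is what is happening, the file itself is fine: the three characters at the start are the BOM bytes and the pair in the middle is the non-breaking space, both shown in the wrong encoding. Reading the file as UTF-8 (here via Python's `utf-8-sig` codec) makes them disappear.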

If you'd like me to investigate further, please place the original HTML file in a .zip file attachment and send it to me at jaf-at-jafsoft.com.  Don't paste the text into Word or anything else, as copying the text into another file is highly likely to change it... I'd need to see the original HTML file.



(in reply to ChrisA)
Post #: 2