JafSoft Support Forums  
table data broken into 2 lines

 
table data broken into 2 lines - 12/18/2007 12:02:29 PM   
mark

 

Posts: 3
Joined: 12/17/2007
Status: offline
Hi,

First of all, I have been hunting around for software that converts HTML to text, and Detagger is just amazing with all its functionality. It's a really great piece of software, and it's obvious that a lot of thought went into writing it.

I am sure that I'm doing something wrong and was wondering if someone can help me here.

I have an HTML file containing a table with a row whose table data runs beyond the usual page width. After converting, Detagger breaks the data onto the next line after 76 chars. This is not what I want, and it's causing an issue when uploading the file into a database.

I have tried the following:

- on the Tables page, I unchecked "fit table to page size" and set the target maximum width to 300, then tried even higher values; no luck
- on the Text Formatting tab, I set the target page width to 300 and tried higher values. The line then broke after 85 chars, which confused me more than it helped
- last, I checked "Preserve layout exactly as it is in the source file", but that made no difference.

Please, can someone take a look? I will upload the file.

The problem arises at line 35: the converted file breaks the line after the string "mounty888 folds," where it should display the whole line until the next br in the source file.

Thanks a lot!



Attachment (1)
Post #: 1
RE: table data broken into 2 lines - 12/18/2007 2:07:03 PM   
Jaf

 

Posts: 70
Joined: 2/1/2006
Status: offline
Hi Mark,

You've tried the obvious things, namely decoupling the target table width from the default page width.

I'm not in a position to check this right now, but I will try to on return to my office, although I currently have a non-functioning PC there.

Looking at the HTML I notice some "width=1" attributes on some of the image tags.  It may be worth checking the option to ignore all table widths if you've not already done so.

Other than that I'll get back to you as soon as I can test this on a functioning computer 

Cheers, jaf

(in reply to mark)
Post #: 2
RE: table data broken into 2 lines - 12/18/2007 2:13:20 PM   
Jaf

 

Posts: 70
Joined: 2/1/2006
Status: offline
Actually looking at it again, I see that the line in question has colspan="2" set on the <td> tag, when - AFAICT - the rest of the table only has one column.

That might be an issue as it would lead to an ambiguous column width.

Try manually editing that attribute out and see what happens.  I'm not suggesting this as a solution, but if that works I will investigate whether the problem persists in the current beta version.
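For anyone with more than a handful of files, the manual edit described above can be scripted. The following is only a minimal sketch in Python and is not part of Detagger; the function name `strip_colspan` is made up for illustration, and the regular expression assumes that no attribute value in the tag contains a `>` character.

```python
import re

# Matches one colspan attribute (quoted or unquoted value) inside a <td> or <th>
# opening tag. Group 1 holds everything in the tag that precedes the attribute.
_COLSPAN = re.compile(
    r'(<t[dh]\b[^>]*?)\s+colspan\s*=\s*(?:"[^"]*"|\'[^\']*\'|[^\s>]+)',
    re.IGNORECASE,
)


def strip_colspan(html: str) -> str:
    """Return html with colspan attributes removed from <td>/<th> tags."""
    previous = None
    # Repeat until nothing changes, in case a tag carries the attribute twice.
    while previous != html:
        previous = html
        html = _COLSPAN.sub(r"\1", html)
    return html


if __name__ == "__main__":
    print(strip_colspan('<tr><td colspan="2">some long cell text</td></tr>'))
```

This removes every colspan, including legitimate ones in multi-column tables, so it is only appropriate for files where the table really has a single column.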

But again... once I get my computer working <sigh>

Cheers, Jaf

(in reply to mark)
Post #: 3
RE: table data broken into 2 lines - 12/18/2007 2:37:02 PM   
mark

 

Posts: 3
Joined: 12/17/2007
Status: offline
hi,

removing the colspan part did actually fix it! Nice!

Thanks a lot!

So I can safely bin the UltraEdit macro I wrote to work around this :-)

good luck with your PC!

< Message edited by Admin -- 12/18/2007 3:30:43 PM >

(in reply to Jaf)
Post #: 4
RE: table data broken into 2 lines - 12/18/2007 3:29:49 PM   
Jaf

 

Posts: 70
Joined: 2/1/2006
Status: offline
A single row with a colspan=2 in a table with only one column would confuse the software as it would try to work out how "wide" each column should be, and there's nothing on which to calculate the width of the second one.
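To find files with this kind of mismatch before converting them, something like the sketch below could be used. It is an illustration only and says nothing about how Detagger itself calculates widths: the names `RowWidthScanner` and `odd_rows` are hypothetical, a row's "width" is taken to be the sum of its cells' colspan values, and rowspan is ignored.

```python
from collections import Counter
from html.parser import HTMLParser


class RowWidthScanner(HTMLParser):
    """Record the effective width (sum of colspans) of every table row."""

    def __init__(self):
        super().__init__()
        self.tables = []   # finished tables, each a list of row widths
        self._stack = []   # tables still open, innermost last

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._stack.append([])
        elif tag == "tr" and self._stack:
            self._stack[-1].append(0)
        elif tag in ("td", "th") and self._stack and self._stack[-1]:
            span = dict(attrs).get("colspan") or "1"
            # Treat a missing or non-numeric colspan as 1.
            self._stack[-1][-1] += int(span) if span.isdigit() else 1

    def handle_endtag(self, tag):
        if tag == "table" and self._stack:
            self.tables.append(self._stack.pop())


def odd_rows(html):
    """Return (table, row, width) for rows wider or narrower than usual."""
    scanner = RowWidthScanner()
    scanner.feed(html)
    scanner.close()
    found = []
    # Include tables that were never closed in the source.
    for t, widths in enumerate(scanner.tables + scanner._stack):
        if not widths:
            continue
        usual = Counter(widths).most_common(1)[0][0]
        found += [(t, r, w) for r, w in enumerate(widths) if w != usual]
    return found
```

A row reported with width 2 in a table whose usual width is 1 is the situation described above. Tables are numbered in the order their closing tags appear, so a nested table gets a lower index than the table that contains it.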

I'll run this as a test case through my beta code and if it's better I'll contact you via email, so that you don't need to manually edit files in future.

Technically this is malformed HTML, but there's so much of that on the web that Detagger will always try its best to make sense of it!

(in reply to mark)
Post #: 5
RE: table data broken into 2 lines - 2/11/2008 11:49:09 PM   
mark

 

Posts: 3
Joined: 12/17/2007
Status: offline
Hi,

it's me again with the same problem.

It took a few thousand files for the problem to reappear, and nothing I tried has worked; I cannot explain it. Out of about 100,000 files, it happens about 10 times.

I have attached an example again, and it would be super if you could take a look at it.

The file already has colspan removed, the trick that worked for all the other files.

Thanks a lot!

Attachment (1)

(in reply to Jaf)
Post #: 6
RE: table data broken into 2 lines - 2/14/2008 9:57:00 PM   
Jaf

 

Posts: 70
Joined: 2/1/2006
Status: offline
There's a policy called "Force table rows to be output on a single line of text" that would probably do what you want here.  Failing that, it may be something in your policy file (which you haven't attached).

Unfortunately I can't remember offhand if it's in the officially released version (2.4), or something coming soon in version 3.  It should be in the "Width" section of the "Tables" tab under the

   Conversion Options -> Convert To text

menu option.  If your version doesn't have that, email me and I will get a version to you.

(in reply to mark)
Post #: 7