Jaf
Posts: 70
Joined: 2/1/2006 Status: offline
quote:
ORIGINAL: Guest
You would probably remember me as the crazy user who came to you a few years ago to buy AsctoHTM and ended up suggesting the whole notion of Detagger to you. In fact, I later served as the alpha tester for the original releases of Detagger and watched it evolve from a concept into a fully released and very useful tool.

I certainly do remember you, and it's great to hear from you again.

quote:
These days I'm regularly faced with a similar dilemma of wanting to remove coding from large files; but rather than straight HTML code, it's now UBB Code. I got to wondering today if you had ever come up with a version of Detagger that would do that?

I'm not familiar with UBB, so there isn't an explicit option in Detagger to do this. However, a technique I've often used is to use the "Text commands" feature to convert unwanted text into HTML comments, which are then stripped out. This would rely on your UBB code having unique start and end tag markup. If it does, then you can define Text commands such as

    replace_text string "start_text" by_string "<!-- "
    replace_text string "end_text" by_string " -->"

These substitutions are executed against the text before it is converted, so if successful they would turn all your UBB code into HTML comments, which Detagger would then strip.

Care would need to be taken to avoid false matches, and to disable the formatting of the output text by Detagger (you can get it to leave the format unchanged). You might also have problems if there is any HTML-like text in the file. It would probably be best to run the software in markup removal mode and select ONLY removal of HTML tags.

This wasn't the job Detagger was designed for (you of all people should know that), but there's enough flexibility in the tool that it may meet your needs.

If you have any further questions about this approach, feel free to post follow-ups here or email me personally.

Take care,
jaf
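
P.S. If it helps to see the idea outside Detagger, here is a rough Python sketch of the same two-step approach: first swap the start and end markers for HTML comment delimiters, then strip the comments. This is not how Detagger is implemented, only an illustration. The function names are made up for this example, and it assumes your UBB tags are delimited by "[" and "]"; substitute whatever your files actually use.

```python
import re


def to_html_comments(text, start_text, end_text):
    """Step 1: turn each start/end marker into an HTML comment delimiter.

    Mirrors the two replace_text commands: start_text becomes "<!-- "
    and end_text becomes " -->".
    """
    return text.replace(start_text, "<!-- ").replace(end_text, " -->")


def strip_html_comments(text):
    """Step 2: remove every HTML comment, including ones spanning lines."""
    return re.sub(r"<!--.*?-->", "", text, flags=re.DOTALL)


def strip_ubb(text, start_text="[", end_text="]"):
    """Hypothetical helper combining both steps.

    Assumption: tags are delimited by start_text and end_text, and those
    markers do not appear in text you want to keep.
    """
    return strip_html_comments(to_html_comments(text, start_text, end_text))


sample = "[b]Hello[/b] world, see [url=http://example.com]this[/url]."
print(strip_ubb(sample))  # prints: Hello world, see this.
```

The false-match caveat shows up here too: any ordinary bracketed text, such as "array[0]", would lose its "[0]" along with the real tags, so it is worth checking a sample of your files before running this over everything.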