|
-
Nov 16th, 2004, 05:22 PM
#1
Thread Starter
PowerPoster
Remove Text from XHTML and Make XML
I have some HTML files I've converted to XHTML using Tidy.
Now I want to strip the text from the XHTML and make the text XML such that the XML looks like:
VB Code:
<FUNCTION>
<NAME>
</NAME>
<DESCRIPTION>
</DESCRIPTION>
<DECLARATION>
</DECLARATION>
<OS>
</OS>
<LIBRARY>
</LIBRARY>
<PARAMETERS>
</PARAMETERS>
<RETURNS>
</RETURNS>
<EXAMPLES>
</EXAMPLES>
<RELATED>
</RELATED>
<FUNCTION>
QUESTION:
What would be the best way to move the text to the above XML format -- using XSL or VB?
If XSL, how do I go about getting the XSL to ignore everything other than the text which is deeply embedded in the HTML?
Stated another way --
If the text is embedded deep in the node, such that there are multi <table> </table> tags before getting to the text of interest, what is the fastest way to skip over these <table> tags until I get to the one that contains the text?
THanks
David
Last edited by dw85745; Nov 17th, 2004 at 08:03 AM.
Posting Permissions
- You may not post new threads
- You may not post replies
- You may not post attachments
- You may not edit your posts
-
Forum Rules
|
Click Here to Expand Forum to Full Width
|