Hello There,
i need help parsing html using regex..i am hardly find the exact expression to use.
this is my html
i want to get all strings inside <TD NOWRAP></TD> and <TD></TD> also exlude the datetime inside<TD NOWRAP></TD>HTML Code:<HTML> <HEAD> <LINK rel="stylesheet" type="text/css" href="BacoSystem.css"/> </HEAD> <BODY> <DIV class="HdrTitle"> <TABLE width="100%" border="0" cellpadding="0" cellspacing="0" id="Table1"><TR><TD width="55px" align="left"><IMG src="Images/hdSystem.gif" height="24px"/></TD> <TD align="left"><SPAN style="font-size:medium; font-weight:bold">XML: Upload - Results</SPAN></TD></TR> </TABLE> </DIV><DIV class="HdrProps"></DIV> <TABLE class="ListShort"><TR class="Section"><TH colspan="5">Results</TH></TR><TR class="Separator"><TD colspan="5" class="Sep1"></TD></TR><TR class="Header"><TD colspan="2">Date</TD><TD colspan="3">Description</TD></TR> <TR class="DataOutline1"><TD colspan="5">Transactions</TD></TR> <TR class="DataDark"><TD width="20px"><IMG src="Images\imgMessage.gif"/></TD><TD NOWRAP>04/04/2011 10:47</TD><TD NOWRAP>Costcenter</TD><TD NOWRAP></TD><TD>Created: 0, Updated: 1, Rejected: 0</TD></TR> <TR class="DataLight"><TD width="20px"><IMG src="Images\imgMessage.gif"/></TD><TD NOWRAP>04/04/2011 10:47</TD><TD NOWRAP>GLTransaction</TD><TD NOWRAP></TD><TD>Created: 0, Updated: 1, Rejected: 0</TD></TR> </TABLE> </BODY> </HTML>
this is what i had now..it still display the datetime
Code:(NOWRAP>|<TD>)[a-zA-Z0-9.,'() /:&\d_-]+


Reply With Quote