I need to parse an HTML file and extract only the text of the web page, not the HTML markup itself. How do I do this?
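
To make the goal concrete, here is a minimal sketch of the kind of result being asked for. The question does not name a language, so Python and its standard-library `html.parser` module are assumptions, and the names `TextExtractor` and `html_to_text` are made up for illustration. It keeps the text between tags and skips the contents of `<script>` and `<style>` elements; it is one possible approach, not necessarily the best one for every page.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects text nodes, skipping script/style contents."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._chunks = []      # text pieces found so far
        self._skip_depth = 0   # > 0 while inside a <script> or <style> element

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped elements.
        stripped = data.strip()
        if self._skip_depth == 0 and stripped:
            self._chunks.append(stripped)

    def text(self):
        return "\n".join(self._chunks)


def html_to_text(html):
    """Return the text content of an HTML string, one text node per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


# Example with a file (the name "page.html" is an assumption):
# with open("page.html", encoding="utf-8") as f:
#     print(html_to_text(f.read()))
```

This treats every text node as its own line, so inline markup such as `<b>` splits a sentence across lines; joining the pieces with a space instead of a newline is a reasonable alternative.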