com.meterware.httpunit
Class JTidyHTMLParser

java.lang.Object
  |
  +--com.meterware.httpunit.JTidyHTMLParser
All Implemented Interfaces:
com.meterware.httpunit.HTMLParser

public class JTidyHTMLParser
extends java.lang.Object
implements com.meterware.httpunit.HTMLParser

Author:
Russell Gold

Constructor Summary
JTidyHTMLParser()
           
 
Method Summary
 java.lang.String getCleanedText(java.lang.String string)
          Removes any string artifacts placed in the text by the parser.
 org.w3c.dom.Node getDocument(java.net.URL url, java.lang.String pageText)
          Converts an HTML text string to a Document.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

JTidyHTMLParser

public JTidyHTMLParser()
Method Detail

getDocument

public org.w3c.dom.Node getDocument(java.net.URL url,
                                    java.lang.String pageText)
                             throws java.io.IOException,
                                    org.xml.sax.SAXException
Description copied from interface: com.meterware.httpunit.HTMLParser
Converts an HTML text string to a Document. Any error reporting will be annotated with the specified URL.

Specified by:
getDocument in interface com.meterware.httpunit.HTMLParser
java.io.IOException
org.xml.sax.SAXException

getCleanedText

public java.lang.String getCleanedText(java.lang.String string)
Description copied from interface: com.meterware.httpunit.HTMLParser
Removes any string artifacts placed in the text by the parser. For example, a parser may choose to encode an HTML entity as a special character. This method should convert that character to normal text.

Specified by:
getCleanedText in interface com.meterware.httpunit.HTMLParser

Copyright © 2000-2002 Russell Gold. See license agreement for rights granted.