Parse HTML 4.01 Transitional as XML?

Is there any way to fetch a page that is HTML 4.01 Transitional (for example with getURL) and convert it to XML so that XPath can be used on it? Or is the only option to parse it as a string?

Thanks.
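
For reference, here is a rough sketch of one way this might be approached. The question does not name a language, so Python is an assumption here, and `TreeBuilder`, `html_to_tree`, and `SAMPLE` are hypothetical names made up for illustration. The idea is to run the page through a lenient HTML parser, build an XML-style tree from its events, and then query that tree with XPath-like expressions. HTML 4.01 Transitional is generally not well-formed XML (unclosed `<p>`, bare `<br>`), so a strict XML parser would likely reject it.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# HTML 4.01 elements that never take a closing tag.
VOID_TAGS = {
    "area", "base", "basefont", "br", "col", "frame", "hr",
    "img", "input", "isindex", "link", "meta", "param",
}


class TreeBuilder(HTMLParser):
    """Hypothetical helper: turns lenient HTML into an ElementTree."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        # Synthetic wrapper root so stray top-level content has a parent.
        self.root = ET.Element("document")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        # Valueless attributes (e.g. <input disabled>) arrive as None.
        attributes = {name: value or "" for name, value in attrs}
        element = ET.SubElement(self.stack[-1], tag, attributes)
        if tag not in VOID_TAGS:
            self.stack.append(element)

    def handle_endtag(self, tag):
        # Close the nearest open element with this name, if any;
        # unmatched end tags are ignored. The root is never popped.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        current = self.stack[-1]
        if len(current):
            # Text after a child element belongs to that child's tail.
            last = current[-1]
            last.tail = (last.tail or "") + data
        else:
            current.text = (current.text or "") + data


def html_to_tree(html_text):
    builder = TreeBuilder()
    builder.feed(html_text)
    builder.close()
    return builder.root


# Made-up sample page with typical Transitional sloppiness.
SAMPLE = """<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
    "http://www.w3.org/TR/html4/loose.dtd">
<html><head><title>Demo</title></head>
<body>
<p>First<br>line
<p><a href="http://example.com/a">A</a> and <a href="http://example.com/b">B</a>
</body></html>"""

tree = html_to_tree(SAMPLE)
links = [a.get("href") for a in tree.findall(".//a[@href]")]
title = tree.findtext(".//title")
print(title, links)
```

In a real script the HTML text would come from the network (for instance `urllib.request.urlopen(url).read()` followed by decoding) rather than from a literal string. Note that this sketch does not apply HTML's implied-end-tag rules, so the second unclosed `<p>` ends up nested inside the first; descendant queries such as `.//a` still work, but exact paths may not match what a browser would build. `ElementTree` also supports only a subset of XPath, so full XPath 1.0 would probably require a third-party library such as lxml.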