2 Mar 2004 10:41
ParserException: null;
Shantha Jayalal <s.g.v.jayalal <at> cs.keele.ac.uk>
2004-03-02 09:41:11 GMT
2004-03-02 09:41:11 GMT
Hi, I am getting following error when try to extract links from a web site. Any help please. Many Thanks Shantha D:\htmlparser1_4_2>java Robot http://www.keele.ac.uk/depts/cs/dake/vldb2000/pan l2020/DeenVLDB2/index.htm Crawlin Site http://www.keele.ac.uk/depts/cs/dake/vldb2000/panel2020/DeenVLDB2/ ndex.htm 1 Exception in thread "main" org.htmlparser.util.ParserException: null; sun.io.MalformedInputException at sun.io.ByteToCharUTF8.convert(ByteToCharUTF8.java:152) at java.io.InputStreamReader.convertInto(InputStreamReader.java:137) at java.io.InputStreamReader.fill(InputStreamReader.java:186) at java.io.InputStreamReader.read(InputStreamReader.java:249) at org.htmlparser.lexer.Source.fill(Source.java:239) at org.htmlparser.lexer.Source.read(Source.java:322) at org.htmlparser.lexer.Source.read(Source.java:347) at org.htmlparser.lexer.Page.setEncoding(Page.java:698) at org.htmlparser.tags.MetaTag.doSemanticAction(MetaTag.java:115) at org.htmlparser.scanners.TagScanner.scan(TagScanner.java:69) at org.htmlparser.scanners.CompositeTagScanner.scan(CompositeTagScanner java:162) at org.htmlparser.util.IteratorImpl.nextNode(IteratorImpl.java:92) at Robot.crawl(Robot.java:200) at Robot.main(Robot.java:106) -------------------------------------------------------(Continue reading)
RSS Feed