Java HTML Parser

在此介紹一個還不錯的Java HTML Parser元件 - jsoup，引用它，可以快速地將HTML內容Parsing出你要的結果，以下簡單範例介紹。 import java.net.URL ; import java.util.Iterator; import org.jsoup.Jsoup; import org.jsoup.nodes.Document; import org.jsoup.nodes.Element; public class Parser { /** * @param args */ public static void main( String [] args) throws Exception { // URL URL url = new URL ( "http://tw.yahoo.com" ); // Create the Document Object Document doc = Jsoup . parse (url, 3000); // Get first table Element table = doc.select( "table" ).first(); // Get td Iterator Iterator Element > ite = table.select( "td" ).iterator(); // Print content int cnt = 0; while (ite.hasNext()) { cnt++; System . out .println( "Value " + cnt + ": " + ite.next().text()); } } } <span style="border-collapse: separate; color: #000000; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-align: -webkit-auto; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; -webkit-border-vertical-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: auto; -webkit-text-stroke-width: 0px;"> <span style="border-collapse: separate; color: #000000; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-align: -webkit-auto; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; -webkit-border-vertical-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: auto; -webkit-text-stroke-width: 0px;">在此範例將yahoo的第一個Table中所有td的值Print出來，還有很多應用可自行參考。 <span style="border-collapse: separate; color: #000000; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-align: -webkit-auto; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; -webkit-border-vertical-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: auto; -webkit-text-stroke-width: 0px;"> <span style="border-collapse: separate; color: #000000; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-align: -webkit-auto; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; -webkit-border-vertical-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: auto; -webkit-text-stroke-width: 0px;">jsoup官方網站 <span style="border-collapse: separate; color: #000000; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-align: -webkit-auto; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; -webkit-border-vertical-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: auto; -webkit-text-stroke-width: 0px;"> http://jsoup.org/

流風羽的部落格

歡迎光臨流風羽在痞客邦的小天地

Java HTML Parser - jsoup

部落格廣告

部落格廣告

個人資訊

熱門文章

文章分類

Technology (30)

最新文章

最新留言

文章精選

文章搜尋

參觀人氣