- try{
- URL url = new URL("http://www.51leba.com");
- URLConnection conn = url.openConnection();
- BufferedReader is = new BufferedReader(new InputStreamReader(conn.getInputStream()));
- StringBuffer buffer = new StringBuffer();
- String str;
- while((str = is.readLine()) != null){
- buffer.append(str);
- buffer.append("/n");
- }
- str = buffer.toString().replaceAll("<script(.|/n)+?</script>", "").replaceAll("<(.|/n)+?>", "").replaceAll(" ", " ");
- String[] s = str.split("/n");
- buffer = new StringBuffer();
- for(int i=0;i<s.length;i++){
- if(s[i].trim().equals("") ){
- continue;
- }else{
- buffer.append(s[i]);
- buffer.append("/n");
- }
- }
- System.out.println(buffer.toString());
- is.close();
- }catch (Exception e) {
- e.printStackTrace();
- }
java抓取网页数据
最新推荐文章于 2025-12-05 17:02:52 发布
3324

被折叠的 条评论
为什么被折叠?



