hpple采用了 Xpath 实现对html的导航和解析,该lib使用简单,但需要对项目做一些设置:
链接libxml2库:
1、打开项目的TARGETS,选择ALL标签(如图):
2、在搜索栏搜索“Header Search Path”,添加属性:
“${SDKROOT}/usr/include/libxml2”
3、和第二步一样,在 “Other Linker Flag”下添加:
“-lxml2”
Git链接:
git clone git://github.com/topfunky/hpple.git
导入相应文件:
TFHpple.h
TFHpple.m
TFHppleElement.h
TFHppleElement.m
XPathQuery.h
XPathQuery.m
用法:
// Don't forget
// #import "TFHpple.h"
NSData *htmlData = [[NSString stringWithContentsOfURL:[NSURL URLWithString: @"http://www.objectgraph.com/contact.html"]] dataUsingEncoding:NSUTF8StringEncoding];
TFHpple *xpathParser = [[[TFHpple alloc] initWithHTMLData:htmlData] autorelease];
NSArray *elements = [xpathParser search:@"//h3"]; // get the page title - this is xpath notation
TFHppleElement *element = [elements objectAtIndex:0];
NSString *myTitle = [element content];
NSLog(myTitle);