iOS第三方HTML解析 TFHpple 的簡單使用

準備工作

1.導入TFHpple 2.引入靜態(tài)庫文件libxml2.2.dylib 3.PROJECT 中的 Search Path - header search paths添加 /usr/include/libxml2

解析步驟

1.初始化data 2.根據(jù)data創(chuàng)建TFHpple實例 3.查找節(jié)點存入數(shù)組 4.在該節(jié)點下 循環(huán)查找子節(jié)點

源HTML代碼:

<table cellpadding="0" cellspacing="0" border="0" width="100%">
<tr>

    <td width="48" valign="top" align="center"><a href="/member/zhangyi2099"><img src="http://cdn.v2ex.co/avatar/d00c/ceb1/18330_normal.png?m=1345037943" class="avatar" border="0" align="default"></a></td>
    <td width="10"></td>

    <td width="auto" valign="middle"><span class="item_title"><a href="/t/228173#reply1">看了本「網球優(yōu)等生」</a></span>
    <div class="sep5"></div>
    <span class="small fade"><div class="votes"></div><a class="node" href="/go/acg">ACG</a>  ?  <strong><a href="/member/zhangyi2099">zhangyi2099</a></strong>  ?  20 分鐘前  ?  最后回復來自 <strong><a href="/member/yishanxin">yishanxin</a></strong></span>
    </td>
    <td width="70" align="right" valign="middle">

        <a href="/t/228173#reply1" class="count_livid">1</a>

    </td>
</tr>

</table>
Object-C代碼:

NSData *htmlData = [[NSData alloc]initWithContentsOfURL:[NSURL URLWithString:@"http://www.xxx.com/xxxx?x=1"]];

TFHpple *xpathParser = [[TFHpple alloc]initWithHTMLData:htmlData];

pragma mark 每頁主題
NSArray *itemArray = [xpathParser searchWithXPathQuery:@"http://div[@class = 'cell item']"];

//通過for in 在itemArray數(shù)組中 循環(huán)查找子節(jié)點 for (TFHppleElement *hppleElement in itemArray) {

/** 這段被正則表達代替 @"http://div[@class = 'cell item']"] if ([[hppleElement objectForKey:@"class" ] isEqualToString:@"cell item"]) { [self.allDataMutableArray addObject:hppleElement]; } /

pragma mark 子節(jié)點頭像
NSArray *IMGElementsArr = [hppleElement searchWithXPathQuery:@"http://img"];
for (TFHppleElement *tempAElement in IMGElementsArr) {
NSString *imgStr = [tempAElement objectForKey:@"src"];
NSString *subStr = [@"http:" stringByAppendingString:imgStr];
[self.avatarMutableArray addObject:subStr];
}
pragma mark 子節(jié)點標題/鏈接
NSArray TitleElementArr = [hppleElement searchWithXPathQuery:@"http://span[@class='item_title']"]; for (TFHppleElement tempAElement in TitleElementArr) { //獲得標題 NSString *titleStr = [tempAElement content];

//1.獲得子節(jié)點(正文連接節(jié)點) 2.獲得節(jié)點屬性值 3.加入到字典中
NSArray * arr = [tempAElement children];
TFHppleElement *href = arr.firstObject;
NSString * titleHrefStr = [href objectForKey:@"href"];

[self.allDataMutableDict setObject:titleStr forKey:@"title"];
self.allDataMutableDict[@"titleHref"] = titleHrefStr;

}
pragma mark 子節(jié)點fade
//簡化寫法 簡化3步
NSArray *nodeElementArr = [hppleElement searchWithXPathQuery:@"http://a[@class='node']"];
self.allDataMutableDict[@"node"] = [nodeElementArr.firstObject content];

NSArray *fadeElementArr = [hppleElement searchWithXPathQuery:@"http://span[@class = 'small fade']"];
NSArray *subArray = [ [fadeElementArr.firstObject content] componentsSeparatedByString:@" ? "];

self.allDataMutableDict[@"louZhu"] = [subArray objectAtIndex:1];
self.allDataMutableDict[@"lastTime"] = [subArray objectAtIndex:2];
pragma mark 子節(jié)點回復數(shù)
NSArray * repeatElementArr = [hppleElement searchWithXPathQuery:@"http://a[@class = 'count_livid']"];
if ([repeatElementArr.firstObject content ]) {
self.allDataMutableDict[@"repeatCount"] = [repeatElementArr.firstObject content];
}else{
self.allDataMutableDict[@"repeatCount"] = [NSString stringWithFormat:@"%d",0];
}
pragma mark 轉化model 存進數(shù)組
[model setValuesForKeysWithDictionary:self.allDataMutableDict];
[self.allDataMutableArray addObject:model];
}

最后編輯于
?著作權歸作者所有,轉載或內容合作請聯(lián)系作者
【社區(qū)內容提示】社區(qū)部分內容疑似由AI輔助生成,瀏覽時請結合常識與多方信息審慎甄別。
平臺聲明:文章內容(如有圖片或視頻亦包括在內)由作者上傳并發(fā)布,文章內容僅代表作者本人觀點,簡書系信息發(fā)布平臺,僅提供信息存儲服務。

相關閱讀更多精彩內容

友情鏈接更多精彩內容