XPaF: an open-source parsing framework
XPath-based Parsing Framework (XPaF) is a simple, easy-to-use open-source parsing framework for extracting relations (subject-predicate-object triples) from HTML and XML documents.
Code example:
<table>
  <tr>
    <td class="name">Aaron</td>
    <td class="occ">Engineer</td>
  </tr>
  <tr>
    <td class="name">Jennifer</td>
    <td class="occ">Archeologist</td>
  </tr>
</table>
parser_name: "my_parser"
relation_tmpls {
  subject: "//td[@class='name']"
  predicate: "occupation"
  object: "//td[@class='occ']"
  subject_cardinality: MANY
  object_cardinality: MANY
}
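To make the intent of the parser definition concrete, here is a minimal Python sketch of the triples it describes for the table above. It does not use XPaF itself; it only mimics the idea with the standard library's xml.etree.ElementTree. The function name extract_triples is hypothetical, the XPath expressions are written in the relative form (.//td[...]) that ElementTree accepts, and pairing subjects with objects in document order when both cardinalities are MANY is an assumption about the intended behavior.

```python
import xml.etree.ElementTree as ET

# The sample table from the example above (well-formed, so an XML parser can read it).
MARKUP = """
<table>
  <tr><td class="name">Aaron</td><td class="occ">Engineer</td></tr>
  <tr><td class="name">Jennifer</td><td class="occ">Archeologist</td></tr>
</table>
"""


def extract_triples(markup, subject_xpath, predicate, object_xpath):
    """Hypothetical helper: pair subject and object matches in document order.

    Assumption: with MANY/MANY cardinality, the i-th subject belongs to the
    i-th object, so both queries must return the same number of nodes.
    """
    root = ET.fromstring(markup)
    subjects = [node.text for node in root.findall(subject_xpath)]
    objects = [node.text for node in root.findall(object_xpath)]
    if len(subjects) != len(objects):
        raise ValueError("subject and object counts differ; cannot pair them")
    return [(s, predicate, o) for s, o in zip(subjects, objects)]


triples = extract_triples(
    MARKUP,
    ".//td[@class='name']",  # subject query
    "occupation",            # predicate is a fixed string in this example
    ".//td[@class='occ']",   # object query
)

for subject, predicate, obj in triples:
    print(subject, predicate, obj)
```

Under these assumptions, each table row yields one triple: the name cell as subject, the literal "occupation" as predicate, and the occupation cell as object.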
