The content I need to search looks roughly like this:
<span class=catfiltertopitem>Connectors</span>
<ul class=catfiltersub>
<li><a href="/scripts/DkSearch/dksus.dll?Cat=1441876" class=catfilterlink>Accessories (4 items)</a>
<li><a href="/scripts/DkSearch/dksus.dll?Cat=1441949" class=catfilterlink>Adapters <!--Adaptors--> (1,157 items)</a>
<li><a href="/scripts/DkSearch/dksus.dll?Cat=1442700" class=catfilterlink>Banana and Tip - Accessories (2 items)</a>
.......
From each line such as <li><a href="/scripts/DkSearch/dksus.dll?Cat=1441876" class=catfilterlink>Accessories (4 items)</a> I need to extract
/scripts/DkSearch/dksus.dll?Cat=1441876
Accessories
4
and from each line such as <li><a href="/scripts/DkSearch/dksus.dll?Cat=1441949" class=catfilterlink>Adapters <!--Adaptors--> (1,157 items)</a> I need
/scripts/DkSearch/dksus.dll?Cat=1441949
Adapters
1,157
and similar data for the remaining lines.
I am using this regular expression:
var
  RegExpr: TRegExpr;
....
RegExpr.Expression := '<a.+href\s*=\s*[\''\"]?([^\s<\''\"]*)[\''\"]?\sclass=catfilterlink>([^<\(].*)\(([0-9\,]+)\sitems\)<\/a>';
for i := 0 to RegExpr.SubExprMatchCount do
begin
  Memo1.Lines.Add(RegExpr.Match[i]);
end;
In the output, the result for the line <li><a href="/scripts/DkSearch/dksus.dll?Cat=1441949" class=catfilterlink>Adapters <!--Adaptors--> (1,157 items)</a> comes out as
/scripts/DkSearch/dksus.dll?Cat=1441949
Adapters <!--Adaptors-->
1,157
How can I get rid of the <!--Adaptors--> part? Could someone give me a regular expression that matches correctly, or suggest another approach?
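One possible approach, offered as a sketch rather than a definitive answer: strip the HTML comments from each line first, then match the cleaned line and trim the captured name. The program below assumes the TRegExpr unit (RegExpr) as shipped with Free Pascal or the standalone Delphi library, and that the non-greedy quantifier `.*?` is available in the version in use. The program name, the `Line` constant, and the `Cleaned` variable are made up for illustration, and the expression is a simplified variant of the one in the question, not the original.

```pascal
program StripCommentDemo;

{$IFDEF FPC}{$mode objfpc}{$H+}{$ENDIF}

uses
  SysUtils, RegExpr;

const
  // Sample line taken from the question.
  Line = '<li><a href="/scripts/DkSearch/dksus.dll?Cat=1441949" ' +
         'class=catfilterlink>Adapters <!--Adaptors--> (1,157 items)</a>';

var
  Re: TRegExpr;
  Cleaned: string;
begin
  // Step 1: remove every HTML comment. The non-greedy ".*?" stops at the
  // first "-->" so two comments on one line are removed separately.
  Cleaned := ReplaceRegExpr('<!--.*?-->', Line, '', False);

  Re := TRegExpr.Create;
  try
    // Step 2: match the cleaned line. Group 1 = href, group 2 = name,
    // group 3 = item count. "[^<\(]*" keeps the name from running past
    // the opening parenthesis or into another tag.
    Re.Expression :=
      '<a[^>]*href\s*=\s*["'']?([^\s<"''>]*)["'']?\s+class=catfilterlink>' +
      '([^<\(]*)\(([0-9,]+)\sitems\)</a>';

    if Re.Exec(Cleaned) then
    begin
      WriteLn(Re.Match[1]);        // the link
      WriteLn(Trim(Re.Match[2]));  // the name, surrounding spaces removed
      WriteLn(Re.Match[3]);        // the count, e.g. with thousands comma
    end;
  finally
    Re.Free;
  end;
end.
```

In the original form the same idea would mean calling ReplaceRegExpr on the input before Exec, and adding Trim(RegExpr.Match[2]) to Memo1 instead of the raw match. Starting the loop at 1 rather than 0 would also skip Match[0], which holds the whole matched text rather than a sub-expression.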