大家帮我看下这个正则表达式怎么写,下载链接提取
本帖最后由 justdomyself 于 2018-11-20 19:11 编辑import re
url ='<a href="forum.php?mod=attachment&aid=MTA5MDUzOTl8ZjZmNmYzOWJ8MTU0MjcxMDA4NHwxNTQyNDEwMzN8MTA5MDU5NzU%3D" onmouseover="' \
'showMenu({"ctrlid":"this.id","pos":"12"})" id="aid10905399" target="_blank">1118-6.torrent</a>'
pt = re.compile(r'<a href="(forum.php?mod=attachment&aid=.*?)".*?</a>',re.S)
list = pt.findall(url)
print(list)
红色部分是需要的, 正则表达式改来改去去,没截取成功,大家帮忙看下,谢谢
图片是网页截图:
编辑原因:添加图片 搞定了。
import reurl ='<a href="forum.php?mod=attachment&aid=MTA5MDUzOTl8ZjZmNmYzOWJ8MTU0MjcxMDA4NHwxNTQyNDEwMzN8MTA5MDU5NzU%3D" onmouseover="' \
'showMenu({"ctrlid":"this.id","pos":"12"})" id="aid10905399" target="_blank">1118-6.torrent</a>'
pt = re.compile(r'<a href="(forum.php\?mod=attachment&aid=.*?)"+\s*.*</a>',re.S)
list = pt.findall(url)
for list_x in list: print(list_x)
页:
[1]