python
a="32<2>fdssa</2>ffdsa32" re.sub(r'<(\d+)>|</(\d+)>',"item",a)
result:'32itemfdssaitemffdsa32'
如何正确替换得到结果:
a="32<item>
fdssa</item>
ffdsa32"
美女,给你两个解法
第一个是用non-greedy match:
In [1]: re.sub(r'<(/?)\d+>+?',"<\g<1>item>","32<2>fdssa</2>ffdsa32")
Out[1]: '32<item>fdssa</item>ffdsa32'
另外一个是传个函数进去:
def repl(match):
if match.group(1):
return "<item>"
elif match.group(2):
return "</item>"
else:
return ""
re.sub(r'(<\d+>)+?|(</\d+>)+?',repl,"32<2>fdssa</2>ffdsa32")
==>'32<item>fdssa</item>ffdsa32'