Python正则表达式找到所有重叠的匹配项？

Question 1

我正在尝试在Python 2.6中使用re查找更大系列的数字中的每10位数字系列。

我很容易就能抓住不重叠的比赛，但我希望数字系列中的每场比赛。例如。

在“ 123456789123456789”中

我应该得到以下列表：

[1234567891,2345678912,3456789123,4567891234,5678912345,6789123456,7891234567,8912345678,9123456789]

我已经找到了对“超前”的引用，但是我所看到的示例仅显示了成对的数字，而不是更大的分组，而且我无法将其转换为两位数以外的数字。

Question 2

I’m trying to find every 10 digit series of numbers within a larger series of numbers using re in Python 2.6.

I’m easily able to grab no overlapping matches, but I want every match in the number series. Eg.

in “123456789123456789”

I should get the following list:

[1234567891,2345678912,3456789123,4567891234,5678912345,6789123456,7891234567,8912345678,9123456789]

I’ve found references to a “lookahead”, but the examples I’ve seen only show pairs of numbers rather than larger groupings and I haven’t been able to convert them beyond the two digits.

Question 3

在前瞻范围内使用捕获组。前瞻捕捉您感兴趣的文本，但是实际匹配在技术上是前瞻之前的零宽度子字符串，因此匹配在技术上是不重叠的：

import re 
s = "123456789123456789"
matches = re.finditer(r'(?=(\d{10}))',s)
results = [int(match.group(1)) for match in matches]
# results: 
# [1234567891,
#  2345678912,
#  3456789123,
#  4567891234,
#  5678912345,
#  6789123456,
#  7891234567,
#  8912345678,
#  9123456789]

Question 4

Use a capturing group inside a lookahead. The lookahead captures the text you’re interested in, but the actual match is technically the zero-width substring before the lookahead, so the matches are technically non-overlapping:

import re 
s = "123456789123456789"
matches = re.finditer(r'(?=(\d{10}))',s)
results = [int(match.group(1)) for match in matches]
# results: 
# [1234567891,
#  2345678912,
#  3456789123,
#  4567891234,
#  5678912345,
#  6789123456,
#  7891234567,
#  8912345678,
#  9123456789]

Question 5

您也可以尝试使用支持重叠匹配（不是re）。

>>> import regex as re
>>> s = "123456789123456789"
>>> matches = re.findall(r'\d{10}', s, overlapped=True)
>>> for match in matches: print match
...
1234567891
2345678912
3456789123
4567891234
5678912345
6789123456
7891234567
8912345678
9123456789

Question 6

You can also try using the (not re), which supports overlapping matches.

>>> import regex as re
>>> s = "123456789123456789"
>>> matches = re.findall(r'\d{10}', s, overlapped=True)
>>> for match in matches: print(match)  # print match
...
1234567891
2345678912
3456789123
4567891234
5678912345
6789123456
7891234567
8912345678
9123456789

Question 7

我喜欢正则表达式，但是这里不需要它们。

只是

s =  "123456789123456789"

n = 10
li = [ s[i:i+n] for i in xrange(len(s)-n+1) ]
print '\n'.join(li)

结果

Question 8

I’m fond of regexes, but they are not needed here.

Simply

s =  "123456789123456789"

n = 10
li = [ s[i:i+n] for i in xrange(len(s)-n+1) ]
print '\n'.join(li)

result

Python正则表达式找到所有重叠的匹配项？

问题：Python正则表达式找到所有重叠的匹配项？

回答 0

回答 1

回答 2

排行榜展示

Python 情人节超强技能导出微信聊天记录生成词云

你不得不知道的python超级文献批量搜索下载工具

7行代码 Python热力图可视化分析缺失数据处理

Python 流程图 — 一键转化代码为流程图

Python 优化—算出每条语句执行时间

你的10W块放哪里能赚最多钱？

文章展示

将NumPy数组追加到NumPy数组

numpy：从2D数组中获取随机的行集

Python 计算瑞幸和星巴克谁的门店最多

OSError：[Errno 2]在Django中使用python子进程时，没有此类文件或目录

如何在Python的类方法中访问“静态”类变量？

为什么会出现AttributeError：’NoneType’对象没有属性’something’？

Python正则表达式找到所有重叠的匹配项？

问题：Python正则表达式找到所有重叠的匹配项？

回答 0

回答 1

回答 2

相关文章

排行榜展示

文章展示