Pythonchallenge Level 4 (python2.7)

最新推荐文章于 2024-06-18 15:21:58 发布

原创最新推荐文章于 2024-06-18 15:21:58 发布 · 86 阅读

0 ·

CC 4.0 BY-SA版权

文章标签：

#Python #challenge #4

本文深入探讨了一段独特的代码逻辑，通过解析特定网页URL中的参数，逐步揭示隐藏在数字序列背后的秘密。从数值16044出发，读者将跟随作者的脚步，运用正则表达式和循环迭代技巧，解锁最终页面——'peak.html'。此过程不仅展示了网页爬虫的基本原理，还激发了对网页结构和编码方式的好奇心。

>>> pattern='and the next nothing is \d+'
>>> value=12345
>>> while True:
url=Request('http://www.pythonchallenge.com/pc/def/linkedlist.php?nothing=%s'%value)
response=urllib2.urlopen(url)
page=response.read()
match=re.search(pattern,page)
if not match:
break;
else:
print page
v=re.search('[0-9]+',match.group())
value=v.group()

after getting the value 16044
then run the above code again with the initial value=8022:

Answer: peak.html