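I have been learning BeautifulSoup recently, and partly to make it easier to collect test samples later, I wrote a small example that scrapes images:
Platform: Python 3.6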
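from bs4 import BeautifulSoup
from urllib import request
from urllib.parse import urljoin
import os

URL = 'http://movie.mtime.com/'

def getpage(url):
    # Download the page and return its raw bytes
    page = request.urlopen(url)
    html = page.read()
    return html

# Name the parser explicitly; otherwise bs4 picks one itself and emits a warning
soup = BeautifulSoup(getpage(URL), 'html.parser')
img = soup.find_all('img')
os.makedirs('img', exist_ok=True)  # urlretrieve does not create the target directory
cnt = 0
for i in img:
    src = i.get('src')
    if not src:
        continue  # skip <img> tags that have no src attribute
    path = 'img/%d.jpg' % (cnt)
    # Resolve relative and protocol-relative src values against the page URL
    request.urlretrieve(urljoin(URL, src), path)
    print('processing--------->count=%d-------->' % (cnt))
    cnt += 1
print('end-------->result=%d' % (cnt))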
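One thing the loop above does not handle: every file is saved as `N.jpg`, whatever the image type really is. Below is a minimal sketch of one way to take the extension from the image URL instead. `target_path` is a hypothetical helper that is not part of the original script, the `src` value in the usage line is made up for illustration, and falling back to `.jpg` when the URL has no extension is an assumption.

```python
import os
from urllib.parse import urljoin, urlparse

def target_path(src, base_url, index, out_dir='img', default_ext='.jpg'):
    """Return (absolute_url, local_path) for one <img> src value."""
    # Resolve relative and protocol-relative src values against the page URL.
    absolute = urljoin(base_url, src)
    # Read the extension from the URL path only, so a query string is ignored.
    ext = os.path.splitext(urlparse(absolute).path)[1].lower()
    if not ext:
        # Assumption: keep the original script's .jpg when nothing better is known.
        ext = default_ext
    return absolute, '%s/%d%s' % (out_dir, index, ext)

# Made-up src value, resolved against the page used in the script.
print(target_path('/static/logo.png?v=2', 'http://movie.mtime.com/', 0))
```

In the loop, the two returned values would take the place of `i['src']` and `path` in the `request.urlretrieve` call.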
The final result: