Python 娱乐项目：用BeautifulSoup抓取时光网图片

limn2o4

于 2017-05-29 18:48:30 发布

阅读量539

点赞数

CC 4.0 BY-SA版权

分类专栏： Python 文章标签： python 图片

本文链接：https://blog.csdn.net/lingzidong/article/details/72803399

Python 专栏收录该内容

2 篇文章

订阅专栏

本文介绍了一种利用Python中的BeautifulSoup库从网页抓取图片的方法。通过编写简单的脚本，实现了对mtime电影网主页上所有图片的下载。此教程适用于初学者了解网页爬虫的基本原理。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

最近学了BeautifulSoup，也算是为了以后找测试样例方便，写了一个抓取图片的小例子：
平台：python3.6

from bs4 import BeautifulSoup
from urllib import request

def getpage(url):
    page = request.urlopen(url)
    html = page.read()
    return html
soup = BeautifulSoup(getpage(r'http://movie.mtime.com/'))
img = soup.find_all('img')
cnt = 0
for i in img:
    path = 'img/%d.jpg'%(cnt)
    request.urlretrieve(i['src'],path)
    print('processing--------->count=%d-------->'%(cnt))
    cnt+=1
print('end-------->result=%d'%(cnt))