Posted on 2017-12-15 11:23:05
Last edited by payton24 on 2017-12-15 11:24
It has only been two or three months, yet when I copied the original poster's code and tried it, I got a 403: Forbidden error.
- ============== RESTART: C:/Users/LZY/Desktop/download_mm_new.py ==============
- Traceback (most recent call last):
- File "C:/Users/LZY/Desktop/download_mm_new.py", line 63, in <module>
- download_mm()
- File "C:/Users/LZY/Desktop/download_mm_new.py", line 54, in download_mm
- page_num = int(get_page(url))
- File "C:/Users/LZY/Desktop/download_mm_new.py", line 13, in get_page
- html = url_open(url).decode('utf-8')
- File "C:/Users/LZY/Desktop/download_mm_new.py", line 7, in url_open
- response = urllib.request.urlopen(url)
- File "C:\Users\LZY\AppData\Local\Programs\Python\Python36\lib\urllib\request.py", line 223, in urlopen
- return opener.open(url, data, timeout)
- File "C:\Users\LZY\AppData\Local\Programs\Python\Python36\lib\urllib\request.py", line 532, in open
- response = meth(req, response)
- File "C:\Users\LZY\AppData\Local\Programs\Python\Python36\lib\urllib\request.py", line 642, in http_response
- 'http', request, response, code, msg, hdrs)
- File "C:\Users\LZY\AppData\Local\Programs\Python\Python36\lib\urllib\request.py", line 570, in error
- return self._call_chain(*args)
- File "C:\Users\LZY\AppData\Local\Programs\Python\Python36\lib\urllib\request.py", line 504, in _call_chain
- result = func(*args)
- File "C:\Users\LZY\AppData\Local\Programs\Python\Python36\lib\urllib\request.py", line 650, in http_error_default
- raise HTTPError(req.full_url, code, msg, hdrs, fp)
- urllib.error.HTTPError: HTTP Error 403: Forbidden
- >>>
After searching online, I changed line 7 to:
response = urllib.request.urlopen(req)
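For anyone following along, here is a minimal sketch of what that change probably involves. It assumes `req` is a `urllib.request.Request` that carries a browser-like `User-Agent` header, which is the usual way around a 403 from a site that rejects the default Python client. The header string and the helper name `build_request` are my own placeholders, not from the original code.

```python
import urllib.request

# Hypothetical User-Agent string; any common browser value should work.
USER_AGENT = "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"


def build_request(url):
    """Wrap the URL in a Request that carries a browser-like User-Agent."""
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})


def url_open(url):
    """Fetch the URL and return the raw response bytes."""
    req = build_request(url)
    with urllib.request.urlopen(req) as response:
        return response.read()
```

The caller can still do `url_open(url).decode('utf-8')` as in the traceback, since the function returns bytes.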
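The 403 error went away, and rerunning the script no longer raised any exception.
But while debugging it myself, I found that when the code reaches this statement: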
a = html.find('img src=')
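a comes back as -1, meaning the string 'img src=' no longer appears in the page, so the image URLs cannot be found. This suggests the site has encrypted the image URLs.
Are there any similar sites I could practice on?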
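As a rough illustration of the -1 case: `str.find` returns -1 when the substring is missing, so checking for it explicitly makes the failure visible instead of silently slicing garbage. The function name `find_image_src` and the assumption that the URL sits between double quotes are mine, not from the original script.

```python
def find_image_src(html, start=0):
    """Return the first quoted URL after 'img src=', or None if absent."""
    a = html.find('img src=', start)
    if a == -1:
        # The marker is not in the page at all (the situation described above).
        return None
    # Assumption: the URL is wrapped in double quotes right after the marker.
    q1 = html.find('"', a)
    if q1 == -1:
        return None
    q2 = html.find('"', q1 + 1)
    if q2 == -1:
        return None
    return html[q1 + 1:q2]


if __name__ == "__main__":
    print(find_image_src('<img src="http://example.com/a.jpg">'))
    print(find_image_src('<div class="x"></div>'))
```

If this returns None for the downloaded page, it is worth printing part of `html` to see how the image addresses are actually embedded.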