import requests
from bs4 import BeautifulSoup
header = {
    'User-Agent':' Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/65.0.3325.181 Safari/537.36'
}
res = requests.get('https://www.qidian.com/',headers=header)
soup = BeautifulSoup(res.text,'html.parser')
print(soup.prettify())
I'm new to web scraping and want to fetch the page source of Qidian (qidian.com).
Instead, I get this:
Traceback (most recent call last):
  File "F:/Python/Exercise set/爬虫之旅.1.py", line 6, in <module>
    res = requests.get('https://www.qidian.com/',headers=header)
  File "C:\Users\DELL\venv\lib\site-packages\requests\api.py", line 72, in get
    return request('get', url, params=params, **kwargs)
  File "C:\Users\DELL\venv\lib\site-packages\requests\api.py", line 58, in request
    return session.request(method=method, url=url, **kwargs)
  File "C:\Users\DELL\venv\lib\site-packages\requests\sessions.py", line 494, in request
    prep = self.prepare_request(req)
  File "C:\Users\DELL\venv\lib\site-packages\requests\sessions.py", line 437, in prepare_request
    hooks=merge_hooks(request.hooks, self.hooks),
  File "C:\Users\DELL\venv\lib\site-packages\requests\models.py", line 306, in prepare
    self.prepare_headers(headers)
  File "C:\Users\DELL\venv\lib\site-packages\requests\models.py", line 440, in prepare_headers
    check_header_validity(header)
  File "C:\Users\DELL\venv\lib\site-packages\requests\utils.py", line 869, in check_header_validity
    raise InvalidHeader("Invalid return character or leading space in header: %s" % name)
requests.exceptions.InvalidHeader: Invalid return character or leading space in header: User-Agent
Can someone explain what this means?
import requests
from bs4 import BeautifulSoup
header = {
    'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/65.0.3325.181 Safari/537.36'
}
res = requests.get('https://www.qidian.com/',headers=header)
soup = BeautifulSoup(res.text,'html.parser')
print(soup.prettify())
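The User-Agent value in your header has an extra space at the start (' Mozilla/5.0 ...'). requests refuses header values that begin with whitespace or contain a return character, which is exactly what the error message says. As the traceback shows, the check runs in check_header_validity while the request is still being prepared, so it is requests itself that rejects the header, not the server and not HTTP/2. Remove the leading space and the request goes through.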
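If the header values are pasted in from a browser, stray whitespace like this is easy to miss. One way to guard against it is to strip every name and value before handing the dict to requests. This is only a minimal sketch: `clean_headers` is a hypothetical helper name, not part of requests, and the User-Agent string is shortened for illustration.

```python
def clean_headers(headers):
    """Return a copy of `headers` with surrounding whitespace removed
    from every header name and value."""
    return {name.strip(): value.strip() for name, value in headers.items()}


header = {
    # Accidental leading space, as in the original question.
    'User-Agent': ' Mozilla/5.0 (Windows NT 10.0; WOW64)',
}

cleaned = clean_headers(header)
# repr() makes leading or trailing whitespace visible when printed.
print(repr(cleaned['User-Agent']))
```

The cleaned dict can then be passed as usual, for example `requests.get('https://www.qidian.com/', headers=clean_headers(header))`.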