Beginner needs help with a web scraper. Could someone experienced take a look? Thanks!
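The code is below. It runs without any error, but nothing is printed. I added a few print calls in the middle and they print nothing either!
Output: Process finished with exit code 0
The site being scraped is Douban Movie: https://movie.douban.com/chart

import requests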
import re
import json

# Request headers: requests expects a dict, not a bare string
heads = {'User-Agent': 'Mozilla/5.0 (Linux; Android 6.0; Nexus 5 Build/MRA58N) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Mobile Safari/537.36'}

# Fetch the page
def get_page(url):
    # Pass the headers by keyword; the second positional argument of requests.get is params
    response = requests.get(url, headers=heads)
    return response.text

# Parse the page
def re_html(html):
    # For now the regex only captures the total number of reviewers
    r = re.compile('<table.*?class="p1">(.*?)</span>.*?</table>', re.S)
    item = re.findall(r, html)
    print(item)
    for i in item:
        yield {
            'name': i,
            # 'start': i,
            # 'num': i
        }

# Write to a txt file
def writedata(context):
    # The with block closes the file on exit, so no explicit f.close() is needed
    with open('0.txt', 'a', encoding='utf-8') as f:
        f.write(json.dumps(context, ensure_ascii=False) + '\n')

# Main function
def main():
    url = 'https://movie.douban.com/chart/'
    html = get_page(url)
    for x in re_html(html):
        print(x)
        writedata(x)

if __name__ == '__mian__':
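    main()

The "main" on the second-to-last line is misspelled.

if __name__ == '__main__':
    main()

sky wrote on 2018-1-19 11:40:
The "main" on the second-to-last line is misspelled.

.........Wow, I spent the whole morning looking for that.....

ba21 wrote on 2018-1-19 11:40:
if __name__ == '__main__':
    main()

Thanks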
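
For anyone who lands here with the same symptom, here is a small sketch of why the typo gives no error and no output. When a file is run directly, Python sets __name__ to the string '__main__'. Comparing it with '__mian__' is a perfectly valid comparison that is simply False, so the block is skipped and main() is never called. The helper guard_fires below is hypothetical and only imitates the comparison; it is not part of the original script.

```python
def main():
    # Stand-in for the real scraping work
    return "main() ran"


def guard_fires(guard_string, module_name="__main__"):
    """Imitate `if __name__ == guard_string:` for a module whose
    __name__ is module_name (hypothetical helper, for illustration)."""
    return module_name == guard_string


# Python sets __name__ to "__main__" when a file is run directly.
print(guard_fires("__main__"))  # True: main() would be called
print(guard_fires("__mian__"))  # False: the block is skipped, with no error

if __name__ == "__main__":
    print(main())
```

Because a false condition is not an error, the interpreter exits normally, which matches the "exit code 0" in the question.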
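
Once the guard is fixed, the parsing and writing steps can be checked offline before hitting the site. This is only a sketch: SAMPLE_HTML is made-up markup that reuses the class name from the question, and the real Douban page may well look different, so the pattern would need adjusting against the actual response.

```python
import json
import re

# Hypothetical markup for illustration only; the real page may differ.
SAMPLE_HTML = """
<table width="100%">
  <tr><td>
    <span class="p1">12345人评价</span>
  </td></tr>
</table>
"""

# re.S lets "." match newlines, so one pattern can span several lines of HTML.
PATTERN = re.compile(r'<table.*?class="p1">(.*?)</span>.*?</table>', re.S)


def parse(html):
    """Yield one dict per regex match, like re_html in the question."""
    for text in PATTERN.findall(html):
        yield {"name": text}


rows = list(parse(SAMPLE_HTML))
print(rows)
# ensure_ascii=False keeps non-ASCII characters readable instead of \u escapes
print(json.dumps(rows[0], ensure_ascii=False))
```

If the list printed here is empty for the real page, the regex is the next thing to look at, not the guard.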