LZ写的挺好的 #1

Allianzcortex · 2015-10-31T13:45:24Z

偶然见到LZ写的文章，非常漂亮。虽然和我的教程思路不一样^^ ，用的 cookie 方法也不一样

大概看了一下代码，没有测试。可以重写增加 pipeline ，大概是这样,就可以直接在 ans.json 中
看到内容，并且也处理了 Unicode 到 utf-8 的转换
'
import json
import codecs
class doubanBookPipeline(object):

def __init__(self):
    self.file = codecs.open('ans.json', 'wb', encoding='utf-8')

def process_item(self, item, spider):
    line = json.dumps(dict(item), ensure_ascii=False) + "\n"
    self.file.write(line)
    return item

def spider_closed(self, spider):
    self.file.close()

    file = codecs.open(filename,'wb',encoding='utf-8')

'

还有 LZ 的代码现在应该登录不上去了？ zhihu.com/login 已经被取消了，换用 zhihu.com 直接提交表单应该就可以。

祝顺利啦~

The text was updated successfully, but these errors were encountered:

Allianzcortex · 2015-10-31T13:46:36Z

request 库里有丧心病狂的直接复制粘贴 cookie~~

额，我上面是说我没有下载代码测试…………

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LZ写的挺好的 #1

LZ写的挺好的 #1

Allianzcortex commented Oct 31, 2015

Allianzcortex commented Oct 31, 2015

LZ写的挺好的 #1

LZ写的挺好的 #1

Comments

Allianzcortex commented Oct 31, 2015

Allianzcortex commented Oct 31, 2015