xiaohongshu_spider

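Scrapes the comments on Xiaohongshu (小红书) notes.

Note: this code is for hobby exploration only. Do not use it for commercial purposes or illegal research. Anyone who does so bears the consequences themselves; the author takes no responsibility.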

1. Overview

The scraped data includes:

Commenter nickname, ID, comment level, and comment content
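
As a rough illustration of the four fields listed above, one scraped comment could be held in a small record type. The class name CommentRecord, its attribute names, and the sample values are assumptions made for this sketch and are not taken from the project. The source also does not say whether the ID belongs to the commenter or to the comment, so the field is simply called id_ here.

```python
from dataclasses import dataclass, asdict


@dataclass
class CommentRecord:
    """One scraped comment (hypothetical layout, for illustration only)."""
    nickname: str  # commenter nickname
    id_: str       # the ID field listed above
    level: int     # comment level; the numbering scheme is an assumption
    content: str   # comment text


# Placeholder values, not real scraped data.
record = CommentRecord(nickname="example_user", id_="0000", level=1,
                       content="example comment")

# asdict() gives a plain dict, convenient for writing CSV or JSON rows.
row = asdict(record)
```

Keeping the record as a dataclass makes the column order explicit, which helps when the rows are later written out to a file.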

First, a screenshot:

2. Scraping process

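Open a Xiaohongshu page, press F12 to open the browser developer tools, inspect the XHR requests, and find the one that carries the comment data.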

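All of the content sits under comments in the response. Paging is driven by a cursor: each response returns the cursor to send with the next request. The logic is as follows: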

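 # Cursor returned by the previous response; it selects the next page
 next_cursor = json_text['data']['cursor']

 if page == 1:
     # The first page is requested with an empty cursor
     url = 'https://edith.xiaohongshu.com/api/sns/web/v2/comment/page?note_id={}&cursor=&top_comment_id=&image_formats=jpg,webp,avif'.format(note_id)
 else:
     print(colorama.Fore.GREEN + "[info] entering the next loop iteration")
     # Later pages pass the cursor taken from the previous response
     url = 'https://edith.xiaohongshu.com/api/sns/web/v2/comment/page?note_id={}&cursor={}&top_comment_id=&image_formats=jpg,webp,avif'.format(note_id, next_cursor)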
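
The fragment above shows a single step. A minimal sketch of the whole paging loop might look like the following. It is not the project's actual implementation. The helper names build_url and iter_comment_pages, the fetch_json callback, the max_pages limit, and the has_more stop flag are all assumptions made for illustration. Only the URL, the comments key, and data.cursor come from the text above. The HTTP call is passed in as a function so the loop can be exercised without network access.

```python
from typing import Callable, Iterator, List

BASE_URL = 'https://edith.xiaohongshu.com/api/sns/web/v2/comment/page'


def build_url(note_id: str, cursor: str = '') -> str:
    """Build the comment-page URL; an empty cursor requests the first page."""
    return ('{}?note_id={}&cursor={}&top_comment_id='
            '&image_formats=jpg,webp,avif').format(BASE_URL, note_id, cursor)


def iter_comment_pages(note_id: str,
                       fetch_json: Callable[[str], dict],
                       max_pages: int = 50) -> Iterator[List]:
    """Yield the comment list of each page, following the cursor."""
    cursor = ''
    for _ in range(max_pages):  # hard cap so a bad cursor cannot loop forever
        json_text = fetch_json(build_url(note_id, cursor))
        data = json_text['data']
        yield data.get('comments', [])
        # 'has_more' is an assumed field name for the stop condition.
        if not data.get('has_more'):
            break
        cursor = data['cursor']


def fake_fetch(url: str) -> dict:
    """Stand-in for a real HTTP request, returning made-up pages."""
    if 'cursor=&' in url:  # empty cursor means first page
        return {'data': {'comments': ['a', 'b'], 'cursor': 'c1', 'has_more': True}}
    return {'data': {'comments': ['c'], 'cursor': '', 'has_more': False}}


pages = list(iter_comment_pages('note123', fake_fetch))
```

In real use, fetch_json would wrap an HTTP GET with whatever headers and cookies the site requires and return the parsed JSON body.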