abbypan / Novel-Robot-Parser

Get novel / BBS content from websites: a parsing engine for novel sites.

NAME

Novel::Robot::Parser

Get novel / BBS content from websites. A parsing engine for novel sites.

SUPPORTED SITES

Supports downloading novels and forum threads from websites.

See %SITE_DOM_NAME in Parser.pm for the list of sites.

Supports txt files.

txt : parse the specified txt file

Supports raw files (serialized data).

raw : parse the specified binary file packed with MessagePack

INIT

new

Initializes the parser module. A site name or a URL must be given.

# site name: specify the site directly
my $parser = Novel::Robot::Parser->new( site => 'jjwxc' );

# url: detect the site automatically from the URL
my $url = 'http://www.jjwxc.net/onebook.php?novelid=2456';
my $parser = Novel::Robot::Parser->new( site => $url );

# site name 'default': no specific site name is given
my $url = 'http://www.zhonghuawuxia.com/book/71';
my $parser = Novel::Robot::Parser->new( site => 'default' );
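
To make the URL-based detection above more concrete, here is a minimal sketch of how a domain-to-site lookup with a 'default' fallback could work. It is not the module's actual code: the table %site_for_domain and the function detect_site are hypothetical names, and the single table entry is an assumption based on the jjwxc example. The real list is %SITE_DOM_NAME in Parser.pm.

```perl
use strict;
use warnings;

# Hypothetical domain-to-site table, for illustration only.
# The real list lives in %SITE_DOM_NAME in Parser.pm and may differ.
my %site_for_domain = (
    'www.jjwxc.net' => 'jjwxc',
);

# Return a site name for a URL, or 'default' when the host is unknown
# or the string does not look like an http(s) URL.
sub detect_site {
    my ($url) = @_;
    my ($host) = $url =~ m{^https?://([^/:?#]+)}i;
    return 'default' unless defined $host;
    return $site_for_domain{ lc $host } // 'default';
}

print detect_site('http://www.jjwxc.net/onebook.php?novelid=2456'), "\n";    # jjwxc
print detect_site('http://www.zhonghuawuxia.com/book/71'), "\n";             # default
```

In this sketch an unknown host falls back to 'default', which mirrors the third new() call above, where no specific site name is given.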

NOVEL FUNCTION

get_item_ref

Gets the content of a novel or forum thread and returns a hash reference.

my $r = $parser->get_item_ref($url, %opt);
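
The keys of the returned hash reference are not listed here, so a quick way to explore one is to print its top-level keys and the kind of value each holds. The helper describe_ref below is a hypothetical name, not part of Novel::Robot::Parser, and the mock data uses made-up keys only to show the output shape.

```perl
use strict;
use warnings;

# List the top-level keys of a hash ref, sorted, together with the
# kind of value each key holds (ARRAY, HASH, ... or a plain value).
sub describe_ref {
    my ($r) = @_;
    return map {
        my $kind = ref( $r->{$_} ) || 'plain value';
        "$_ ($kind)";
    } sort keys %$r;
}

# Mock data with made-up keys; the real structure may differ.
my $mock = { title => 'example', chapters => [ 'one', 'two' ] };
print "$_\n" for describe_ref($mock);
# chapters (ARRAY)
# title (plain value)

# With a real parser object, the same helper could be applied as:
#   my $r = $parser->get_item_ref($url);
#   print "$_\n" for describe_ref($r);
```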

get_item_info

Gets the index page info of a novel, or the first page info of a forum thread.

my $r = $parser->get_item_info($url);

BOARD FUNCTION

A board groups multiple items: a writer has multiple books, and a forum board has multiple threads.

get_board_ref

Gets writer / board info.

my $r = $parser->get_board_ref($url, %opt);

QUERY FUNCTION

get_query_ref

Runs a query and returns the results.

my $query_type = '作者';       # query by author
my $query_keyword = '顾漫';    # author name to search for

my ($info, $items_ref) = $parser->get_query_ref( $query_keyword, 
    query_type => $query_type );
