GaoShen's repositories

spider

A configurable web spider with a easy-to-use web console

Language:JavaLicense:GPL-3.0Stargazers:987Issues:122Issues:28

DistributeCrawler

基于Map/Reduce爬虫,可抽取各大新闻网站的新闻正文并进行分类和聚类

stickerchat

Dataset for WWW 2020 paper "Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog"

Language:PythonStargazers:38Issues:3Issues:0

productqa

Product-Aware Answer Generation in E-Commerce Question-Answering

proto-summ

Dataset proposed by ''How to Write Summaries with Patterns? Learning towards Abstractive Summarization through Prototype Editing''

DistributedCrawler

DistributeCrawler的Maven版

Language:JavaLicense:Apache-2.0Stargazers:10Issues:3Issues:0

HeteroQA

WSDM 2022 paper HeteroQA: Learning towards Question-and-Answering through Multiple Information Sources via Heterogeneous Graph Modeling

Language:HTMLStargazers:2Issues:0Issues:0

table-summ

BioGen: Generating Biography Summary under Table Guidance on Wikipedia

Stargazers:2Issues:0Issues:0

char-rnn-tensorflow

Multi-layer Recurrent Neural Networks (LSTM, RNN) for character-level language models in Python using Tensorflow

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

dialog-summ

Code for ACL 2023 paper "Dialogue Summarization with Structure-aware Graph Modeling via Static-Dynamic Fused Graph"

Beijing_Daxuexi_Simple

北京 青年大学习 使用Github Actions自动完成

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Commons

常用的Java和Python库

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

DateConverter

Convert the date in Excel

Language:JavaStargazers:0Issues:1Issues:0

DistributedWebSearcher

DistributedCrawler的Web搜索站点

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

docs

Little book of webmagic.

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:1Issues:0

GitStudy

用于Git操作的学习

License:Apache-2.0Stargazers:0Issues:1Issues:0

LuceneStudy

Lucene3.0的学习

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

markdownj

MarkdownJ

Language:JavaLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

PKUAutoSubmit

PKU一键出入校备案小工具

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

TextBox

TextBox 2.0 is a text generation library with pre-trained language models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

webmagic

A scalable web crawler framework.

Language:JavaStargazers:0Issues:0Issues:0
Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0