Jimmy Lin (lintool)

lintool

Geek Repo

0

following

0

stars

Company:University of Waterloo

Location:Nearby data lake

Home Page:https://cs.uwaterloo.ca/~jimmylin/

Twitter:@lintool

Github PK Tool:Github PK Tool


Organizations
afterburnerdb
archivesunleashed
beir-cellar
castorini
dsg-uwaterloo
dstlry
gwf-uwaterloo
liarr2017
osirrc
project-miracl
recommenders
rsvp-ai
trecrts

Jimmy Lin's repositories

MapReduceAlgorithms

Data-Intensive Text Processing with MapReduce

Language:TeXStargazers:617Issues:79Issues:0

guide

The Student's Guide to @lintool

bespin

Reference implementations of data-intensive algorithms in MapReduce and Spark

Language:JavaLicense:NOASSERTIONStargazers:81Issues:14Issues:10

bigdata-2018w

CS 451/651 431/631 Data-Intensive Distribute Computing (Winter 2018) at the University of Waterloo

Language:HTMLStargazers:71Issues:10Issues:0

bigcows

Scrapes citation statistics from Google Scholar

my-data-is-bigger-than-your-data

My data is bigger than your data!

Language:HTMLStargazers:39Issues:8Issues:0

wikiclean

A Java Wikipedia markup to plain text converter

bigdata-2018f

CS 451/651 Data-Intensive Distribute Computing (Fall 2018) at the University of Waterloo

Language:HTMLStargazers:23Issues:6Issues:0

tools

Lintools: tools by @lintool

Language:JavaLicense:NOASSERTIONStargazers:22Issues:7Issues:1

art-science-empirical-cs-2022f

The Art and Science of Empirical Computer Science (Fall 2022)

robust04-analysis

Meta-Analysis of Robust04 Papers (Yang et al., SIGIR 2019)

art-science-empirical-cs-2023f

The Art and Science of Empirical Computer Science (Fall 2023)

non-blind-review

My proposal for non-blind reviewing at *ACL

IR-Reproducibility2

The Replicability of IR Replicability Experiments

Language:ShellStargazers:5Issues:4Issues:0

UROC-projects

Undergraduate Research Opportunities Conference sponsored by the University of Waterloo

bespin-data

Datasets for Bespin

Language:PythonStargazers:4Issues:3Issues:0

AnseriniMaven

Maven repo for some Anserini dependencies.

aut

The Archives Unleashed Toolkit is an open-source platform for analyzing web archives.

Language:ScalaLicense:NOASSERTIONStargazers:2Issues:3Issues:0

msmarco

website for MS Marco

Language:JavaScriptLicense:CC-BY-4.0Stargazers:2Issues:2Issues:0
Language:DockerfileStargazers:2Issues:3Issues:0
Language:PythonLicense:CC-BY-4.0Stargazers:2Issues:2Issues:0

TREC-2019-Deep-Learning

Website for the TREC Deep Learning Track 2019

Language:PythonLicense:CC-BY-4.0Stargazers:2Issues:3Issues:0

TS4

Tweet Streaming Selective Search with Spark

Language:JavaStargazers:2Issues:3Issues:0

cs-big-cows

List of people with great achievements in Computer Science

csranking-aica

Visualizations of top Canadian universities for AI research by CSRankings

Language:HTMLStargazers:0Issues:2Issues:0