Jimmy Lin (lintool)

Company:University of Waterloo

Location:Nearby data lake

Home Page:https://cs.uwaterloo.ca/~jimmylin/

Twitter:@lintool


Organizations
afterburnerdb
archivesunleashed
beir-cellar
castorini
dsg-uwaterloo
dstlry
gwf-uwaterloo
liarr2017
osirrc
project-miracl
recommenders
rsvp-ai
trecrts

Jimmy Lin's repositories

Cloud9

Cloud9 is a Hadoop toolkit for working with big data

Language:Java Stargazers:236 Issues:30 Issues:0

warcbase

Warcbase is an open-source platform for managing and analyzing web archives

Mr.LDA

Scalable Topic Modeling using Variational Inference in MapReduce

Language:Java License:Apache-2.0 Stargazers:149 Issues:32 Issues:8

Ivory

A Hadoop toolkit for web-scale information retrieval research

Language:Java Stargazers:79 Issues:21 Issues:0

UMD-courses

Course homepages for courses that I've taught at the University of Maryland

Language:HTML Stargazers:53 Issues:14 Issues:0

IR-Reproducibility

Open-Source Information Retrieval Reproducibility Challenge

bigdata-2016w

CS 489/698 Big Data Infrastructure (Winter 2016) at the University of Waterloo

Language:HTML Stargazers:38 Issues:6 Issues:0

SparkTutorial

Spark Tutorial at the University of Maryland

Stargazers:38 Issues:0 Issues:0

clueweb

Hadoop tools for manipulating ClueWeb collections

Language:Java Stargazers:26 Issues:7 Issues:0

chrome-archive-this-page

Internet Archive "Save a Page" Plug-In for Chrome

Language:JavaScript Stargazers:23 Issues:5 Issues:0

bigdata-2017w

CS 489/698 Big Data Infrastructure (Winter 2017) at the University of Waterloo

Language:HTML Stargazers:15 Issues:5 Issues:0

TweetAnalysisWithSpark

Tweet Analysis with Spark

Language:Scala Stargazers:15 Issues:4 Issues:0

JASS

Anytime Ranking for Impact-Ordered Indexes

Language:C Stargazers:12 Issues:4 Issues:0

JScene

A proof-of-concept in-browser JavaScript-based search engine

Language:JavaScript Stargazers:12 Issues:4 Issues:0

Enron2mbox

Converting the Enron email collection to mbox format

Language:Python Stargazers:10 Issues:0 Issues:0

OptTrees

Source code for: Nima Asadi, Jimmy Lin, and Arjen P. de Vries. Runtime Optimizations for Tree-Based Machine Learning Models. IEEE Transactions on Knowledge and Data Engineering, 26(9):2281-2292, 2014.

Language:C Stargazers:9 Issues:0 Issues:0

Cassovary-vs-GraphJet

Performance comparison between Cassovary and GraphJet

Stargazers:5 Issues:0 Issues:0

c-bfscan

Implementations of brute force scans for document retrieval in C

Language:C Stargazers:3 Issues:6 Issues:0

bfscan

Document retrieval using brute force scans

Language:Java Stargazers:2 Issues:0 Issues:0

BuboQA

Question answering over knowledge graphs

Language:Python Stargazers:2 Issues:0 Issues:0

GiraphTutorial

Giraph Tutorial

Language:Java Stargazers:2 Issues:0 Issues:0

NSF-projects

NSF project homepages

Language:CSS Stargazers:2 Issues:0 Issues:0

wiki-tools

Collection of tools for working with Wikipedia

Language:Java Stargazers:2 Issues:0 Issues:0

Zambezi

Real-time indexer and search engine

Language:C Stargazers:2 Issues:3 Issues:0

c-bfscan-1

Brute force scan in C

Language:C Stargazers:1 Issues:3 Issues:0

cassovary

Cassovary is a simple big graph processing library for the JVM

Language:Scala Stargazers:1 Issues:0 Issues:0

IR-Reproducibility-exp

Experimental runs from the Open-Source Information Retrieval Reproducibility Challenge.

Language:MAXScript Stargazers:1 Issues:0 Issues:0

TweetTap

Simple program to tap the Twitter sample stream

Language:Java Stargazers:1 Issues:0 Issues:0

wikiduper

Utility for finding similar sentences on Wikipedia

Language:Mathematica Stargazers:1 Issues:0 Issues:0