Jeff Zemerick (jzonthemtn)

jzonthemtn

Geek Repo

Company:@mtnfog

Location:Pittsburgh

Home Page:jeffzemerick.dev

Twitter:@jzonthemtn

Github PK Tool:Github PK Tool


Organizations
mtnfog
philterd

Jeff Zemerick's starred repositories

search_fundamentals_course

Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/course/search-fundamentals?utm_source=daniel

Language:PythonLicense:Apache-2.0Stargazers:39Issues:0Issues:0

phileas-connector

Trino connector for Phileas PII engine

Language:JavaLicense:Apache-2.0Stargazers:2Issues:0Issues:0

philter

Philter finds and manipulates sensitive information in text.

Language:CSSLicense:Apache-2.0Stargazers:3Issues:0Issues:0

reposilite

Lightweight and easy-to-use repository management software dedicated for the Maven based artifacts in the JVM ecosystem 📦

Language:KotlinLicense:Apache-2.0Stargazers:1337Issues:0Issues:0

phileas

The open source PII and PHI redaction engine

Language:JavaLicense:Apache-2.0Stargazers:21Issues:0Issues:0

langchain4j

Java version of LangChain

Language:JavaLicense:Apache-2.0Stargazers:4346Issues:0Issues:0

user-behavior-insights

User Behavior Insights plugin for OpenSearch

Language:JavaLicense:Apache-2.0Stargazers:16Issues:0Issues:0

libpostal

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

Language:CLicense:MITStargazers:4025Issues:0Issues:0

fundus

A very simple news crawler with a funny name

Language:PythonLicense:MITStargazers:270Issues:0Issues:0

logdy-core

Web based real-time log viewer. Stream ANY content to a web UI with autogenerated filters. Parse any format with TypeScript.

Language:GoLicense:Apache-2.0Stargazers:1169Issues:0Issues:0

DocLayNet

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

License:NOASSERTIONStargazers:230Issues:0Issues:0

secret-llama

Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.

Language:TypeScriptLicense:Apache-2.0Stargazers:2426Issues:0Issues:0

terrascan

Detect compliance and security violations across Infrastructure as Code to mitigate risk before provisioning cloud native infrastructure.

Language:GoLicense:Apache-2.0Stargazers:4667Issues:0Issues:0

trdsql

CLI tool that can execute SQL queries on CSV, LTSV, JSON, YAML and TBLN. Can output to various formats.

Language:GoLicense:MITStargazers:1946Issues:0Issues:0

go-mysql-server

A MySQL-compatible relational database with a storage agnostic query engine. Implemented in pure Go.

Language:GoLicense:Apache-2.0Stargazers:2312Issues:0Issues:0

GLiNER

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Language:PythonLicense:Apache-2.0Stargazers:1204Issues:0Issues:0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:13212Issues:0Issues:0

common-utils

Offers a library of utilities for building Java-based OpenSearch plugins

Language:KotlinLicense:Apache-2.0Stargazers:20Issues:0Issues:0

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

Stargazers:9225Issues:0Issues:0

magika

Detect file content types with deep learning

Language:RustLicense:Apache-2.0Stargazers:7682Issues:0Issues:0

opensearch-ubi

OpenSearch plugin for User Behavior Insights

Language:JavaLicense:Apache-2.0Stargazers:6Issues:0Issues:0

opensearch-plugins

For all things OpenSearch plugins. You want to install, or develop a plugin? You've come to the right place.

License:Apache-2.0Stargazers:49Issues:0Issues:0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:43742Issues:0Issues:0

mem0

The memory layer for Personalized AI

Language:PythonLicense:Apache-2.0Stargazers:20559Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:25359Issues:0Issues:0

searcharray

Full text search in your Pandas dataframe

Language:PythonLicense:Apache-2.0Stargazers:196Issues:0Issues:0

jvector

JVector: the most advanced embedded vector search engine

Language:JavaLicense:Apache-2.0Stargazers:1455Issues:0Issues:0

Stirling-PDF

#1 Locally hosted web application that allows you to perform various operations on PDF files

Language:JavaLicense:GPL-3.0Stargazers:39508Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:36631Issues:0Issues:0

fabricator

[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.

Language:PythonLicense:Apache-2.0Stargazers:99Issues:0Issues:0