John Semerdjian (semerj)

Company: https://snorkel.ai/

John Semerdjian's starred repositories

api-guidelines

Microsoft REST API Guidelines

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19579 | Issues: 299 | Issues: 1354

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | Stargazers: 18739 | Issues: 117 | Issues: 527

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 16976 | Issues: 139 | Issues: 725

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | Stargazers: 8187 | Issues: 47 | Issues: 553

llama-cpp-python

Python bindings for llama.cpp

Language: Python | License: MIT | Stargazers: 7764 | Issues: 71 | Issues: 1084

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 7595 | Issues: 48 | Issues: 647

generative-ai

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6844 | Issues: 150 | Issues: 184

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language: Python | License: Apache-2.0 | Stargazers: 6646 | Issues: 35 | Issues: 731

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language: Python | License: Apache-2.0 | Stargazers: 4076 | Issues: 56 | Issues: 19

lmql

A language for constraint-guided and efficient LLM programming.

Language: Python | License: Apache-2.0 | Stargazers: 3623 | Issues: 22 | Issues: 252

ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Language: Python | License: MIT | Stargazers: 2901 | Issues: 41 | Issues: 260

textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Language: Python | License: MIT | Stargazers: 1577 | Issues: 22 | Issues: 66

lxmert

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

Language: Python | License: MIT | Stargazers: 926 | Issues: 18 | Issues: 113

prompt-poet

Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.

Language: Python | License: MIT | Stargazers: 842 | Issues: 5 | Issues: 6

UNITER

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"

Agentless

Agentless🐱: an agentless approach to automatically solve software development problems

Language: Python | License: MIT | Stargazers: 671 | Issues: 9 | Issues: 21

colpali

The code used to train and run inference with the ColPali architecture.

Language: Python | License: MIT | Stargazers: 527 | Issues: 8 | Issues: 28

Language: Python | License: Apache-2.0 | Stargazers: 441 | Issues: 10 | Issues: 34

fonduer

A knowledge base construction engine for richly formatted data

Language: Python | License: MIT | Stargazers: 407 | Issues: 27 | Issues: 179

xmc.dspy

In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.

Language: Python | License: MIT | Stargazers: 363 | Issues: 24 | Issues: 8

realworldnlp

Example code for "Real-World Natural Language Processing"

targer

BiLSTM-CNN-CRF tagger

Language: Python | License: Apache-2.0 | Stargazers: 164 | Issues: 13 | Issues: 4

allennlp-as-a-library-example

A simple example for how to build your own model using AllenNLP as a dependency.

allennlp_tutorial

Tutorial on how to use AllenNLP for sequence modeling (including hierarchical LSTMs and CRF decoding)

Language: Python | License: MIT | Stargazers: 85 | Issues: 8 | Issues: 12

Cosmos

Knowledge base construction from raw scientific documents

tagruler

Data programming by demonstration for information extraction and span annotation

Language: JavaScript | License: Apache-2.0 | Stargazers: 35 | Issues: 5 | Issues: 6