Patrick Barker (pbarker)

Location: Boulder, CO

Organizations
aunum

Patrick Barker's starred repositories

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language: Python · License: MIT · Stars: 1501 · Issues: 0
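
The paper's core idea is replacing full-precision linear weights with 1-bit (sign) weights plus a shared full-precision scaling factor, trained via a straight-through estimator. A minimal illustrative sketch of that idea in PyTorch (not the repository's actual code, which also quantizes activations):

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class BitLinearSketch(nn.Linear):
        """Linear layer whose weights are binarized to +/-1 at forward time."""
        def forward(self, x):
            w = self.weight
            scale = w.abs().mean()  # shared full-precision scaling factor
            # Straight-through estimator: forward uses sign(w) * scale,
            # backward passes gradients through to the full-precision weights.
            w_q = w + (torch.sign(w) * scale - w).detach()
            return F.linear(x, w_q, self.bias)

    layer = BitLinearSketch(512, 512)
    y = layer(torch.randn(2, 512))  # drop-in replacement for nn.Linear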

LLM4Teach

Python code implementing LLM4Teach, a policy-distillation approach for teaching reinforcement learning agents with a Large Language Model

Language: Python · Stars: 20 · Issues: 0

rund

OCI Container Runtime for Darwin

Language: Go · License: Apache-2.0 · Stars: 441 · Issues: 0

anole

Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation

Language: Python · Stars: 602 · Issues: 0

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language: Python · License: NOASSERTION · Stars: 1688 · Issues: 0

sqlite-vec

A vector search SQLite extension that runs anywhere!

Language: C · License: Apache-2.0 · Stars: 3277 · Issues: 0
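
Typical use from Python follows the project's documented pattern: load the extension into a SQLite connection, create a vec0 virtual table, and run KNN queries with MATCH. A minimal sketch (table name and vector values are made up for illustration):

    import sqlite3
    import sqlite_vec
    from sqlite_vec import serialize_float32

    db = sqlite3.connect(":memory:")
    db.enable_load_extension(True)
    sqlite_vec.load(db)  # load the compiled extension into this connection
    db.enable_load_extension(False)

    # Virtual table holding 4-dimensional float32 vectors
    db.execute("CREATE VIRTUAL TABLE items USING vec0(embedding float[4])")
    db.execute(
        "INSERT INTO items(rowid, embedding) VALUES (1, ?)",
        (serialize_float32([0.1, 0.2, 0.3, 0.4]),),
    )

    # KNN query: nearest rows to the query vector, closest first
    rows = db.execute(
        "SELECT rowid, distance FROM items "
        "WHERE embedding MATCH ? ORDER BY distance LIMIT 3",
        (serialize_float32([0.1, 0.2, 0.3, 0.4]),),
    ).fetchall()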

sqlite-vss

A SQLite extension for efficient vector search, based on Faiss!

Language: C++ · License: MIT · Stars: 1656 · Issues: 0

pyvecdb

A Python library for efficient similarity search using high-dimensional vectors.

Language: Python · Stars: 2 · Issues: 0

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python · License: MIT · Stars: 5919 · Issues: 0
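
In practice this library is most often reached through Hugging Face transformers, which uses it as the backend for 4-bit and 8-bit model loading. A minimal sketch (the model id is just an example):

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    bnb_config = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",              # NormalFloat4 weight quantization
        bnb_4bit_compute_dtype=torch.bfloat16,  # matmuls computed in bf16
    )
    model = AutoModelForCausalLM.from_pretrained(
        "meta-llama/Llama-2-7b-hf",             # example model id
        quantization_config=bnb_config,
        device_map="auto",
    )
    tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")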

agenata

Build Web Datasets with Ease

Language: JavaScript · Stars: 32 · Issues: 0

ml-4m

4M: Massively Multimodal Masked Modeling

Language: Python · License: Apache-2.0 · Stars: 1507 · Issues: 0

Phi3-Vision-Finetune

An open-source implementation for fine-tuning Phi-3-Vision-128k-Instruct by Microsoft.

Language: Python · License: Apache-2.0 · Stars: 36 · Issues: 0

lumos

Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"

Language: Python · License: MIT · Stars: 433 · Issues: 0

digirl

Official repo for the paper "DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning".

Language: Python · License: Apache-2.0 · Stars: 186 · Issues: 0

Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Language: Python · Stars: 2208 · Issues: 0

open_clip

An open source implementation of CLIP.

Language: Python · License: NOASSERTION · Stars: 9554 · Issues: 0
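
Zero-shot image/text scoring follows the project's README pattern: build the model plus preprocessing transforms, encode both modalities, and compare normalized embeddings. A minimal sketch (the image file is hypothetical):

    import torch
    from PIL import Image
    import open_clip

    model, _, preprocess = open_clip.create_model_and_transforms(
        "ViT-B-32", pretrained="laion2b_s34b_b79k")
    tokenizer = open_clip.get_tokenizer("ViT-B-32")

    image = preprocess(Image.open("cat.jpg")).unsqueeze(0)  # hypothetical file
    text = tokenizer(["a cat", "a dog"])

    with torch.no_grad():
        img_feat = model.encode_image(image)
        txt_feat = model.encode_text(text)
        img_feat /= img_feat.norm(dim=-1, keepdim=True)
        txt_feat /= txt_feat.norm(dim=-1, keepdim=True)
        probs = (100.0 * img_feat @ txt_feat.T).softmax(dim=-1)  # label probabilities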

ruff

An extremely fast Python linter and code formatter, written in Rust.

Language: Rust · License: MIT · Stars: 29945 · Issues: 0

awesome-ai-agents

A list of AI autonomous agents

License: NOASSERTION · Stars: 9248 · Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python · License: GPL-3.0 · Stars: 9565 · Issues: 0

datamodel-code-generator

Pydantic model and dataclasses.dataclass generator for easy conversion of JSON, OpenAPI, JSON Schema, and YAML data sources.

Language: Python · License: MIT · Stars: 2547 · Issues: 0
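
Besides the datamodel-codegen CLI, the package exposes a Python generate() API. A minimal sketch with a toy JSON Schema, following the project's "use as module" docs (treat the details as approximate):

    from pathlib import Path
    from datamodel_code_generator import InputFileType, generate

    json_schema = """{
      "type": "object",
      "properties": {"name": {"type": "string"}, "age": {"type": "integer"}}
    }"""

    out = Path("model.py")
    generate(
        json_schema,
        input_file_type=InputFileType.JsonSchema,
        input_filename="example.json",  # label recorded in the generated header
        output=out,
    )
    print(out.read_text())  # emits a Pydantic model for the schema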

ms-swift

Use PEFT or full-parameter training to fine-tune 300+ LLMs or 60+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language: Python · License: Apache-2.0 · Stars: 2953 · Issues: 0

screen2words

The dataset includes screen summaries that describe the functionality of Android app screenshots. It is used for training and evaluating the screen2words models (our paper, accepted at UIST'21, will be linked soon).

Stars: 43 · Issues: 0

widget-caption

The dataset includes widget captions that describe UI elements' functionality. It is used for training and evaluating the widget captioning model (see the EMNLP'20 paper: https://arxiv.org/abs/2010.04295).

Stars: 16 · Issues: 0

taperception

This repository contains the datasets that were used for the research described in "Predicting and Explaining Mobile UI Tappability with Vision Modeling and Saliency Analysis" by Eldon Schoop, Xin Zhou, Gang Li, Zhourong Chen, Bjoern Hartmann and Yang Li, which is to appear in CHI 2022.

Stars: 5 · Issues: 0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python · License: Apache-2.0 · Stars: 3136 · Issues: 0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook · License: NOASSERTION · Stars: 7347 · Issues: 0