Daniel Radu's starred repositories

composer

Supercharge Your Model Training

Language:PythonLicense:Apache-2.0Stargazers:5053Issues:0Issues:0

quaterion

Blazing fast framework for fine-tuning similarity learning models

Language:PythonLicense:Apache-2.0Stargazers:627Issues:0Issues:0

PurpleLlama

Set of tools to assess and improve LLM security.

Language:PythonLicense:NOASSERTIONStargazers:2086Issues:0Issues:0

LLMEvaluation

A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods.

Stargazers:37Issues:0Issues:0

Monocle

Tooling backed by an LLM for performing natural language searches against compiled target binaries. Search for encryption logic, password strings, vulnerabilities, etc.

Language:PythonLicense:GPL-3.0Stargazers:128Issues:0Issues:0

osquery-attck

Mapping the MITRE ATT&CK Matrix with Osquery

License:Apache-2.0Stargazers:761Issues:0Issues:0

hurl

Hurl, run and test HTTP requests with plain text.

Language:RustLicense:Apache-2.0Stargazers:12078Issues:0Issues:0

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1641Issues:0Issues:0

lida

Automatic Generation of Visualizations and Infographics using Large Language Models

Language:Jupyter NotebookLicense:MITStargazers:2513Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:53853Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:10755Issues:0Issues:0

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14225Issues:0Issues:0

griptape

Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.

Language:PythonLicense:Apache-2.0Stargazers:1718Issues:0Issues:0

h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Language:PythonLicense:Apache-2.0Stargazers:10859Issues:0Issues:0

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:32544Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-4-ClauseStargazers:9653Issues:0Issues:0

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:14250Issues:0Issues:0

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language:PythonLicense:Apache-2.0Stargazers:5967Issues:0Issues:0

fastbloom

A fast bloom filter implemented by Rust for Python! 10x faster than pybloom!

Language:RustLicense:Apache-2.0Stargazers:68Issues:0Issues:0

apitable

πŸš€πŸŽ‰πŸ“š APITable, an API-oriented low-code platform for building collaborative apps and better than all other Airtable open-source alternatives.

Language:TypeScriptLicense:AGPL-3.0Stargazers:12228Issues:0Issues:0

ai-collection

The Generative AI Landscape - A Collection of Awesome Generative AI Applications

License:MITStargazers:6871Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:33154Issues:0Issues:0

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language:PythonLicense:MITStargazers:5692Issues:0Issues:0

google-api-python-client

🐍 The official Python client library for Google's discovery based APIs.

Language:PythonLicense:Apache-2.0Stargazers:7504Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:18905Issues:0Issues:0

splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Language:PythonLicense:NOASSERTIONStargazers:678Issues:0Issues:0

karton

Distributed malware processing framework based on Python, Redis and S3.

Language:PythonLicense:BSD-3-ClauseStargazers:374Issues:0Issues:0

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLLicense:CC0-1.0Stargazers:105900Issues:0Issues:0

BTVM

C++11 implementation of 010 Editor's template language

Language:C++License:GPL-3.0Stargazers:33Issues:0Issues:0

HexRaysPyTools

IDA Pro plugin which improves work with HexRays decompiler and helps in process of reconstruction structures and classes

Language:PythonStargazers:113Issues:0Issues:0