Challenging's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 4658 | Issues: 0

langchain

🦜🔗 Build context-aware reasoning applications

Language: Python | License: MIT | Stargazers: 82643 | Issues: 0

CodeQwen1.5

CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

Language: Python | Stargazers: 148 | Issues: 0

NCISurvey

Neural Code Intelligence Survey 2024; Reading lists and resources

License: MIT | Stargazers: 153 | Issues: 0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

Language: Python | License: MIT | Stargazers: 9683 | Issues: 0

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 3285 | Issues: 0

OSWorld

OSWorld: A unified, real computer environment for multimodal agents to evaluate open-ended computer tasks involving arbitrary apps and interfaces on Ubuntu, Windows, and macOS

Language: Python | License: Apache-2.0 | Stargazers: 308 | Issues: 0

code-html-to-markdown

A lightweight script for converting HTML pages to Markdown, with support for code blocks

Language: HTML | License: MIT | Stargazers: 52 | Issues: 0

CodeScope

Benchmark, datasets and code for the paper CodeScope.

Language: Python | License: MIT | Stargazers: 69 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 5233 | Issues: 0

KnowCoder

Official repository for the paper "KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction". In the paper, we propose KnowCoder, the most powerful large language model so far for universal information extraction.

License: NOASSERTION | Stargazers: 14 | Issues: 0

magicoder

Magicoder: Source Code Is All You Need

Language: Python | License: MIT | Stargazers: 1859 | Issues: 0

Awesome-LLM-Tabular

Awesome-LLM-Tabular: a curated list of Large Language Models applied to tabular data

Stargazers: 128 | Issues: 0

Awesome-LM-SSP

A reading list on the safety, security, and privacy of large models.

License: Apache-2.0 | Stargazers: 376 | Issues: 0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 22486 | Issues: 0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | License: Apache-2.0 | Stargazers: 16739 | Issues: 0

diffusion-of-thoughts

Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

Language: Python | Stargazers: 44 | Issues: 0

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | Stargazers: 2432 | Issues: 0

human-eval-infilling

Code for the paper "Efficient Training of Language Models to Fill in the Middle"

Language: Python | License: MIT | Stargazers: 144 | Issues: 0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 10187 | Issues: 0

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language: Python | License: Apache-2.0 | Stargazers: 2071 | Issues: 0

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language: Python | License: MIT | Stargazers: 136 | Issues: 0

RAGxplorer

Open-source tool to visualise your RAG 🔮

Language: Jupyter Notebook | License: MIT | Stargazers: 958 | Issues: 0

RefChecker

RefChecker provides an automatic checking pipeline and a benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

Language: Python | License: Apache-2.0 | Stargazers: 189 | Issues: 0

JioNLP

A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com

Language: Python | License: Apache-2.0 | Stargazers: 2949 | Issues: 0

dateutil

Useful extensions to the standard Python datetime features

Language: Python | License: NOASSERTION | Stargazers: 2244 | Issues: 0

LongLM

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language: Python | License: MIT | Stargazers: 461 | Issues: 0

MESED

[AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities

Language: Python | Stargazers: 12 | Issues: 0