Repositories under the reasoning-agent topic:
MedRAX: Medical Reasoning Agent for Chest X-ray - ICML 2025
Democratizing AI scientists with ToolUniverse
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
[Up-to-date] Awesome Agentic Deep Research Resources
Open-source generalized AI agent for everyday task automations.
This repository collects papers on VLLM applications. New papers are added at irregular intervals.
A powerful Python framework for orchestrating AI agents and managing complex LLM-driven tasks with ease.
A Generative AI Assistant with advanced agentic capabilities. Codebuddy uses machine learning to generate code, complete tasks, and streamline coding workflows.
Case studies of the LangGraph agent.
[EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
LLMs as Method Actors: A Model for Prompt Engineering and Architecture
Analyzing and scoring reasoning traces of LLMs
[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
AI search engine that uses reasoning models to refine queries, synthesize data, and provide insightful research responses.
AI forecasting tools to help humans forecast the future, plus a framework for building a Metaculus AI Benchmarking Tournament bot.
[Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search
Intervenes in the thinking process of reasoning models such as DeepSeek-R1, giving effective control over how they reason.
MindBridge is an AI orchestration MCP server that lets any app talk to any LLM — OpenAI, Anthropic, DeepSeek, Ollama, and more — through a single unified API. Route queries, compare models, get second opinions, and build smarter multi-LLM workflows.
A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency of long-video VLMs. (ICCV2025)
Fragaria is a powerful and flexible Chain of Thought (CoT) reasoning API that leverages various Large Language Model (LLM) providers and incorporates Reinforcement Learning (RL) techniques to solve complex problems and answer intricate questions.
Code and dataset for the ICLR 2024 paper "Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language Models."
ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture
Turn stories, strategies, or systems into insight. Auto-generate Dialectical Wheels (DWs) from any text to reveal blind spots, surface polarities, and trace dynamic paths toward synthesis. DWs are semantic maps that expose tension, transformation, and coherence within a system—whether narrative, ethical, organizational, or technological.
Codebase and tutorial of ContPhy dataset generation for ICML 2024 paper "ContPhy: Continuum Physical Concept Learning and Reasoning from Videos"
Autonomous conversational AI agent that works as your assistant coach, strategist, psychologist, and personal life ally
Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation
A Multi-Agent Reasoning Problem Solver. You build teams and they work together to solve the problems you give them.
Weaver: A modular agentic pipeline that dynamically combines SQL and LLMs for advanced table-based question answering
Official repository for the paper 'Meta-Reasoning Improves Tool Use in Large Language Models'.
A flexible foundation AI system for creating A2A-compatible autonomous AI agents that can collaborate, reason, and execute complex tasks through standardized agent-to-agent communication protocols.
High-level Multi Intelligent Agent
Code for paper: Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study