AutoResearch: Autonomous ML Research Framework

Published: May 01, 2026

🔬 AutoResearch Overview

AutoResearch is an agent- and device-agnostic autonomous machine learning research framework that lets AI agents conduct end-to-end experiments independently. Unlike traditional AutoML tools, AutoResearch provides a complete research loop in which agents modify code, run experiments, evaluate results, and learn from a persistent semantic memory.

🌟 Key Features

🤖 Advanced Agent Integration: Built-in protocols for advanced agentic systems like Claude Code, OpenCode, and OpenClaw.
🧠 Semantic Research Memory: Long-term RAG-based memory using ChromaDB allows agents to learn from every past experiment.
📈 Industrial Observability: Native integration with Weights & Biases (W&B) for real-time metric tracking and artifact management.
⚡ High-Performance Backends: Native support for Apple MLX, JAX, and PyTorch (CUDA/MPS/CPU).
🚀 Distributed Scaling: Parallelize research across clusters using Ray.
🛡️ Robust Orchestration: SQL-backed metadata management and automatic Git-based experiment versioning.
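
To make the semantic memory idea concrete, here is a minimal sketch of storing past experiments and retrieving the most similar ones for a new idea. It is not the framework's ChromaDB-backed implementation: the `ResearchMemory` class, the toy bag-of-words `embed` function, and the metric values are all assumptions made for illustration, and a real setup would use a learned embedding model and a vector database.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase bag-of-words counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class ResearchMemory:
    """Hypothetical in-memory store; not the framework's actual memory class."""

    def __init__(self) -> None:
        self._records = []  # list of (description, metric, embedding)

    def add(self, description: str, metric: float) -> None:
        self._records.append((description, metric, embed(description)))

    def query(self, text: str, k: int = 2):
        """Return the k past experiments most similar to `text`."""
        q = embed(text)
        ranked = sorted(self._records, key=lambda r: cosine(q, r[2]), reverse=True)
        return [(desc, metric) for desc, metric, _ in ranked[:k]]


memory = ResearchMemory()
# Illustrative placeholder values, not real results.
memory.add("AdamW optimizer with cosine learning rate schedule", 3.21)
memory.add("Muon optimizer with warmup", 3.05)
memory.add("Wider feed-forward layers", 3.30)

similar = memory.query("try a different optimizer", k=2)
```

An agent could prepend `similar` to its next prompt so that earlier outcomes inform the next change, which is the retrieval-augmented pattern the feature list describes.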

🏗️ Architecture

AutoResearch acts as the Research Platform (The Orchestrator) while the AI system acts as the Brain (The Agent).
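
The split between orchestrator and agent can be sketched roughly as follows. Every name here (`Orchestrator`, `RandomSearchAgent`, `Experiment`) is hypothetical and not taken from the AutoResearch codebase; the "agent" is a simple random perturbation rule standing in for an AI system, and the "experiment" is a toy quadratic standing in for a training run.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Experiment:
    description: str
    learning_rate: float
    loss: float


class RandomSearchAgent:
    """Stand-in 'brain': proposes a learning rate near the best one seen so far."""

    def __init__(self, seed: int = 0) -> None:
        self._rng = random.Random(seed)

    def propose(self, history) -> float:
        if not history:
            return 0.1  # arbitrary starting point
        best = min(history, key=lambda e: e.loss)
        return max(1e-5, best.learning_rate * self._rng.uniform(0.5, 1.5))


@dataclass
class Orchestrator:
    """Stand-in 'platform': runs what the agent proposes and keeps the history."""

    agent: RandomSearchAgent
    history: list = field(default_factory=list)

    def run_experiment(self, learning_rate: float) -> float:
        # Toy objective standing in for a training run; minimized at lr = 0.01.
        return (learning_rate - 0.01) ** 2

    def step(self) -> Experiment:
        lr = self.agent.propose(self.history)      # agent decides what to change
        loss = self.run_experiment(lr)             # orchestrator executes it
        exp = Experiment(f"lr={lr:.5f}", lr, loss)
        self.history.append(exp)                   # result is fed back next step
        return exp

    def run(self, n_steps: int) -> Experiment:
        for _ in range(n_steps):
            self.step()
        return min(self.history, key=lambda e: e.loss)


orchestrator = Orchestrator(agent=RandomSearchAgent(seed=0))
best = orchestrator.run(n_steps=20)
print(best.description, best.loss)
```

The point of the separation is that the orchestrator never needs to know how proposals are generated, so the same loop could in principle host any agent that exposes a `propose`-style interface.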

🚀 Quick Start

Clone the repository and navigate to its directory, then:

1. Install

pip install .

2. Initialize a Project

autoresearch init my_research --name "Optimizing-Transformer"
cd my_research

3. Run with an External Agent (e.g., Claude Code)

Configure your autoresearch.yaml:

agent:
  type: "external"
  command: "claude-code"

Then start the autonomous loop:

autoresearch run

Or run a single step as a tool:

autoresearch step --description "Trying Muon optimizer instead of AdamW"
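
As a rough illustration of what SQL-backed metadata for a step like this might look like, the sketch below records each step in SQLite. The table schema, the `record_step` and `best_experiment` helpers, the commit-hash strings, and the metric values are all assumptions made for this example; the framework's real schema is not documented here.

```python
import sqlite3
from datetime import datetime, timezone

# In-memory database for the example; a real project would use a file on disk.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE experiments (
        id INTEGER PRIMARY KEY AUTOINCREMENT,
        description TEXT NOT NULL,
        commit_hash TEXT,
        metric REAL,
        created_at TEXT NOT NULL
    )
    """
)


def record_step(conn, description, commit_hash, metric):
    """Insert one experiment row and return its id."""
    cur = conn.execute(
        "INSERT INTO experiments (description, commit_hash, metric, created_at) "
        "VALUES (?, ?, ?, ?)",
        (description, commit_hash, metric, datetime.now(timezone.utc).isoformat()),
    )
    conn.commit()
    return cur.lastrowid


def best_experiment(conn):
    """Return (description, metric) for the lowest metric, assuming lower is better."""
    return conn.execute(
        "SELECT description, metric FROM experiments ORDER BY metric ASC LIMIT 1"
    ).fetchone()


# Placeholder hashes and metric values, purely illustrative.
first_id = record_step(conn, "Baseline AdamW", "placeholder-hash-1", 3.21)
second_id = record_step(conn, "Trying Muon optimizer instead of AdamW", "placeholder-hash-2", 3.05)
best = best_experiment(conn)
```

Storing the commit hash next to the description and metric is one way to tie a result back to the exact code version that produced it.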

🛰️ Autopilot Mode (Pro)

For high-throughput research, AutoResearch supports a fully autonomous “Autopilot” mode. This allows the framework to automatically drive interactive CLI agents like OpenCode or Claude Code by feeding them prompts and auto-exiting sessions once the code is optimized.

1. Install Driver: pip install pexpect
2. Configure: Set autopilot: true in your autoresearch.yaml.
3. Deploy: Run autoresearch run, and the framework will handle N experiments completely unattended.
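
The general pattern of driving an interactive CLI (send a prompt, wait for the reply, exit when done) can be sketched as below. To stay dependency-free, this example uses standard-library `subprocess` pipes rather than pexpect, and the "agent" is a tiny fake child process that simply acknowledges each prompt. The `drive_agent` function, the `/exit` command, and the `done:` reply format are all assumptions; real tools such as Claude Code or OpenCode expect a terminal, which is the case pexpect's pseudo-terminal handling is designed for.

```python
import subprocess
import sys

# Stand-in for an interactive CLI agent: acknowledges each prompt, quits on "/exit".
FAKE_AGENT = r'''
import sys
for line in sys.stdin:
    line = line.strip()
    if line == "/exit":
        break
    print("done: " + line, flush=True)
'''


def drive_agent(prompts):
    """Feed prompts to the child one at a time, collect replies, then end the session."""
    proc = subprocess.Popen(
        [sys.executable, "-u", "-c", FAKE_AGENT],
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        text=True,
    )
    replies = []
    for prompt in prompts:
        proc.stdin.write(prompt + "\n")
        proc.stdin.flush()
        # Block until the agent answers this prompt before sending the next one.
        replies.append(proc.stdout.readline().strip())
    # Ask the agent to exit, then release the pipes and wait for it to finish.
    proc.stdin.write("/exit\n")
    proc.stdin.flush()
    proc.stdin.close()
    proc.stdout.close()
    return replies, proc.wait(timeout=10)


replies, exit_code = drive_agent(["tune learning rate", "try Muon optimizer"])
```

A production driver would also need timeouts on each read and some way to detect that the agent has actually finished its work, which this sketch leaves out.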

📊 Comparison: AutoResearch vs. Traditional AutoML

Feature	Traditional AutoML	AutoResearch
Scope	Hyperparameter tuning only	Full code & architecture modification
Agent Control	Fixed search space	AI decides what to change
Learning	Grid/Bayesian search	Semantic memory (RAG) of past results
Device Support	Varies by tool	Native MLX, JAX, CUDA, MPS
Integration	Limited to configs	Direct integration with Claude Code/GPT



Ajeet Kumar