AutoResearch: Autonomous ML Research Framework
Published:
π¬ AutoResearch Overview
AutoResearch is an agents and device-agnostic autonomous machine learning research framework that enables AI agents to independently conduct end-to-end experiments. Unlike traditional AutoML tools, AutoResearch provides a complete research loop where agents can modify code, run experiments, evaluate results, and learn from a persistent semantic memory.
π Key Features
- π€ Advanced Agent Integration: Built-in protocols for advanced agentic systems like Claude Code, OpenCode, and OpenClaw.
- π§ Semantic Research Memory: Long-term RAG-based memory using ChromaDB allows agents to learn from every past experiment.
- π Industrial Observability: Native integration with Weights & Biases (W&B) for real-time metric tracking and artifact management.
- β‘ High-Performance Backends: Native support for Apple MLX, JAX, and PyTorch (CUDA/MPS/CPU).
- π Distributed Scaling: Parallelize research across clusters using Ray.
- π‘οΈ Robust Orchestration: SQL-backed metadata management and automatic Git-based experiment versioning.
ποΈ Architecture
AutoResearch acts as the Research Platform (The Orchestrator) while the AI system acts as the Brain (The Agent).
π Quick Start
Clone and navigate to the directory then:
1. Install
pip install .
2. Initialize a Project
autoresearch init my_research --name "Optimizing-Transformer"
cd my_research
3. Run with an External Agent (e.g., Claude Code) Configure your autoresearch.yaml:
agent:
type: "external"
command: "claude-code"
Then start the autonomous loop:
autoresearch run
π°οΈ Autopilot Mode (Pro)
For high-throughput research, AutoResearch supports a fully autonomous βAutopilotβ mode. This allows the framework to automatically drive interactive CLI agents like OpenCode or Claude Code by feeding them prompts and auto-exiting sessions once the code is optimized.
π Comparison: AutoResearch vs. Traditional AutoML
| Feature | Traditional AutoML | AutoResearch |
|---|---|---|
| Scope | Hyperparameter tuning only | Full code & architecture modification |
| Agent Control | Fixed search space | AI decides what to change |
| Learning | Grid/Bayesian search | Semantic memory (RAG) of past results |
| Device Support | Varies by tool | Native MLX, JAX, CUDA, MPS |
| Integration | Limited to configs | Direct integration with Claude Code/GPT |
π₯ Team Roster for ajeetkbhardwaj/automlresearch
- π Team Leader:
ajeetkbhardwaj (Active Commits)
- π¨βπ» Team Members:
Open-Source Contributors (Active Commits)
π Weekly Plan & Updates
π Team Leader Update (Ajeet Kumar)
- Solved: Configured
autopilot: truein theautoresearch.yamlfile so the framework can automatically drive the agents unattended. - Completed: Implemented the long-term semantic memory (RAG) using ChromaDB, allowing the agents to reference previous runs.
- Next Steps: Optimize the SQL-backed metadata management for faster Git-based experiment versioning.
π¨βπ» Team Member Updates
- Solved: Fixed the Weights & Biases (W&B) integration for real-time metric tracking.
- Working on: Parallelizing the research environments across clusters using Ray.
