con un clic
autoresearch
// Self-improving research loops with hypothesis generation, experiment design, and result analysis
// Self-improving research loops with hypothesis generation, experiment design, and result analysis
Full-stack research and code generation pipeline - research, code, and create
Natural language web UI control — element detection, targeted interaction, and automated form filling
Public opinion analysis and sentiment at scale — sentiment scoring, stance detection, and multi-dimensional bias measurement.
Meta-specialist that auto-discovers and scaffolds new specialists from trending GitHub repos
Safe multi-language code execution via alibaba/OpenSandbox
Multi-agent financial analysis pipeline: fundamental analysis, technical indicators, and news sentiment scoring for any ticker symbol. Wraps patterns from virattt/ai-hedge-fund and ZhuLinsen/daily_stock_analysis.
| name | autoresearch |
| display_name | Autoresearch Specialist |
| description | Self-improving research loops with hypothesis generation, experiment design, and result analysis |
| version | 0.1.0 |
| source_repo | karpathy/autoresearch |
| license | MIT |
| tier | core |
| capabilities | ["research","hypothesis_generation","experiment_design","result_analysis"] |
| allowed_tools | ["run_experiment","analyze_results","generate_hypothesis"] |
| output_formats | ["python_api","cli","mcp_server","agent_skill","rest_api"] |
Wraps the karpathy/autoresearch patterns to provide self-improving research loops inside OSS Agent Lab. Given a topic or question, the specialist generates falsifiable hypotheses, runs simulated experiments, and produces an evidence-backed analysis with confidence scores and next-step recommendations — ready to iterate.
| Tool | Description | Side Effects |
|---|---|---|
generate_hypothesis | Generates structured hypotheses for a topic | None |
run_experiment | Simulates running an experiment for a hypothesis | None (v1 is local; future: network) |
analyze_results | Analyzes experiment findings; returns insights and confidence score | None |
from agents.specialists.autoresearch.agent import AutoresearchSpecialist
from oss_agent_lab.contracts import Intent, Query, SpecialistRequest
specialist = AutoresearchSpecialist()
request = SpecialistRequest(
intent=Intent(action="research", domain="science", confidence=0.9),
query=Query(user_input="effects of sleep deprivation on cognitive performance"),
specialist_name="autoresearch",
)
result = await specialist.execute(request)
print(result.result["analysis"]["summary"])
oss-lab run autoresearch "effects of sleep deprivation on cognitive performance"
request = SpecialistRequest(
intent=Intent(
action="research",
domain="science",
confidence=0.9,
parameters={"method": "simulation"},
),
query=Query(user_input="quantum error correction thresholds"),
specialist_name="autoresearch",
)
{
"topic": "...",
"hypotheses": [
{"id": "h1", "text": "...", "confidence": 0.75, "rationale": "...", "testable": true}
],
"recommended_hypothesis": "h1",
"experiment": {"id": "abc12345", "method": "literature_review", "status": "completed"},
"analysis": {
"summary": "...",
"key_insights": ["..."],
"confidence_score": 0.575,
"next_steps": ["..."]
}
}
Wraps karpathy/autoresearch.