Agent - ICRL

Constructor

Agent(
    llm: LLMProvider,
    db_path: str,
    plan_prompt: str,
    reason_prompt: str,
    act_prompt: str,
    k: int = 3,
    max_steps: int = 30,
    seed_trajectories: list[Trajectory] | None = None,
    on_step: Callable[[Step, StepContext], None] | None = None,
    curation_threshold: float = 0.3,
    curation_min_retrievals: int = 5,
    verify_trajectory: Callable[[Trajectory], bool] | None = None,
)

Core Methods

await agent.train(env, goal)
await agent.run(env, goal)
agent.train_sync(env, goal)
agent.run_sync(env, goal)
await agent.train_batch(env_factory, goals)
await agent.run_batch(env_factory, goals)
agent.get_stats()

Notes

train stores successful trajectories (subject to verify_trajectory).
run never writes new trajectories.
database property exposes the underlying TrajectoryDatabase.

Stats Shape

get_stats() returns:

total_trajectories
successful_trajectories
success_rate

API Overview

Trajectory

​Constructor

​Core Methods

​Notes

​Stats Shape

Constructor

Core Methods

Notes

Stats Shape