Objective: Implement a stateless reflex agent and compare its performance to the greedy goal-based agent from the concluding exercise.
Background: The greedy agent from Section 1.6.1: Concluding Design Exercise: Robotic Warehouse Agent maintains internal state (recent positions for loop detection). A simple reflex agent has no memory—it chooses actions based only on the current percept, using condition-action rules.
Task:
warehouse_agent_reflex.py) with the following rules:
Copilot Coaching:
Deliverables:
warehouse_agent_reflex.py: reflex agent implementationcompare_agents.py: multi-episode runner and statistical analysisObjective: Establish a random agent baseline and map out the performance spectrum from random to greedy behavior.
Background: A random agent selects valid actions uniformly at random. This provides a lower bound on performance—any intelligent agent should outperform random behavior. By comparing multiple agents, we can quantify the value of different design choices.
Task:
warehouse_agent_random.py):
[N, S, E, W, PICK, DROP]warehouse_agent_weighted.py):
Copilot Coaching:
Deliverables:
warehouse_agent_random.py: pure random agentwarehouse_agent_weighted.py: biased random agentperformance_spectrum.py: multi-agent comparison scriptspectrum_report.pdf: 2-page report with performance spectrum plot and discussion of how "intelligence" emerges from random to greedy behaviorObjective: Design, implement, and evaluate a novel agent that outperforms the greedy baseline of Section 1.6.1: Concluding Design Exercise: Robotic Warehouse Agent.
Background: The greedy Manhattan agent often gets stuck or makes suboptimal choices. Your task is to design a better agent using principles from Chapter 1, leveraging Copilot to accelerate implementation.
Task:
warehouse_agent_custom.pyCopilot Coaching:
Deliverables:
agent_design.pdf: 1-page design document (written before coding)warehouse_agent_custom.py: your novel agent implementationcopilot_log.md: brief log showing (1) key prompts used, (2) one example of improving Copilot's outputevaluation_dashboard.py: multi-layout evaluation scriptfinal_report.pdf: 4-5 page report with performance dashboard, comparison to baselines, discussion of design choices, and reflection on using Copilot as a coding assistantGrading Rubric: