• Keine Ergebnisse gefunden

Foundations of Artificial Intelligence 3. Introduction: Rational Agents Malte Helmert

N/A
N/A
Protected

Academic year: 2022

Aktie "Foundations of Artificial Intelligence 3. Introduction: Rational Agents Malte Helmert"

Copied!
20
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

3. Introduction: Rational Agents

Malte Helmert

University of Basel

March 3, 2021

(2)

Introduction: Overview

Chapter overview: introduction 1. What is Artificial Intelligence?

2. AI Past and Present 3. Rational Agents

4. Environments and Problem Solving Methods

(3)

Agents

(4)

Heterogeneous Application Areas

AI systems are used for verydifferent tasks:

controlling manufacturing plants detecting spam emails

intra-logistic systems in warehouses giving shopping advice on the Internet playing board games

finding faults in logic circuits . . .

How do we capture this diversity in asystematic framework emphasizingcommonalitiesand differences?

common metaphor: rational agents and their environments German: rationale Agenten, Umgebungen

(5)

Heterogeneous Application Areas

AI systems are used for verydifferent tasks:

controlling manufacturing plants detecting spam emails

intra-logistic systems in warehouses giving shopping advice on the Internet playing board games

finding faults in logic circuits . . .

How do we capture this diversity in asystematic framework emphasizingcommonalitiesand differences?

common metaphor: rational agents and their environments German: rationale Agenten, Umgebungen

(6)

Agents

? agent percepts

sensors

actions environment

actuators

Agents

agent functions map sequences of observationsto actions:

f :P+ → A

agent program: runs on physicalarchitecture and computesf Examples: human, robot, web crawler, thermostat, OS scheduler German: Agenten, Agentenfunktion, Wahrnehmung, Aktion

(7)

Introducing: an Agent

(8)

Vacuum Domain

A B

observations: location and cleanness of current room:

ha,cleani,ha,dirtyi,hb,cleani,hb,dirtyi actions: left, right, suck, wait

(9)

Vacuum Agent

a possible agent function:

observation sequence action

ha,cleani right

ha,dirtyi suck

hb,cleani left

hb,dirtyi suck

ha,cleani,hb,cleani left ha,cleani,hb,dirtyi suck

. . . . . .

(10)

Reflexive Agents

Reflexiveagents compute next action only based on last observationin sequence:

very simple model very restricted

corresponds to Mealy automaton (a kind of DFA) with only 1 state

practical examples?

German: reflexiver Agent

Example (A Reflexive Vacuum Agent) def reflex-vacuum-agent(location,status):

if status= dirty: returnsuck else if location = a: returnright else if location = b: returnleft

(11)

Evaluating Agent Functions

What is theright agent function?

(12)

Rationality

(13)

Rationality

Rational Behavior

Evaluate behavior of agents withperformance measure (related terms: utility,cost).

perfect rationality:

always select an action maximizing expected value of future performance

given available information (observations so far)

German: Performance-Mass, Nutzen, Kosten, perfekte Rationalit¨at

(14)

Is Our Agent Perfectly Rational?

Question: Is the reflexive vacuum agent of the example perfectly rational?

depends on performance measure and environment!

Do actions reliably have the desired effect?

Do we know the initial situation?

Can new dirt be produced while the agent is acting?

(15)

Is Our Agent Perfectly Rational?

Question: Is the reflexive vacuum agent of the example perfectly rational?

depends on performance measure and environment!

Do actions reliably have the desired effect?

Do we know the initial situation?

Can new dirt be produced while the agent is acting?

(16)

Rational Vacuum Agent

Example (Vacuum Agent) performance measure:

+100 units for each cleaned cell

−10 units for eachsuck action

−1 units for eachleft/right action environment:

actions and observations reliable

world only changes through actions of the agent all initial situations equally probable

How should a perfect agent behave?

(17)

Rationality: Discussion

perfect rationality 6= omniscience

incomplete information (due to limited observations) reduces achievable utility

perfect rationality 6= perfect prediction of future uncertain behavior of environment (e.g., stochastic action effects) reduces achievable utility

perfect rationality is rarely achievable

limited computational power bounded rationality German: begrenzte Rationalit¨at

(18)

Summary

(19)

Summary (1)

common metaphor for AI systems: rational agents agentinteracts withenvironment:

sensors perceive observationsabout state of the environment actuators perform actionsmodifying the environment

formally: agent functionmaps observation sequences to actions

reflexive agent: agent function only based on last observation

(20)

Summary (2)

rationalagents:

try to maximizeperformance measure (utility)

perfect rationality: achieve maximal utility in expectation given available information

for “interesting” problems rarely achievable bounded rationality

Referenzen

ÄHNLICHE DOKUMENTE

“[the automation of] activities that we associate with hu- man thinking, activities such as decision-making, problem solving, learning” (Bellman, 1978).. “the study of mental

1950s/1960s: beginnings of AI; early enthusiasm 1970s: micro worlds and knowledge-based systems. 1980s: gold rush of expert systems followed by

breadth-first search ( this chapter) uniform cost search ( Chapter 11) depth-first search ( Chapter 12) depth-limited search ( Chapter 12) iterative deepening search ( Chapter

breadth-first search optimal if all action costs equal otherwise no optimality guarantee example:.. remedy: uniform

heuristics estimate distance of a state to the goal can be used to focus search on promising states soon: search algorithms that

Missionaries and Cannibals people on wrong river

basic idea of abstraction heuristics: estimate solution cost by considering a smaller planning task. formally: abstraction function α maps states to abstract states and thus

Hence the objective of our landmark heuristics is to approximate the optimal delete relaxed heuristic h + as accurately as possible.. More advanced landmark techniques work directly