Informationsansvarig: Gösta H. Granlund, webmaster@isy.liu.se
Sidan uppdaterades senast: 2010-08-21
LiU » ISY » Personliga sidor »
Datorseende »
gosta »
Gösta H. Granlund
The first part should use the incoming image information to produce a description of the image, where different objects are recognized and assigned to the proper categories, together with information about position and other relevant parameters. A second unit would now use this information to produce actions into the physical world, e.g. to implement a robot.
This structure has not worked out very well for reasons which will become apparent. It turns out that in fact the order between the parts shall be the opposite. See figure below.
The first part of the system is a reactive percept-to-action mapper. After this follows, if necessary for the application, a part which performs a symbolic processing for categorization and reasoning, in brief what we refer to as AI, and communication. For completeness, we will give a slightly more detailed structure diagram, around which the further discussion will center.
The distinctive characteristic of this structure is that percepts are mapped directly onto actions, rather than descriptions, as it was in the first figure. The reason is that this strategy allows the system to learn objects and other aspects of the environment by itself. Descriptions, of which assignment to category is one example, are generated in the symbolic part of the structure, for communication to other systems or for use in symbolic reasoning.
For a detailed discussion of these issues, reference is made to ECVision Research Background Document, and articles [Granlund99a] and [Granlund99b], but a few major points will be listed here:
The preceding is exactly the strategy which has very successfully been employed by biological systems.
The workings of the symbolic structure and its different implementations, such as for planning, language, communication, etc., is well documented in literature and will be omitted in this exposition. What is important to observe is that the spatial/cognitive domain and the symbolic/language domain are two different worlds, where different rules and methods apply.