Искусственный интеллект
Информатика и вычислительная техника
  • формат pdf
  • размер 11.72 МБ
  • добавлен 12 ноября 2011 г.
Weber C., Elshaw M., Mayer N.M. (eds.) Reinforcement Learning. Theory and Applications
Издательство InTech, 2011, -434 pp.

Brains rule the world, and brain-like computation is increasingly used in computers and electronic devices. Brain-like computation is about processing and interpreting data or directly putting forward and performing actions. Leaing is a very important aspect. This book is on reinforcement leaing which involves performing actions to achieve a goal. Two other leaing paradigms exist. Supervised leaing has initially been successful in prediction and classification tasks, but is not brain-like. Unsupervised leaing is about understanding the world by passively mapping or clustering given data according to some order principles, and is associated with the cortex in the brain. In reinforcement leaing an agent leas by trial and error to perform an action to receive a reward, thereby yielding a powerful method to develop goal-directed action strategies. It is predominately associated with the basal ganglia in the brain.
The first 11 chapters of this book, Theory, describe and extend the scope of reinforcement leaing. The remaining 11 chapters, Applications, show that there is already wide usage in numerous fields. Reinforcement leaing can tackle control tasks that are too complex for traditional, hand-designed, non-leaing controllers. As leaing computers can deal with technical complexities, the tasks of human operators remain to specify goals on increasingly higher levels.
This book shows that reinforcement leaing is a very dynamic area in terms of theory and applications and it shall stimulate and encourage new research in this field. We would like to thank all contributors to this book for their research and effort.
Summary of Theory:
Chapters 1 and 2 create a link to supervised and unsupervised leaing, respectively, by regarding reinforcement leaing as a prediction problem, and chapter 3 looks at fuzzycontrol with a reinforcement-based genetic algorithm. Reinforcement algorithms are modified in chapter 4 for future parallel and quantum computing, and in chapter 5 for a more general class of state-action spaces, described by grammars. Then follow biological views; in chapter 6 how reinforcement leaing occurs on a single neuron level by considering the interaction between a spatio-temporal leaing rule and Hebbian leaing, and in a global brain view of chapter 7, unsupervised leaing is depicted as a means of data pre-processing and arrangement for reinforcement algorithms. A table presents a ready-to-implement description of standard reinforcement leaing algorithms. The following chapters consider multi agent systems where a single agent has only partial view of the entire system. Multiple agents can work cooperatively on a common goal, as considered in chapter 8, or rewards can be individual but interdependent, such as in game play, as considered in chapters 9, 10 and 11.
Summary of Applications:
Chapter 12 continues with game applications where a robot cup middle size league robot leas a strategic soccer move. A dialogue manager for man-machine dialogues in chapter 13 interacts with humans by communication and database queries, dependent on interaction strategies that gove the Markov decision processes. Chapters 14, 15, 16 and 17 tackle control problems that may be typical for classical methods of control like PID controllers and hand-set rules. However, traditional methods fail if the systems are too complex, timevarying, if knowledge of the state is imprecise, or if there are multiple objectives. These chapters report examples of computer applications that are tackled only with reinforcement leaing such as water allocation improvement, building environmental control, chemical processing and industrial process control. The reinforcement-controlled systems may continue leaing during operation. The next three chapters involve path optimization. In chapter 18, inteet routers explore different links to find more optimal routes to a destination address. Chapter 19 deals with optimizing a travel sequence w.r.t. both time and distance. Chapter 20 proposes an untypical application of path optimization: a path from a given patte to a target patte provides a distance measure. An unclassified medical image can thereby be classified dependent on whether a path from it is shorter to an image of healthy or unhealthy tissue, specifically considering lung nodules classification using 3D geometric measures extracted from the lung lesions Computerized Tomography (CT) images. Chapter 21 presents a physicians' decision support system for diagnosis and treatment, involving a knowledgebase server. In chapter 22 a reinforcement leaing sub-module improves the efficiency for the exchange of messages in a decision support system in air traffic management.

Neural Forecasting Systems
Reinforcement leaing in system identification
Reinforcement Evolutionary Leaing for Neuro-Fuzzy Controller Design
Superposition-Inspired Reinforcement Leaing and Quantum Reinforcement Leaing
An Extension of Finite-state Markov Decision Process and an Application of Grammatical Inference
Interaction between the Spatio-Temporal Leaing Rule (non Hebbian) and Hebbian in Single Cells: A cellular mechanism of reinforcement leaing
Reinforcement Leaing Embedded in Brains and Robots
Decentralized Reinforcement Leaing for the Online Optimization of Distributed System
Multi-Automata Leaing
Abstraction for Genetics-based Reinforcement Leaing
Dynamics of the Bush-Mosteller leaing algorithm in 2x2 games
Modular Leaing Systems for Behavior Acquisition in Multi-Agent Environment
Optimising Spoken Dialogue Strategies within the Reinforcement Leaing Paradigm
Water Allocation Improvement in River Basin Using Adaptive Neural Fuzzy Reinforcement Leaing Approach
Reinforcement Leaing for Building Environmental Control
Model-Free Leaing Control of Chemical Processes
Reinforcement Leaing-Based Supervisory Control Strategy for a Rotary Kiln Process
Inductive Approaches based on Trial/Error Paradigm for Communications Network
The Allocation of Time and Location Information to Activity-Travel Sequence Data by means of Reinforcement Leaing
Application on Reinforcement Leaing for Diagnosis based on Medical Image
RL based Decision Support System for u-Healthcare Environment
Reinforcement Leaing to Support Meta-Level Control in Air Traffic Management
Похожие разделы
Смотрите также

Alpaydin E. Introduction to Machine Learning

  • формат pdf
  • размер 2.87 МБ
  • добавлен 05 октября 2011 г.
Издательство MIT Press, 2010, -581 pp. Machine learning is programming computers to optimize a performance criterion using example data or past experience. We need learning in cases where we cannot directly write a computer program to solve a given problem, but need example data or experience. One case where learning is necessary is when human expertise does not exist, or when humans are unable to explain their expertise. Consider the recognitio...

Er M.J., Zhou Y. (eds.) Theory and Novel Applications of Machine Learning

  • формат pdf
  • размер 6.75 МБ
  • добавлен 12 ноября 2011 г.
Издательство InTech, 2009, -386 pp. Even since computers were invented many decades ago, many researchers have been trying to understand how human beings learn and many interesting paradigms and approaches towards emulating human learning abilities have been proposed. The ability of learning is one of the central features of human intelligence, which makes it an important ingredient in both traditional Artificial Intelligence (AI) and emerging...

Mellouk A. (ed.) Advances in Reinforcement Learning

  • формат pdf
  • размер 12.29 МБ
  • добавлен 26 октября 2011 г.
Издательство InTech, 2011, -482 pp. Reinforcement Learning (RL) is oft en referred to as a branch of artificial intelligence and has been one of the central topics in a broad range of scientific fields for the last two decades. Understanding of RL is expected to provide a systematic understanding of adaptive behaviors, including simple classical and operant conditioning of animals as well as all complex social and economical human behaviors that...

Mellouk A., Chebira A. (eds.) Machine Learning

  • формат pdf
  • размер 8.06 МБ
  • добавлен 12 ноября 2011 г.
Издательство InTech, 2009, -430 pp. Machine Learning is often referred to as a branch of artificial intelligence which deals with the design and the development of algorithms and techniques that help machines to learn. Hence, it is closely related to various scientific domains as Optimization, Vision, Robotic and Control, Theoretical Computer Science, etc. Based on this, Machine Learning can be defined in various ways related to a scientific do...

Mitchell Т. Machine learning

  • формат pdf
  • размер 17.24 МБ
  • добавлен 05 марта 2011 г.
This book covers the field of machine learning, which is the study of algorithms that allow computer programs to automatically improve through experience. The book is intended to support upper level undergraduate and introductory level graduate courses in machine learning. 1997, р. 414. The field of machine learning is concerned with the question of how to construct computer programs that automatically improve with experience. In recent years m...

Wang C., Hill D.J. Deterministic Learning Theory for Identification, Recognition and Control

  • формат pdf
  • размер 10.94 МБ
  • добавлен 29 ноября 2011 г.
Издательство CRC Press, 2010, -218 pp. The problem of learning in dynamic environments is important and challenging. In the 1960s, learning from control of dynamical systems was studied extensively. At that time, learning was similar in meaning to other terms such as adaptation and self-organizing. Since the 1970s, learning theory has become a research discipline in the context of machine learning, and more recently as computational or statistic...

Zhang Y. (ed.) Machine Learning

  • формат pdf
  • размер 14.6 МБ
  • добавлен 12 ноября 2011 г.
Издательство InTech, 2010, -446 pp. The goal of this book is to present the key algorithms, theory and applications that from the core of machine learning. Learning is a fundamental activity. It is the process of constructing a model from complex world. And it is also the prerequisite for the performance of any new activity and, later, for the improvement in this performance. Machine learning is concerned with constructing computer programs tha...

Zhang Y. (ed.) New Advances in Machine Learning

  • формат pdf
  • размер 16.95 МБ
  • добавлен 12 ноября 2011 г.
Издательство InTech, 2010, -374 pp. The purpose of this book is to provide an up-to-data and systematical introduction to the principles and algorithms of machine learning. The definition of learning is broad enough to include most tasks that we commonly call Learning tasks, as we use the word in daily life. It is also broad enough to encompass computer that improve from experience in quite straight forward ways. Machine learning addresses the...