Welcome, visitor! [ Login

 

L.-J. Lin, “Reinforcement Learning for Robots Using Neural Networks,” PhD thesis, Carnegie Mellon Univer-sity, Pittsburgh, 1993.

  • Listed: 8 May 2026 0 h 46 min

Description

L.-J. Lin, “Reinforcement Learning for Robots Using Neural Networks,” PhD thesis, Carnegie Mellon Univer-sity, Pittsburgh, 1993.

**”Reinforcement Learning for Robots Using Neural Networks”**

In the realm of artificial intelligence (AI) and robotics, the field of reinforcement learning (RL) has emerged as a groundbreaking technique for enabling robots to learn and improve their behavior through trial and error. One of the pioneers in this field is L.-J. Lin, who in his 1993 PhD thesis, “Reinforcement Learning for Robots Using Neural Networks,” proposed the use of neural networks to facilitate reinforcement learning for robots.

Reinforcement learning is a type of machine learning that involves training an agent to take actions in an environment to maximize a reward signal. This type of learning is particularly well-suited for robotic systems, which often require adapting to new environments and learning from their experiences. In the context of RL, a neural network acts as a function approximator, learning to predict the expected return of a particular action based on the current state of the environment.

Lin’s research demonstrated the feasibility of using neural networks to implement reinforcement learning for robots. He proposed the “TD-Gammon” algorithm, a type of temporal difference (TD) learning method that uses a neural network to estimate the value function of a state-action pair. The algorithm learns to approximate the value function by iteratively adjusting the weights of the neural network to minimize the temporal difference error.

The use of neural networks in reinforcement learning has several advantages. Firstly, neural networks can learn complex and non-linear relationships between the state and action spaces of a robotic system. Secondly, they can handle large and complex state spaces with ease, making them particularly well-suited for robotic systems. Finally, neural networks can learn to adapt to changing environments and unexpected events, which is essential for robots that operate in dynamic and uncertain settings.

Today, neural network-based reinforcement learning is used in a wide range of robotic applications, including robotic arms, autonomous vehicles, and humanoid robots. Researchers and developers are actively exploring the use of neural networks in RL to improve the performance, efficiency, and robustness of robotic systems.

While much progress has been made in this field, there is still much to be discovered. Future research directions include the development of more efficient and scalable algorithms, the exploration of new neural network architectures, and the integration of reinforcement learning with other AI techniques such as computer vision and natural language processing. As the field continues to evolve, we can expect to see more sophisticated and capable robots that can learn and adapt to complex and dynamic environments using the principles outlined by L.-J. Lin in his seminal PhD thesis.

**Keywords:** Reinforcement learning, neural networks, robots, artificial intelligence, machine learning, TD-Gammon algorithm, temporal difference learning.

This article aims to provide an in-depth look into the concept of reinforcement learning for robots using neural networks. By exploring the history and principles of this field, it is hoped that readers will gain a better understanding of the potential applications and future directions of this exciting area of research.

No Tags

24 total views, 3 today

  

Listing ID: N/A

Report problem

Processing your request, Please wait....

Sponsored Links

 

Mok E. and Xia L. (2005) Strategies for Geolocation Optimization in Urban R...

Mok E. and Xia L. (2005) Strategies for Geolocation Optimization in Urban Regions, in: Papers presented at the 2005 International Symposium on GPS/GNSS, December 8-10, […]

No views yet

 

Mok E. and Lau L. (2001) GPS Vehicle Location Tracking in Dense High-Rise E...

Mok E. and Lau L. (2001) GPS Vehicle Location Tracking in Dense High-Rise Environments with the Minimum Range ERror Algorithm (MRERA), in: Papers presented at […]

1 total views, 1 today

 

Hohenschuh F. (2004) Prototyping eines mobilen Navigationssystems für die S...

Hohenschuh F. (2004) Prototyping eines mobilen Navigationssystems für die Stadt Hamburg, Diploma thesis, Department Informatics, University Hamburg, Germany. Okay, so the user wants a blog […]

1 total views, 1 today

 

Grejner-Brzezinska D., Toth C. and Moafipoor S. (2007) Pedestrian Tracking ...

Grejner-Brzezinska D., Toth C. and Moafipoor S. (2007) Pedestrian Tracking and Navigation Using an Adaptive Knowledge System Based on Neural Networks, Journal of Applied Geodesy, […]

2 total views, 2 today

 

Finkenzeller K. (2002) RFID Handbook: Fundamentals and Application in Conta...

Finkenzeller K. (2002) RFID Handbook: Fundamentals and Application in Contactless Smart Cards and Identification, Carl Hanser Verlag, Munich, Germany. Okay, the user wants a blog […]

1 total views, 1 today

 

Abdel-Hamid W., Abdelazim T., El-Sheimy N. and Lachapelle G. (2006) Improve...

Abdel-Hamid W., Abdelazim T., El-Sheimy N. and Lachapelle G. (2006) Improvement of MEMS-IMU/GPS Performance Using Fuzzy Modeling, GPS Solutions, No. 10/06, pp. 1-11. Okay, the […]

1 total views, 1 today

 

Vollath, U., K. Sauer (2004) FAMCAR Approach for Efficient Multi-Carrier Am...

Vollath, U., K. Sauer (2004) FAMCAR Approach for Efficient Multi-Carrier Ambiguity Estimation, Proceedings of ENC-GNSS 2004, May 2004, Rotterdam, Netherlands “Vollath, U., K. Sauer (2004) […]

1 total views, 1 today

 

Vollath, U., Brockmann, E., Chen, X. (2003) Troposphere: Signal or Noise?, ...

Vollath, U., Brockmann, E., Chen, X. (2003) Troposphere: Signal or Noise?, Proceedings of ION-GPS/GNSS 2003, Sept. 2003, pp. 1709-1717 Okay, I need to create a […]

1 total views, 1 today

 

Vollath, U., Deking, A., Landau, H., Pagels, C. (2001) Long Range RTK Posit...

Vollath, U., Deking, A., Landau, H., Pagels, C. (2001) Long Range RTK Positioning using Virtual Reference Stations, Proceedings of the International Symposium on Kinematic Systems […]

2 total views, 2 today

 

Vollath, U., Deking, A., Landau, H., Pagels, C., Wagner, B. (2000) Multi-Ba...

Vollath, U., Deking, A., Landau, H., Pagels, C., Wagner, B. (2000) Multi-Base RTK Positioning using Virtual Reference Stations, Proceedings of ION-GPS 2000, Sept. 2000, Salt […]

1 total views, 1 today

 

Mok E. and Xia L. (2005) Strategies for Geolocation Optimization in Urban R...

Mok E. and Xia L. (2005) Strategies for Geolocation Optimization in Urban Regions, in: Papers presented at the 2005 International Symposium on GPS/GNSS, December 8-10, […]

No views yet

 

Mok E. and Lau L. (2001) GPS Vehicle Location Tracking in Dense High-Rise E...

Mok E. and Lau L. (2001) GPS Vehicle Location Tracking in Dense High-Rise Environments with the Minimum Range ERror Algorithm (MRERA), in: Papers presented at […]

1 total views, 1 today

 

Hohenschuh F. (2004) Prototyping eines mobilen Navigationssystems für die S...

Hohenschuh F. (2004) Prototyping eines mobilen Navigationssystems für die Stadt Hamburg, Diploma thesis, Department Informatics, University Hamburg, Germany. Okay, so the user wants a blog […]

1 total views, 1 today

 

Grejner-Brzezinska D., Toth C. and Moafipoor S. (2007) Pedestrian Tracking ...

Grejner-Brzezinska D., Toth C. and Moafipoor S. (2007) Pedestrian Tracking and Navigation Using an Adaptive Knowledge System Based on Neural Networks, Journal of Applied Geodesy, […]

2 total views, 2 today

 

Finkenzeller K. (2002) RFID Handbook: Fundamentals and Application in Conta...

Finkenzeller K. (2002) RFID Handbook: Fundamentals and Application in Contactless Smart Cards and Identification, Carl Hanser Verlag, Munich, Germany. Okay, the user wants a blog […]

1 total views, 1 today

 

Abdel-Hamid W., Abdelazim T., El-Sheimy N. and Lachapelle G. (2006) Improve...

Abdel-Hamid W., Abdelazim T., El-Sheimy N. and Lachapelle G. (2006) Improvement of MEMS-IMU/GPS Performance Using Fuzzy Modeling, GPS Solutions, No. 10/06, pp. 1-11. Okay, the […]

1 total views, 1 today

 

Vollath, U., K. Sauer (2004) FAMCAR Approach for Efficient Multi-Carrier Am...

Vollath, U., K. Sauer (2004) FAMCAR Approach for Efficient Multi-Carrier Ambiguity Estimation, Proceedings of ENC-GNSS 2004, May 2004, Rotterdam, Netherlands “Vollath, U., K. Sauer (2004) […]

1 total views, 1 today

 

Vollath, U., Brockmann, E., Chen, X. (2003) Troposphere: Signal or Noise?, ...

Vollath, U., Brockmann, E., Chen, X. (2003) Troposphere: Signal or Noise?, Proceedings of ION-GPS/GNSS 2003, Sept. 2003, pp. 1709-1717 Okay, I need to create a […]

1 total views, 1 today

 

Vollath, U., Deking, A., Landau, H., Pagels, C. (2001) Long Range RTK Posit...

Vollath, U., Deking, A., Landau, H., Pagels, C. (2001) Long Range RTK Positioning using Virtual Reference Stations, Proceedings of the International Symposium on Kinematic Systems […]

2 total views, 2 today

 

Vollath, U., Deking, A., Landau, H., Pagels, C., Wagner, B. (2000) Multi-Ba...

Vollath, U., Deking, A., Landau, H., Pagels, C., Wagner, B. (2000) Multi-Base RTK Positioning using Virtual Reference Stations, Proceedings of ION-GPS 2000, Sept. 2000, Salt […]

1 total views, 1 today