L.-J. Lin, “Reinforcement Learning for Robots Using Neural Networks,” PhD thesis, Carnegie Mellon University, Pittsburgh, 1993.

Listed: 8 May 2026 0 h 46 min

Description

**“Reinforcement Learning for Robots Using Neural Networks”**

In artificial intelligence (AI) and robotics, reinforcement learning (RL) has become an important technique for enabling robots to learn and improve their behavior through trial and error. One of the early contributors to this field is L.-J. Lin, whose 1993 PhD thesis, “Reinforcement Learning for Robots Using Neural Networks,” studied the use of neural networks to support reinforcement learning for robots.

Reinforcement learning is a type of machine learning in which an agent is trained to take actions in an environment so as to maximize a reward signal. This type of learning is well-suited to robotic systems, which often must adapt to new environments and learn from their own experience. In the context of RL, a neural network acts as a function approximator, learning to predict the expected return of a particular action based on the current state of the environment.
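To make the idea of a function approximator concrete, here is a minimal sketch of a small network that maps a state vector to one estimated return per action. The class name `TinyQNetwork`, the layer sizes, the tanh hidden layer, and the random initial weights are all assumptions chosen for illustration; this is not the architecture used in Lin's thesis.

```python
import math
import random


class TinyQNetwork:
    """Hypothetical one-hidden-layer network: state -> one value per action."""

    def __init__(self, n_inputs, n_hidden, n_actions, seed=0):
        rng = random.Random(seed)  # fixed seed so the sketch is reproducible
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                   for _ in range(n_hidden)]
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
                   for _ in range(n_actions)]

    def q_values(self, state):
        # Hidden layer: tanh of a weighted sum of the state features.
        hidden = [math.tanh(sum(w * x for w, x in zip(row, state)))
                  for row in self.w1]
        # Output layer: one linear unit per action.
        return [sum(w * h for w, h in zip(row, hidden)) for row in self.w2]

    def greedy_action(self, state):
        # Index of the action with the highest estimated return.
        q = self.q_values(state)
        return max(range(len(q)), key=q.__getitem__)


net = TinyQNetwork(n_inputs=3, n_hidden=4, n_actions=2)
state = [0.2, -0.1, 1.0]  # made-up sensor readings
print(net.q_values(state), net.greedy_action(state))
```

The weights here are untrained, so the printed values carry no meaning yet; learning consists of adjusting `w1` and `w2` so that the outputs match the returns the robot actually experiences.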

Lin’s research demonstrated the feasibility of using neural networks to implement reinforcement learning for robots. The thesis builds on temporal difference (TD) learning methods such as Q-learning, in which a neural network estimates the value of a state-action pair, and it studies techniques such as experience replay, where stored past experiences are reused for additional training. The network learns to approximate the value function by iteratively adjusting its weights to reduce the temporal difference error.
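The weight update driven by the temporal difference error can be sketched on a toy problem. The two-state chain (`A` then `B` then termination), the one-hot features, and the constants `GAMMA` and `ALPHA` below are all assumptions made for illustration, and a linear approximator stands in for the neural network to keep the example short. With only one action per state, the state-action value reduces to a state value.

```python
GAMMA = 0.9   # discount factor (assumed)
ALPHA = 0.1   # learning rate (assumed)

# One-hot features for a hypothetical two-state chain: A -> B -> end.
FEATURES = {"A": [1.0, 0.0], "B": [0.0, 1.0]}
# (state, reward, next_state); next_state None marks the end of the episode.
EPISODE = [("A", 0.0, "B"), ("B", 1.0, None)]


def value(weights, state):
    """Linear value estimate: dot product of weights and state features."""
    return sum(w * x for w, x in zip(weights, FEATURES[state]))


def td_step(weights, state, reward, next_state):
    """One TD(0) update; returns the new weights and the TD error."""
    bootstrap = GAMMA * value(weights, next_state) if next_state is not None else 0.0
    td_error = reward + bootstrap - value(weights, state)
    # Move each weight along the gradient of the estimate, scaled by the error.
    new_weights = [w + ALPHA * td_error * x
                   for w, x in zip(weights, FEATURES[state])]
    return new_weights, td_error


def train(episodes=500):
    weights = [0.0, 0.0]
    for _ in range(episodes):
        for state, reward, next_state in EPISODE:
            weights, _ = td_step(weights, state, reward, next_state)
    return weights


weights = train()
print(value(weights, "A"), value(weights, "B"))
```

In this chain the estimate for `B` approaches the final reward of 1, and the estimate for `A` approaches the discounted value `GAMMA * 1`, because each update pulls a state's estimate toward the reward plus the discounted estimate of its successor.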

The use of neural networks in reinforcement learning has several advantages. First, neural networks can represent complex, non-linear relationships between the states and actions of a robotic system. Second, they can generalize across large state spaces where storing a separate value for every state is impractical, which makes them a good fit for robots with rich sensor input. Finally, because the weights keep being updated from new experience, a network can adapt to changing environments and unexpected events, which is essential for robots that operate in dynamic and uncertain settings.

Today, neural network-based reinforcement learning is used in a wide range of robotic applications, including robotic arms, autonomous vehicles, and humanoid robots. Researchers and developers are actively exploring the use of neural networks in RL to improve the performance, efficiency, and robustness of robotic systems.

While much progress has been made in this field, there is still much to be discovered. Future research directions include the development of more efficient and scalable algorithms, the exploration of new neural network architectures, and the integration of reinforcement learning with other AI techniques such as computer vision and natural language processing. As the field continues to evolve, we can expect to see more sophisticated and capable robots that can learn and adapt to complex and dynamic environments using the principles outlined by L.-J. Lin in his seminal PhD thesis.

**Keywords:** Reinforcement learning, neural networks, robots, artificial intelligence, machine learning, Q-learning, experience replay, temporal difference learning.

This article aims to provide an in-depth look into the concept of reinforcement learning for robots using neural networks. By exploring the history and principles of this field, it is hoped that readers will gain a better understanding of the potential applications and future directions of this exciting area of research.
