L.-J. Lin, “Reinforcement Learning for Robots Using Neural Networks,” PhD thesis, Carnegie Mellon University, Pittsburgh, 1993.

Listed: 8 May 2026 0 h 46 min

Description


**“Reinforcement Learning for Robots Using Neural Networks”**

In artificial intelligence (AI) and robotics, reinforcement learning (RL) is a technique that lets robots learn and improve their behavior through trial and error. One of the early contributors to this field is L.-J. Lin, whose 1993 PhD thesis, “Reinforcement Learning for Robots Using Neural Networks,” studied how neural networks can support reinforcement learning for robots.

Reinforcement learning is a type of machine learning in which an agent learns to take actions in an environment so as to maximize a cumulative reward signal. It suits robotic systems well, because robots often have to adapt to new environments and learn from their own experience. In RL, a neural network can act as a function approximator: given the current state of the environment, it learns to predict the expected return of each available action.
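The “expected return” mentioned above can be written out explicitly. The following is the standard textbook formulation, not notation taken from the thesis; the discount factor $\gamma$, the reward symbol $r$, and the network weights $\theta$ are assumptions introduced here for illustration.

$$
G_t = \sum_{k=0}^{\infty} \gamma^{k}\, r_{t+k+1}, \qquad 0 \le \gamma < 1
$$

$$
Q(s, a) = \mathbb{E}\left[\, G_t \mid s_t = s,\ a_t = a \,\right], \qquad Q_\theta(s, a) \approx Q(s, a)
$$

Here $G_t$ is the discounted sum of future rewards from time $t$, $Q(s, a)$ is its expected value when action $a$ is taken in state $s$, and $Q_\theta$ is the neural network's approximation of that value.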

Lin’s research demonstrated the feasibility of using neural networks to implement reinforcement learning for robots. The thesis combines temporal difference (TD) learning methods, including Q-learning, with neural networks that estimate the value of a state-action pair, and it is known for experience replay, in which stored past experiences are presented to the learner again. The network learns to approximate the value function by iteratively adjusting its weights to reduce the temporal difference error.
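The weight-adjustment step described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the method from the thesis: the class name `TinyQNetwork`, the function `replay`, the one-hidden-layer tanh architecture, the learning rate, and the discount factor are all hypothetical choices, and the replay loop simply reshuffles stored transitions on each pass.

```python
import math
import random


class TinyQNetwork:
    """Hypothetical one-hidden-layer network: state vector -> one Q-value per action."""

    def __init__(self, n_state, n_hidden, n_actions, lr=0.05, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.gauss(0.0, 0.5) for _ in range(n_state)] for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.gauss(0.0, 0.5) for _ in range(n_hidden)] for _ in range(n_actions)]
        self.b2 = [0.0] * n_actions
        self.lr = lr

    def _hidden(self, state):
        # tanh activations of the hidden layer.
        return [math.tanh(sum(w * s for w, s in zip(row, state)) + b)
                for row, b in zip(self.w1, self.b1)]

    def q_values(self, state):
        hidden = self._hidden(state)
        return [sum(w * h for w, h in zip(row, hidden)) + b
                for row, b in zip(self.w2, self.b2)]

    def td_update(self, state, action, reward, next_state, done, gamma=0.9):
        """One gradient step on 0.5 * td_error**2; returns the TD error before the step."""
        hidden = self._hidden(state)
        q_sa = sum(w * h for w, h in zip(self.w2[action], hidden)) + self.b2[action]
        # Bootstrapped target, treated as a constant (no gradient flows through it).
        target = reward if done else reward + gamma * max(self.q_values(next_state))
        td_error = target - q_sa
        step = self.lr * td_error
        for j, h in enumerate(hidden):
            # Gradient of Q(s, a) with respect to hidden unit j's pre-activation,
            # computed before the output weight for unit j is changed.
            grad_pre = self.w2[action][j] * (1.0 - h * h)
            self.w2[action][j] += step * h
            for i, s in enumerate(state):
                self.w1[j][i] += step * grad_pre * s
            self.b1[j] += step * grad_pre
        self.b2[action] += step
        return td_error


def replay(network, buffer, n_passes, seed=0):
    """Simplified experience replay: revisit stored transitions in shuffled order."""
    rng = random.Random(seed)
    for _ in range(n_passes):
        for transition in rng.sample(buffer, len(buffer)):
            network.td_update(*transition)


# Usage: one stored transition (state, action, reward, next_state, done).
net = TinyQNetwork(n_state=2, n_hidden=8, n_actions=2)
stored = [([1.0, 0.0], 0, 1.0, [0.0, 0.0], True)]
replay(net, stored, n_passes=300)
print(net.q_values([1.0, 0.0])[0])  # estimate for the replayed state-action pair
```

Replaying the same terminal transition repeatedly drives the network's estimate for that state-action pair toward the observed reward. A real robot learner would store many transitions gathered while acting and would also need an exploration strategy, which this sketch leaves out.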

The use of neural networks in reinforcement learning has several advantages. First, neural networks can represent complex, non-linear relationships between the states and actions of a robotic system. Second, they can generalize across large or continuous state spaces, where storing a separate value for every state would be impractical. Finally, because their weights keep being updated from new experience, they can adapt to changing environments and unexpected events, which is essential for robots that operate in dynamic and uncertain settings.

Today, neural network-based reinforcement learning is used in a wide range of robotic applications, including robotic arms, autonomous vehicles, and humanoid robots. Researchers and developers are actively exploring the use of neural networks in RL to improve the performance, efficiency, and robustness of robotic systems.

While much progress has been made in this field, there is still much to be discovered. Future research directions include the development of more efficient and scalable algorithms, the exploration of new neural network architectures, and the integration of reinforcement learning with other AI techniques such as computer vision and natural language processing. As the field continues to evolve, we can expect to see more sophisticated and capable robots that can learn and adapt to complex and dynamic environments using the principles outlined by L.-J. Lin in his seminal PhD thesis.

**Keywords:** Reinforcement learning, neural networks, robots, artificial intelligence, machine learning, Q-learning, experience replay, temporal difference learning.

This article gives an overview of reinforcement learning for robots using neural networks. By outlining the history and principles of the field, it aims to give readers a better understanding of its potential applications and future directions.


