Bonjour, ceci est un commentaire. Pour supprimer un commentaire, connectez-vous et affichez les commentaires de cet article. Vous pourrez alors…
L.-J. Lin, “Reinforcement Learning for Robots Using Neural Networks,” PhD thesis, Carnegie Mellon Univer-sity, Pittsburgh, 1993.
- Listed: 8 May 2026 0 h 46 min
Description
L.-J. Lin, “Reinforcement Learning for Robots Using Neural Networks,” PhD thesis, Carnegie Mellon Univer-sity, Pittsburgh, 1993.
**”Reinforcement Learning for Robots Using Neural Networks”**
In the realm of artificial intelligence (AI) and robotics, the field of reinforcement learning (RL) has emerged as a groundbreaking technique for enabling robots to learn and improve their behavior through trial and error. One of the pioneers in this field is L.-J. Lin, who in his 1993 PhD thesis, “Reinforcement Learning for Robots Using Neural Networks,” proposed the use of neural networks to facilitate reinforcement learning for robots.
Reinforcement learning is a type of machine learning that involves training an agent to take actions in an environment to maximize a reward signal. This type of learning is particularly well-suited for robotic systems, which often require adapting to new environments and learning from their experiences. In the context of RL, a neural network acts as a function approximator, learning to predict the expected return of a particular action based on the current state of the environment.
Lin’s research demonstrated the feasibility of using neural networks to implement reinforcement learning for robots. He proposed the “TD-Gammon” algorithm, a type of temporal difference (TD) learning method that uses a neural network to estimate the value function of a state-action pair. The algorithm learns to approximate the value function by iteratively adjusting the weights of the neural network to minimize the temporal difference error.
The use of neural networks in reinforcement learning has several advantages. Firstly, neural networks can learn complex and non-linear relationships between the state and action spaces of a robotic system. Secondly, they can handle large and complex state spaces with ease, making them particularly well-suited for robotic systems. Finally, neural networks can learn to adapt to changing environments and unexpected events, which is essential for robots that operate in dynamic and uncertain settings.
Today, neural network-based reinforcement learning is used in a wide range of robotic applications, including robotic arms, autonomous vehicles, and humanoid robots. Researchers and developers are actively exploring the use of neural networks in RL to improve the performance, efficiency, and robustness of robotic systems.
While much progress has been made in this field, there is still much to be discovered. Future research directions include the development of more efficient and scalable algorithms, the exploration of new neural network architectures, and the integration of reinforcement learning with other AI techniques such as computer vision and natural language processing. As the field continues to evolve, we can expect to see more sophisticated and capable robots that can learn and adapt to complex and dynamic environments using the principles outlined by L.-J. Lin in his seminal PhD thesis.
**Keywords:** Reinforcement learning, neural networks, robots, artificial intelligence, machine learning, TD-Gammon algorithm, temporal difference learning.
This article aims to provide an in-depth look into the concept of reinforcement learning for robots using neural networks. By exploring the history and principles of this field, it is hoped that readers will gain a better understanding of the potential applications and future directions of this exciting area of research.
27 total views, 3 today
Sponsored Links
Hein, G. W., Godet, J., Issler J.-L., Martin, J.-C., Erhard, P., Lucas-Rodr...
Hein, G. W., Godet, J., Issler J.-L., Martin, J.-C., Erhard, P., Lucas-Rodriguez, R. and Pratt, T. (2002) Status of Galileo Frequency and Signal Design. LeMaster, […]
No views yet
Ganguly, S., Jovancevic, A., Kirchner, M., Noronha, J., Zigic, S. (2004) GP...
Ganguly, S., Jovancevic, A., Kirchner, M., Noronha, J., Zigic, S. (2004) GPS Signal Reconstitution. Proceedings of ION GNSS, Long Beach, CA. **”Ganguly, S., Jovancevic, A., […]
1 total views, 1 today
Cobb H. S. (1997) GPS pseudolites: theory, design, and applications. Disser...
Cobb H. S. (1997) GPS pseudolites: theory, design, and applications. Dissertation submitted to the department of Aeronautics and Astronautics, Stanford University. “Cobb H. S. (1997) […]
No views yet
Tsujii, T., Tomita, H., Okuno, Y., Kogure, S., Kishimoto, M., Okano, K., Di...
Tsujii, T., Tomita, H., Okuno, Y., Kogure, S., Kishimoto, M., Okano, K., Dinesh, M., Petrovski, I., Asako, M. (1997) Development of a BOC/CA Pseudo QZS […]
2 total views, 2 today
Tsujii, T., Tomita, H., Okuno, Y., Okano, K., Asako, M., and Petrovski, I. ...
Tsujii, T., Tomita, H., Okuno, Y., Okano, K., Asako, M., and Petrovski, I. (2006) Measuring Multipath Error of a Pseudo Quasi-Zenith Satellite, International Symposium on […]
2 total views, 2 today
Tsujii, T., Harigae, M., and Harada, M. (2004) Navigation and Positioning S...
Tsujii, T., Harigae, M., and Harada, M. (2004) Navigation and Positioning System Using High Altitude Platforms Systems (HAPS), Journal of the Japan Society for Aeronautical […]
2 total views, 2 today
Tsujii, T., Harigae, M., Barnes, J., Wang, J., and Rizos, C. (2002) Experim...
Tsujii, T., Harigae, M., Barnes, J., Wang, J., and Rizos, C. (2002) Experiments of inverted pseudolite positioning for airship-based GPS augmentation system, Proc. of 15th […]
2 total views, 2 today
Petrovski, I., , Kawaguchi, S., Torimoto, H., and Hasegawa, T. (2001) Devel...
Petrovski, I., , Kawaguchi, S., Torimoto, H., and Hasegawa, T. (2001) Development of Highway ITS and Pedestrian ITS Based on RTK Network, Pseudolite and PN […]
2 total views, 2 today
Kishimoto, M., Hase, H., Matsumoto, A., Tsuruta, T., Kogure, S., Inaba, N.,...
Kishimoto, M., Hase, H., Matsumoto, A., Tsuruta, T., Kogure, S., Inaba, N., Sawabe, M., Kawanichi, T., Yoshitomi, S., and Terada, K. (2007) QZSS System design […]
2 total views, 2 today
Wang J. (2002) Applications of pseudolites in geodetic positioning: Progres...
Wang J. (2002) Applications of pseudolites in geodetic positioning: Progress and problems. Journal of Global Positioning Systems, 1(1): 48-56. Okay, let’s tackle this blog post. […]
1 total views, 1 today
Hein, G. W., Godet, J., Issler J.-L., Martin, J.-C., Erhard, P., Lucas-Rodr...
Hein, G. W., Godet, J., Issler J.-L., Martin, J.-C., Erhard, P., Lucas-Rodriguez, R. and Pratt, T. (2002) Status of Galileo Frequency and Signal Design. LeMaster, […]
No views yet
Ganguly, S., Jovancevic, A., Kirchner, M., Noronha, J., Zigic, S. (2004) GP...
Ganguly, S., Jovancevic, A., Kirchner, M., Noronha, J., Zigic, S. (2004) GPS Signal Reconstitution. Proceedings of ION GNSS, Long Beach, CA. **”Ganguly, S., Jovancevic, A., […]
1 total views, 1 today
Cobb H. S. (1997) GPS pseudolites: theory, design, and applications. Disser...
Cobb H. S. (1997) GPS pseudolites: theory, design, and applications. Dissertation submitted to the department of Aeronautics and Astronautics, Stanford University. “Cobb H. S. (1997) […]
No views yet
Tsujii, T., Tomita, H., Okuno, Y., Kogure, S., Kishimoto, M., Okano, K., Di...
Tsujii, T., Tomita, H., Okuno, Y., Kogure, S., Kishimoto, M., Okano, K., Dinesh, M., Petrovski, I., Asako, M. (1997) Development of a BOC/CA Pseudo QZS […]
2 total views, 2 today
Tsujii, T., Tomita, H., Okuno, Y., Okano, K., Asako, M., and Petrovski, I. ...
Tsujii, T., Tomita, H., Okuno, Y., Okano, K., Asako, M., and Petrovski, I. (2006) Measuring Multipath Error of a Pseudo Quasi-Zenith Satellite, International Symposium on […]
2 total views, 2 today
Tsujii, T., Harigae, M., and Harada, M. (2004) Navigation and Positioning S...
Tsujii, T., Harigae, M., and Harada, M. (2004) Navigation and Positioning System Using High Altitude Platforms Systems (HAPS), Journal of the Japan Society for Aeronautical […]
2 total views, 2 today
Tsujii, T., Harigae, M., Barnes, J., Wang, J., and Rizos, C. (2002) Experim...
Tsujii, T., Harigae, M., Barnes, J., Wang, J., and Rizos, C. (2002) Experiments of inverted pseudolite positioning for airship-based GPS augmentation system, Proc. of 15th […]
2 total views, 2 today
Petrovski, I., , Kawaguchi, S., Torimoto, H., and Hasegawa, T. (2001) Devel...
Petrovski, I., , Kawaguchi, S., Torimoto, H., and Hasegawa, T. (2001) Development of Highway ITS and Pedestrian ITS Based on RTK Network, Pseudolite and PN […]
2 total views, 2 today
Kishimoto, M., Hase, H., Matsumoto, A., Tsuruta, T., Kogure, S., Inaba, N.,...
Kishimoto, M., Hase, H., Matsumoto, A., Tsuruta, T., Kogure, S., Inaba, N., Sawabe, M., Kawanichi, T., Yoshitomi, S., and Terada, K. (2007) QZSS System design […]
2 total views, 2 today
Wang J. (2002) Applications of pseudolites in geodetic positioning: Progres...
Wang J. (2002) Applications of pseudolites in geodetic positioning: Progress and problems. Journal of Global Positioning Systems, 1(1): 48-56. Okay, let’s tackle this blog post. […]
1 total views, 1 today
Recent Comments