Thursday, April 2, 2026
Science
No Result
View All Result
  • Login
  • HOME
  • SCIENCE NEWS
  • CONTACT US
  • HOME
  • SCIENCE NEWS
  • CONTACT US
No Result
View All Result
Scienmag
No Result
View All Result
Home Science News Policy

Discrete-time rewards efficiently guide the extraction of continuous-time optimal control policy from system data

June 28, 2024
in Policy
Reading Time: 3 mins read
0
Schematic framework of the reinforcement learning algorithm using policy iteration for continuous-time dynamical systems
66
SHARES
599
VIEWS
Share on FacebookShare on Twitter
ADVERTISEMENT

This study is led by an international team of scientists including Dr. Ci Chen (School of Automation, Guangdong University of Technology, China), Dr. Lihua Xie (School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore), and Dr. Shengli Xie (Guangdong-HongKong-Macao Joint Laboratory for Smart Discrete Manufacturing, Guangdong Key Laboratory of IoT Information Technology, China), co-contributed by Dr. Yilu Liu (Department of Electrical Engineering and Computer Science, University of Tennessee, USA) and Dr. Frank L. Lewis (UTA Research Institute, The University of Texas at Arlington, USA).

Schematic framework of the reinforcement learning algorithm using policy iteration for continuous-time dynamical systems

Credit: ©Science China Press

This study is led by an international team of scientists including Dr. Ci Chen (School of Automation, Guangdong University of Technology, China), Dr. Lihua Xie (School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore), and Dr. Shengli Xie (Guangdong-HongKong-Macao Joint Laboratory for Smart Discrete Manufacturing, Guangdong Key Laboratory of IoT Information Technology, China), co-contributed by Dr. Yilu Liu (Department of Electrical Engineering and Computer Science, University of Tennessee, USA) and Dr. Frank L. Lewis (UTA Research Institute, The University of Texas at Arlington, USA).

The concept of Reward is central in reinforcement learning and is also widely used in the natural sciences, engineering, and social sciences. Organisms learn behavior by interacting with their environment and observing the resulting rewarding stimuli. The expression of rewards largely represents the perception of the system and defines the behavioral state of the dynamic system. In reinforcement learning, finding rewards that explain behavioral decisions of dynamic systems has been an open challenge.

The work aims to propose reinforcement learning algorithms using discrete-time rewards in both continuous time and action space, where the continuous space corresponds to the phenomena or behaviors of a system described by the laws of physics. The approach of feeding state derivatives back into the learning process has led to the development of an analytical framework for reinforcement learning based on discrete-time rewards, which is essentially different from existing integral reinforcement learning frameworks. “When the idea of feedbacking the derivative into the learning process struck, it felt like lightning! And guess what? It mathematically ties into the discrete-time reward-based policy learning!” Chen recalls his Eureka moment and says.

Under the guidance of discrete-time reward, the search process of behavioral decision law is divided into two stages: feed-forward signal learning and feedback gain learning. In their study, it was found that the optimal decision law for continuous-time dynamic systems can be searched from real-time data of dynamic systems using the discrete-time reward-based technique. The above method has been applied to power system state regulation to achieve optimal design of output feedback. This process eliminates the intermediate stage of identifying dynamic models and significantly improves the computational efficiency by removing the reward integrator operator from the existing integral reinforcement learning framework.

This research uses discrete-time reward guidance to discover optimization strategies for continuous-time dynamical systems, and constructs a computational tool for understanding and improving dynamical systems. This result can play an important role in natural science, engineering, and social science.

This work was supported by the National Natural Science Foundation of China and the Fundamental and Applied Basic Research Fund of Guangdong Province.

 

See the article:

Learning the Continuous-Time Optimal Decision Law from Discrete-Time Rewards



Journal

National Science Open

DOI

10.1360/nso/20230054

Share26Tweet17
Previous Post

Deep learning-assisted lesion segmentation in PET/CT imaging: A feasibility study for salvage radiation therapy in prostate cancer

Next Post

Prostate cancer test is missing early disease in transgender women

Related Posts

blank
Policy

University of Cincinnati Launches Innovative Center for Public Health Research

April 2, 2026
blank
Policy

Aging Populations and Rising Solo Households May Hinder Decarbonization Efforts and Increase Energy Poverty, Study Finds

April 2, 2026
blank
Policy

Why Governments Struggle to Invest in Risk Prevention

April 2, 2026
blank
Policy

New Study Identifies Household Cleaning Products as Major Contributor to Childhood Injuries

April 2, 2026
blank
Policy

Shisha Smoking Remains an Underrecognized Public Health Concern

April 1, 2026
blank
Policy

AIBS Announces Winners of Photo Contest in New Report

April 1, 2026
Next Post
Prostate cancer test is missing early disease in transgender women

Prostate cancer test is missing early disease in transgender women

  • Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

    27631 shares
    Share 11049 Tweet 6906
  • University of Seville Breaks 120-Year-Old Mystery, Revises a Key Einstein Concept

    1033 shares
    Share 413 Tweet 258
  • Bee body mass, pathogens and local climate influence heat tolerance

    673 shares
    Share 269 Tweet 168
  • Researchers record first-ever images and data of a shark experiencing a boat strike

    537 shares
    Share 215 Tweet 134
  • Groundbreaking Clinical Trial Reveals Lubiprostone Enhances Kidney Function

    523 shares
    Share 209 Tweet 131
Science

Embark on a thrilling journey of discovery with Scienmag.com—your ultimate source for cutting-edge breakthroughs. Immerse yourself in a world where curiosity knows no limits and tomorrow’s possibilities become today’s reality!

RECENT NEWS

  • Revolutionary Magnetic Biochar Gel Tackles Arsenic and Antimony Pollution in Rice Cultivation
  • From Coffee Waste to Cutting-Edge Biodegradable Insulation: A Green Innovation
  • New Study Links Obstructive Sleep Apnea to Increased Risk of Mortality and Cardiovascular Events
  • Optimizing Biochar Temperature Unlocks Significant Nitrogen Savings in Food Waste Composting

Categories

  • Agriculture
  • Anthropology
  • Archaeology
  • Athmospheric
  • Biology
  • Biotechnology
  • Blog
  • Bussines
  • Cancer
  • Chemistry
  • Climate
  • Earth Science
  • Editorial Policy
  • Marine
  • Mathematics
  • Medicine
  • Pediatry
  • Policy
  • Psychology & Psychiatry
  • Science Education
  • Social Science
  • Space
  • Technology and Engineering

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 5,146 other subscribers

© 2025 Scienmag - Science Magazine

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • HOME
  • SCIENCE NEWS
  • CONTACT US

© 2025 Scienmag - Science Magazine

Discover more from Science

Subscribe now to keep reading and get access to the full archive.

Continue reading