莫烦Python RL 代码阅读一

莫烦Python RL 代码阅读一算法分析def build_q_table(n_states, actions)def choose_action(st

2023-12-08 技术

莫烦Python RL 代码阅读一

算法分析
- def build_q_table(n_states, actions)
- def choose_action(state, q_table)
- def rl()
Python函数功能补漏
- np.random.seed() 函数
- np.zeros(a,b)函数
- pd.DataFrame()函数
- - 功能函数
  - 根据字典创建
- q_table.iloc()函数
- np.random.uniform() 函数
- .all()函数
- .idmax()函数
- .format()函数

算法分析

不分析如何实现 environment

def build_q_table(n_states, actions)

def build_q_table(n_states, actions):table = pd.DataFrame(np.zeros((n_states, len(actions))),     # q_table initial valuescolumns=actions,    # actions's name)# print(table)    # show tablereturn table

pd.DataFrame()
np.zeros(a,b)
在下方版块有整理

此函数用于构建n行，len（actions）列的Q-table，算是典型的表格型方法

def choose_action(state, q_table)

def choose_action(state, q_table):# This is how to choose an actionstate_actions = q_table.iloc[state, :]if (np.random.uniform() > EPSILON) or

版权声明

本文来自互联网用户投稿，文章观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处。如若内容有涉嫌抄袭侵权/违法违规/事实不符，请点击举报进行投诉反馈！