在线时间:8:00-16:00
迪恩网络APP
随时随地掌握行业动态
扫描二维码
关注迪恩网络微信公众号
开源软件名称:JuliaML/Reinforce.jl开源软件地址:https://github.com/JuliaML/Reinforce.jl开源编程语言:Julia 100.0%开源软件介绍:DEPRECATEDThis package is discontinued. Please check ReinforcementLearning.jl, POMDPs.jl or AlphaZero.jl instead. ReinforceReinforce.jl is an interface for Reinforcement Learning. It is intended to connect modular environments, policies, and solvers with a simple interface. Packages which build on Reinforce:
Environment InterfaceNew environments are created by subtyping
and optional overrides:
which map to
An environment may be fully observable (MDP) or partially observable (POMDP).
In the case of a partially observable environment, the state
The terminating condition of an episode is control by
An minimal example for testing purpose is TODO: more details and examples Policy InterfaceAgents/policies are created by subtyping struct RandomPolicy <: AbstractPolicy end
action(π::RandomPolicy, r, s, A) = rand(A) Where
Episode IteratorIterate through episodes using the ep = Episode(env, π)
for (s, a, r, s′) in ep
# do some custom processing of the sars-tuple
end
R = ep.total_reward
T = ep.niter There is also a convenience method R = run_episode(env, π) do
# anything you want... this section is called after each step
end Tom BreloffAuthor: |
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论