- 1、有哪些信誉好的足球投注网站(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。。
- 2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
- 4、该文档为VIP文档,如果想要下载,成为VIP会员后,下载免费。
- 5、成为VIP后,下载本文档将扣除1次下载权益。下载后,不支持退款、换文档。如有疑问请联系我们。
- 6、成为VIP后,您将拥有八大权益,权益包括:VIP文档下载权益、阅读免打扰、文档格式转换、高级专利检索、专属身份标志、高级客服、多端互通、版权登记。
- 7、VIP文档为合作方或网友上传,每下载1次, 网站将根据用户上传文档的质量评分、类型等,对文档贡献者给予高额补贴、流量扶持。如果你也想贡献VIP文档。上传文档
查看更多
Training Reinforcement Neurocontrollers Using the Polytope Algorithm.pdf
TRAINING REINFORCEMENT NEUROCONTROLLERS USING THE POLYTOPE ALGORITHM Aristidis Likas and Isaac Lagaris Department of Computer Science 8 University of Ioannina 9 9 P.O. Box. 1186 - GR 45110 Ioannina, Greece 1 c e D Correspondence: A. Likas 3 Department of Computer Science ] E University of Ioannina N. P.O. Box. 1186 - GR 45110 Ioannina, Greece s c tel: +30-651-97310 [ fax: +30-651-48131 1 v e-mail: arly@cs.uoi.gr 2 0 0 2 1 Abstract 8 9 A new training algorithm is presented for delayed reinforcement learn- / s ing problems that does not assume the existence of a critic model and c : employs the polytope optimization algorithm to adjust the weights of the v i action network so that a simple direct measure of the training performance X is maximized. Experimental results from the application of the method to r the pole balancing problem indicate improved training performance com- a pared with critic-based and genetic reinforcement approaches. Keywords: reinforcement learning, neurocontrol, optimization, polytope al- gorithm, pole balancing, genetic reinforcement. 1 TRAINING REINFORCEMENT NEUROCONTROLLERS USING THE POLYTOPE ALGORITHM Abstract A new training algorithm is presented for delayed reinforcement learn- ing problems that does not assu
您可能关注的文档
- Three Novel Lanthanide MOFs Constructed from 1,3-Benzenedicarboxylic Acid and 1, 10-Phenanthroline.pdf
- Ti6Al4V的有限元模拟.pdf
- TiB_2_TiC复相陶瓷的结构与性能研究.pdf
- TiC_TiB_2复相陶瓷的自蔓延高温合成研究.pdf
- Tight security proofs for the bounded-storage model.pdf
- TIM Flow EN.ppt
- TIME DEIXIS.ppt
- Time Series and Related Topics. In Memory of Ching-Zong Wei.pdf
- Time-Oriented Skeletal Plans Support to Design and Execution.pdf
- Time-travel,fiction and drama.ppt
- Transcription factors that control inner ear development and their potential for.pdf
- Translation of Argumentative Essays.ppt
- Trial balance detail2009-1-1 to 2009-12-31.xls
- Trial Bank Publishing Phase I Results.pdf
- TRNSYS介绍ppt.pdf
- TRS957_2010 含毒质药品WHO生产管理规范.doc
- TR_BT04_C1_1 随机接入专题.pdf
- TS16949_2002 检查清单.xls
- TS2012EIJT,TS2012EIJT,TS2012EIJT, 规格书,Datasheet 资料.pdf
- TS4962IQT,TS4962IQT,TS4962IQT, 规格书,Datasheet 资料.pdf
最近下载
- 《低钠血症的中国专家共识(2023)》解读PPT课件.pptx VIP
- 初中语文通用版 现代文阅读答题技巧(公式化模板 + 完整版提分攻略).docx VIP
- 期刊合作办刊协议书.docx VIP
- 驭胜s350维修手册及电路图n351整车电路图全.pdf VIP
- 混凝土热工计算软件.xls VIP
- 小学信息技术教学计划.docx VIP
- 八 观察物体(二)(单元教学设计)苏教版 三年级上册数学2025版.pdf
- 七上语文常考必背重点知识梳理总结(答案版)【2024新版】.pdf VIP
- 最全面总工会招聘考试工会知识模拟试卷及答案(共五套).docx
- 2020年总工会招聘考试工会知识模拟试卷及答案(一).docx VIP
有哪些信誉好的足球投注网站
文档评论(0)