首页
技术日记
编程
旅游
数码
登录
标签
Reinforcement
分层强化学习综述:Hierarchical reinforcement learning: A comprehensive survey
论文名称:Hierarchical reinforcement learning: A comprehensive survey 论文发表期刊:ACM Computing Surveys 期刊影响因子:10.282(2022年) 论文作者:
Reinforcement
Hierarchical
learning
Survey
Comprehensive
admin
3月前
49
0
IntelliLight: a Reinforcement Learning Approach for Intelligent Traffic Light Control 论文阅读
IntelliLight 全文脉络概述1、本文贡献1)Experiments with real traffic data.2)Interpretations of the policy.3&am
论文
learning
Approach
IntelliLight
Reinforcement
admin
4月前
58
0
A Minimalist Approach to Offline Reinforcement Learning[TD3+BC]阅读笔记
A Minimalist Approach to Offline Reinforcement Learning[TD3BC]阅读笔记 文章目录A Minimalist Approach to Offline Reinforcement Le
笔记
Offline
Approach
Minimalist
Reinforcement
admin
4月前
41
0
[NIPS2017] A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning 笔记
文章目录前言Background and Related WorkNeural Fictitious Self-PlayPolicy-Space Response OraclesMeta-Strategy SolversDeep Cogni
笔记
GAME
Theoretic
Unified
Reinforcement
admin
4月前
43
0
《A Distributional Perspective on Reinforcement Learning》的理解
近日看一本关于Reinforcement Learning的入门书《Deep Reinforcement Learning Hands On》,甚有收获。该书由PacktPublishing在2018年出版,它不仅介绍了RL的基础理论,而且
perspective
Distributional
learning
Reinforcement
admin
6月前
127
0
A Distributional Perspective on Reinforcement Learning
本文论证了值分布的基本重要性:强化学习智能体收到的随机回报的分布。这与强化学习的常见方法相反,后者对这种回报或价值的期望进行建模。尽管已经建立了研究价值分布的文献体系,但迄今为止ÿ
perspective
Distributional
learning
Reinforcement
admin
6月前
107
0
Human-level control through deep reinforcement learning
Abstract 强化学习理论在动物行为上,深入到心理和神经科学的角度,关于在一个环境中如何使得智能体优化他们的控制,提供了一个正式的规范。为了利用强化学习成功的接近现实世界
control
Level
human
learning
Reinforcement
admin
7月前
113
0
深度强化学习综述论文 A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning 深度强化学习的简要概述 作者: Kai Arulkumaran, Marc Peter Deisenroth, Miles
深度
论文
Survey
learning
Reinforcement
admin
7月前
120
0
Reinforcement Learning with Human in the Loop & Human Feedback
人在环路的强化学习(Reinforcement Learning with Human in the Loop, HIL) 和 人类反馈的强化学习(Reinforcement
human
learning
Reinforcement
feedback
amp
admin
7月前
88
0
大模型微调实战之 Transformer 强化学习(TRL Reinforcement Learning)(三)Proximal Policy Optimization
大模型微调实战之 Transformer 强化学习(TRL Reinforcement Learning)(三)Proximal Policy Optimization Proximal Policy Optimization 这是一个
实战
模型
TRL
Transformer
Reinforcement
admin
7月前
75
0
Deep Reinforcement Learning + Potential Game + Vehicular Edge Computing
文献 [1] 采用deep reinforcement learning和potential game研究vehicular edge computing场景下的任务卸载和资源优化分配策略 文献[2] 采用potential game设计
learning
potential
deep
Reinforcement
Edge
admin
2025-1-31
87
0
【论文翻译】A Comprehensive Survey on Safe Reinforcement Learning
本篇译文为方便自己再次阅读而记录,源自Google翻译和CNKI翻译助手。习惯用语保持英文(例:agent),一些细微之处结合自己
论文
Comprehensive
Survey
learning
Reinforcement
admin
2025-1-31
101
0
Reinforcement
Reinforcement
admin
2023-7-1
108
0