Linux大棚 – 不忘初心的技术博客,浮躁时代的安静角落
  •  首页
  •  技术日记
  •  编程
  •  旅游
  •  数码
  •  登录
  1. 标签
  2. Reinforcement
  • 分层强化学习综述:Hierarchical reinforcement learning: A comprehensive survey

    论文名称:Hierarchical reinforcement learning: A comprehensive survey 论文发表期刊:ACM Computing Surveys 期刊影响因子:10.282(2022年) 论文作者:
    Reinforcement Hierarchical learning Survey Comprehensive
    admin 3月前
    49 0
  • IntelliLight: a Reinforcement Learning Approach for Intelligent Traffic Light Control 论文阅读

    IntelliLight 全文脉络概述1、本文贡献1)Experiments with real traffic data.2)Interpretations of the policy.3&am
    论文 learning Approach IntelliLight Reinforcement
    admin 4月前
    58 0
  • A Minimalist Approach to Offline Reinforcement Learning[TD3+BC]阅读笔记

    A Minimalist Approach to Offline Reinforcement Learning[TD3BC]阅读笔记 文章目录A Minimalist Approach to Offline Reinforcement Le
    笔记 Offline Approach Minimalist Reinforcement
    admin 4月前
    41 0
  • [NIPS2017] A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning 笔记

    文章目录前言Background and Related WorkNeural Fictitious Self-PlayPolicy-Space Response OraclesMeta-Strategy SolversDeep Cogni
    笔记 GAME Theoretic Unified Reinforcement
    admin 4月前
    43 0
  • 《A Distributional Perspective on Reinforcement Learning》的理解

    近日看一本关于Reinforcement Learning的入门书《Deep Reinforcement Learning Hands On》,甚有收获。该书由PacktPublishing在2018年出版,它不仅介绍了RL的基础理论,而且
    perspective Distributional learning Reinforcement
    admin 6月前
    127 0
  • A Distributional Perspective on Reinforcement Learning

    本文论证了值分布的基本重要性:强化学习智能体收到的随机回报的分布。这与强化学习的常见方法相反,后者对这种回报或价值的期望进行建模。尽管已经建立了研究价值分布的文献体系,但迄今为止&#xff
    perspective Distributional learning Reinforcement
    admin 6月前
    107 0
  • Human-level control through deep reinforcement learning

    Abstract 强化学习理论在动物行为上,深入到心理和神经科学的角度,关于在一个环境中如何使得智能体优化他们的控制,提供了一个正式的规范。为了利用强化学习成功的接近现实世界
    control Level human learning Reinforcement
    admin 7月前
    113 0
  • 深度强化学习综述论文 A Brief Survey of Deep Reinforcement Learning

    A Brief Survey of Deep Reinforcement Learning 深度强化学习的简要概述 作者: Kai Arulkumaran, Marc Peter Deisenroth, Miles
    深度 论文 Survey learning Reinforcement
    admin 7月前
    120 0
  • Reinforcement Learning with Human in the Loop & Human Feedback

    人在环路的强化学习(Reinforcement Learning with Human in the Loop, HIL) 和 人类反馈的强化学习(Reinforcement
    human learning Reinforcement feedback amp
    admin 7月前
    88 0
  • 大模型微调实战之 Transformer 强化学习(TRL Reinforcement Learning)(三)Proximal Policy Optimization

    大模型微调实战之 Transformer 强化学习(TRL Reinforcement Learning)(三)Proximal Policy Optimization Proximal Policy Optimization 这是一个
    实战 模型 TRL Transformer Reinforcement
    admin 7月前
    75 0
  • Deep Reinforcement Learning + Potential Game + Vehicular Edge Computing

    文献 [1] 采用deep reinforcement learning和potential game研究vehicular edge computing场景下的任务卸载和资源优化分配策略 文献[2] 采用potential game设计
    learning potential deep Reinforcement Edge
    admin 2025-1-31
    87 0
  • 【论文翻译】A Comprehensive Survey on Safe Reinforcement Learning

    本篇译文为方便自己再次阅读而记录,源自Google翻译和CNKI翻译助手。习惯用语保持英文(例:agent),一些细微之处结合自己
    论文 Comprehensive Survey learning Reinforcement
    admin 2025-1-31
    101 0
  • Reinforcement

    Reinforcement
    admin 2023-7-1
    108 0
CopyRight © 2022 All Rights Reserved 豫ICP备2021025688号-21
Processed: 0.016 , SQL: 9