Shunyu 的博客 | Shunyu's Blog

[置顶] Collections

资料收藏

教程/文档 Python Python教程 - 廖雪峰的官方网站 Python 基础教程 - 菜鸟教程莫烦PYTHON Pytorch PYTORCH DOCUMENTATION PyTorch 中文文档 PyTorch 中文网《动手学深度学习》(PyTorch版) Tensorflow TensorFlow TensorFlow...

Posted by Shunyu on August 8, 2188

python 中 os 与 shutil 的使用

python 中 os 与 shutil 的使用。 os 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 import os os.system() # 运行 shell 命令 os.linesep # 当前操作系统的换行符 os.sep # 当前操作系统的路径分隔符 os.getcwd() #...

Posted by Shunyu on October 21, 2020

crontab 执行定时任务

crontab 执行定时任务配置

crontab 执行定时任务配置。基础命令 crontab 的使用命令 1 2 3 4 5 6 7 8 # 编辑 crontab 时程表 crontab -e # 列出 crontab 时程表 crontab -l # 删除 crontab 时程表 crontab -r crontab 服务的重启关闭开启 Mac： 1 2 3 sudo /usr/sbin/cron st...

Posted by Shunyu on August 19, 2020

OpenAI Gym 库使用小技巧

OpenAI Gym 库使用小技巧。 gym 库渲染显示 1、使用虚拟帧缓冲区打开 notebook 1 xvfb-run -s "-screen 0 1400x900x24" jupyter notebook 2、实现在 notebook 中显示 gym 库的渲染显示 1 2 3 4 5 6 7 8 9 10 11 12 13 import matplotlib.pyplot ...

Posted by Shunyu on August 8, 2020

ssh 密钥登陆服务器

ssh 设置密钥及配置 config

首先进行 ssh 设置密钥，然后在本地配置 ssh config 文件实现快捷登陆。 ssh 设置密钥 1. 在本地电脑上制作密钥对首先在本地电脑上制作密钥对，执行以下命令： 1 2 3 4 5 6 7 8 9 10 [root@local ~]$ ssh-keygen <== 建立密钥对 Generating public/private rsa key pair. En...

Posted by Shunyu on August 8, 2020

强化学习论文（17）MAAC

Actor-Attention-Critic for Multi-Agent Reinforcement Learning

标签： MAAC; actor-critic; off-policy; model-free; communication; continuous communication channel; discrete action space; continuous state space; mixed task; cooperative task; competitive task; cent...

Posted by Shunyu on July 7, 2020

强化学习论文（16）Qatten

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

标签： Qatten; value-based; off-policy; model-free; discrete action space; continuous state space; cooperative task; credit assignment; centralized training with decentralized execution; value functi...

Posted by Shunyu on July 7, 2020

强化学习论文（15）Lenient-DQN

Lenient Multi-Agent Deep Reinforcement Learning

标签： Lenient-DQN; value-based; off-policy; model-free; discrete action space; continuous state space; cooperative task; decentralized approach; multi-agent; 论文链接创新点及贡献 1、论文提出了 Lenient-DQN 方法，将 ...

Posted by Shunyu on July 1, 2020

强化学习论文（14）Dec-HDRQNs&CERTs

Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

标签： Dec-HDRQNs; CERTs; value-based; off-policy; model-free; discrete action space; continuous state space; cooperative task; decentralized approach; multi-agent; multi-task; transfer learning; dis...

Posted by Shunyu on June 29, 2020

强化学习论文（13）Fingerprints

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

标签： Fingerprints; value-based; off-policy; model-free; discrete action space; continuous state space; cooperative task; decentralized approach; multi-agent; 论文链接创新点及贡献 1、针对 IQL 中的非平稳性导致的经验回放技术...

Posted by Shunyu on June 28, 2020

Shunyu's Blog

[置顶] Collections

资料收藏

python 中 os 与 shutil 的使用

python 中 os 与 shutil 的使用

crontab 执行定时任务

crontab 执行定时任务配置

OpenAI Gym 库使用小技巧

OpenAI Gym 库使用小技巧

ssh 密钥登陆服务器

ssh 设置密钥及配置 config

强化学习论文（17）MAAC

Actor-Attention-Critic for Multi-Agent Reinforcement Learning

强化学习论文（16）Qatten

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

强化学习论文（15）Lenient-DQN

Lenient Multi-Agent Deep Reinforcement Learning

强化学习论文（14）Dec-HDRQNs&CERTs

Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

强化学习论文（13）Fingerprints

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

FEATURED TAGS

ABOUT ME

FRIENDS