AlphaStar: Mastering the Real-Time Strategy Game StarCraft II (Blog Reading Notes)


Original post: https://deepmind.com/blog/alphastar-mastering-real-time-strategy-game-starcraft-ii

 

SL = supervised learning, RL = reinforcement learning

 

  • how AlphaStar is trained

input: a list of units and their properties -> DNN -> a sequence of instructions (actions)

DNN: transformer torso (relational deep RL), deep LSTM core, auto-regressive policy head with a pointer network, centralised value baseline
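As a rough mental model, the listed components could be wired up as in the minimal, hypothetical PyTorch sketch below: a transformer torso over per-unit features, an LSTM core, an action-type head plus a pointer network that scores units as targets, and a value head. All dimensions, the mean-pooling, and the single-step (non-autoregressive) heads are simplifications for illustration, not AlphaStar's actual code.

```python
import torch
import torch.nn as nn

class AlphaStarSketch(nn.Module):
    def __init__(self, unit_dim=32, d_model=64, n_actions=10):
        super().__init__()
        self.embed = nn.Linear(unit_dim, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.torso = nn.TransformerEncoder(layer, num_layers=2)   # relational torso
        self.core = nn.LSTM(d_model, d_model, batch_first=True)   # deep LSTM core
        self.action_head = nn.Linear(d_model, n_actions)          # which action type
        self.pointer_query = nn.Linear(d_model, d_model)          # pointer-net query
        self.value_head = nn.Linear(d_model, 1)                   # value baseline

    def forward(self, units, hidden=None):
        # units: (batch, n_units, unit_dim), one feature vector per visible unit
        x = self.torso(self.embed(units))              # unit-to-unit relations
        pooled = x.mean(dim=1, keepdim=True)           # crude summary for the core
        core_out, hidden = self.core(pooled, hidden)
        state = core_out[:, -1]                        # (batch, d_model)
        action_logits = self.action_head(state)
        # pointer network: score each unit encoding against a query from the state
        q = self.pointer_query(state).unsqueeze(2)     # (batch, d_model, 1)
        target_logits = torch.bmm(x, q).squeeze(2)     # (batch, n_units)
        value = self.value_head(state).squeeze(1)
        return action_logits, target_logits, value, hidden

net = AlphaStarSketch()
action_logits, target_logits, value, _ = net(torch.randn(1, 20, 32))
```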

training, stage 1: SL on human game replays -> imitates players' micro/macro strategies

        stage 2: agents compete in a league, network weights updated by RL -> Nash distribution of the league -> final agent
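The SL stage amounts to behaviour cloning: predict the human player's recorded action from the observation and minimise cross-entropy. A toy sketch, with random tensors standing in for replay data and a single linear layer standing in for the full network:

```python
import torch
import torch.nn.functional as F

policy = torch.nn.Linear(32, 10)                 # stand-in for the full network
opt = torch.optim.Adam(policy.parameters(), lr=1e-3)

obs = torch.randn(64, 32)                        # observations from human replays
human_actions = torch.randint(0, 10, (64,))      # what the human actually did
loss = F.cross_entropy(policy(obs), human_actions)
opt.zero_grad(); loss.backward(); opt.step()
```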

multi-agent RL: agents play against each other in a population-based league -> explores a huge strategic space -> each agent must defeat both the strongest current agents and earlier versions
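A minimal sketch of what such a league loop might look like, assuming an Elo-style play_game stand-in for real matches; the sampling probabilities, rating updates, and snapshot schedule are invented for illustration:

```python
import copy
import random

class LeagueAgent:
    def __init__(self, name):
        self.name = name
        self.rating = 0.0

def play_game(a, b):
    # stand-in for a real StarCraft II match: Elo-style win probability
    p_win = 1.0 / (1.0 + 10 ** ((b.rating - a.rating) / 4.0))
    return 1 if random.random() < p_win else -1

def run_league(agents, steps=200):
    snapshots = []                                  # frozen earlier versions
    for step in range(steps):
        learner = random.choice(agents)
        # mostly face current agents, sometimes an old snapshot,
        # so new strategies must also beat earlier ones
        pool = snapshots if snapshots and random.random() < 0.3 else agents
        opponent = random.choice(pool)
        result = play_game(learner, opponent)
        learner.rating += 0.1 * result              # stand-in for an RL update
        if step % 20 == 0:                          # periodically freeze a copy
            snapshots.append(copy.deepcopy(learner))
    return agents

league = run_league([LeagueAgent(f"agent_{i}") for i in range(4)])
print(sorted((a.rating, a.name) for a in league))
```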

 

explore new build orders, unit compositions, micro-management plans

each agent has its own personal objective: beat a specific competitor, beat a distribution of competitors, or build more of a specific unit
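One plausible way to read "personal objective" is as a pseudo-reward mixed into the plain win/loss signal. The objective kinds, dictionary keys, and 0.01 weight below are illustrative guesses, not AlphaStar's actual scheme:

```python
def shaped_reward(game_reward, info, objective):
    if objective["kind"] == "beat_specific":
        # only rewarded for results against one named competitor
        return game_reward if info["opponent"] == objective["target"] else 0.0
    if objective["kind"] == "build_unit":
        # small bonus per unit of the preferred type produced
        return game_reward + 0.01 * info["units_built"].get(objective["unit"], 0)
    return game_reward  # default objective: plain game outcome

print(shaped_reward(1.0,
                    {"opponent": "agent_7", "units_built": {"stalker": 40}},
                    {"kind": "build_unit", "unit": "stalker"}))  # 1.4
```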

NN weights: updated by off-policy actor-critic RL with experience replay, self-imitation learning, and policy distillation
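A toy version of one such update: off-policy actor-critic on replayed data, with a clipped importance ratio correcting for the gap between the current policy and the behaviour policy that generated the data. This is far simpler than AlphaStar's actual algorithm; the tiny network and random batch are stand-ins:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyActorCritic(nn.Module):
    def __init__(self, obs_dim=8, n_actions=4):
        super().__init__()
        self.body = nn.Linear(obs_dim, 32)
        self.policy = nn.Linear(32, n_actions)
        self.value = nn.Linear(32, 1)

    def forward(self, obs):
        h = torch.relu(self.body(obs))
        return self.policy(h), self.value(h).squeeze(-1)

def update(net, opt, batch, gamma=0.99, rho_clip=1.0):
    obs, act, rew, next_obs, behaviour_logp = batch
    logits, values = net(obs)
    with torch.no_grad():
        _, next_values = net(next_obs)
        target = rew + gamma * next_values            # one-step TD target
    logp = F.log_softmax(logits, -1).gather(1, act.unsqueeze(1)).squeeze(1)
    # clipped importance ratio pi/mu corrects for off-policy (replayed) data
    rho = torch.clamp((logp - behaviour_logp).exp(), max=rho_clip).detach()
    adv = (target - values).detach()
    loss = -(rho * adv * logp).mean() + 0.5 * F.mse_loss(values, target)
    opt.zero_grad()
    loss.backward()
    opt.step()

net = TinyActorCritic()
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
batch = (torch.randn(16, 8), torch.randint(0, 4, (16,)), torch.randn(16),
         torch.randn(16, 8), torch.zeros(16))
update(net, opt, batch)
```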

 

training runs on TPUs; final agent: sampled from the Nash distribution of the league, i.e. the best (least exploitable) mixture of strategies
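Such a Nash mixture can be approximated from the league's empirical win-rate matrix. The sketch below uses fictitious play on a made-up zero-sum payoff matrix (a rock-paper-scissors league); the method and the matrix are assumptions for illustration, not what DeepMind reported using:

```python
import numpy as np

def nash_mixture(payoff, iters=10000):
    # payoff[i, j] = expected score of agent i vs agent j (zero-sum)
    n = payoff.shape[0]
    counts = np.zeros(n)
    counts[0] = 1.0
    for _ in range(iters):
        mix = counts / counts.sum()
        best_response = np.argmax(payoff @ mix)   # best reply to current mixture
        counts[best_response] += 1.0
    return counts / counts.sum()

# 3 agents: A beats B, B beats C, C beats A (rock-paper-scissors league)
payoff = np.array([[ 0.,  1., -1.],
                   [-1.,  0.,  1.],
                   [ 1., -1.,  0.]])
print(nash_mixture(payoff))  # approaches [1/3, 1/3, 1/3]
```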

 

  • how AlphaStar plays and how it is evaluated

TLO/MaNa  ~ 100 APM

agents: ~1,000-10,000 APM

AlphaStar vs. TLO/MaNa: ~280 APM on average (reads game frames through the raw interface)

AlphaStar's acting loop: observation -> action, with ~350 ms average delay; it processes every game frame

results: AlphaStar won 5:0 (against both TLO and MaNa)

 

Reposted from: https://www.cnblogs.com/yaoyaohust/p/10815039.html
