Sunbelt Computer Software

Deep Learning Playground

这个仓库目前包含两个本地强化学习迷宫实验项目：

ai-maze-lab/：轻量 Q-learning + NumPy + Pygame 版本，包含训练日志、曲线、热力图、钥匙和门规则，以及 macOS 双击启动包装。
ai-maze-ppo-lab/：局部小网格观察 + Gymnasium + Stable-Baselines3/sb3-contrib PPO 版本，用 Recurrent CNN PPO 训练更通用的迷宫策略网络；界面分为训练场和试验场，也包含 macOS 双击启动包装。

快速入口

Q-learning 版本：

cd ai-maze-lab
python main.py --map maps/level_1.txt

PPO 版本：

cd ai-maze-ppo-lab
python train_ppo.py --timesteps 500000 --n-envs 4 --random-maps 200 --ent-coef 0.03 --view-range 3 --view-width 3 --random-rows 11 --random-cols 15 --wall-density 0.12 --trap-density 0.04

PPO 图形界面：

cd ai-maze-ppo-lab
python ppo_app.py

具体安装和运行方式见各子项目内的 README.md。

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
ai-maze-lab		ai-maze-lab
ai-maze-ppo-lab		ai-maze-ppo-lab
.gitignore		.gitignore
README.md		README.md

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Learning Playground

快速入口

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Sunbelt Computer Software

PL/B Language Development and Support

Folders and files

Latest commit

History

Repository files navigation

Deep Learning Playground

快速入口

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages