Use this skill for reinforcement learning tasks including training RL agents (PPO, SAC, DQN, TD3, DDPG, A2C, etc.), creating custom Gym environments, implementing callbacks for monitoring and control,
sources/ricable/claude-scientific-skills/scientific-skills/stable-baselines3/SKILL.md(main)