This repository has been archived by the owner on Aug 15, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 4
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
カリキュラム学習の導入とagilerl_trainにaction maskを追加
- Loading branch information
1 parent
b248a9c
commit d8b05a2
Showing
7 changed files
with
1,673 additions
and
120 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
--- | ||
# AgileRL Soccer Lesson 1 | ||
|
||
opponent: self | ||
opponent_pool_size: 1 # Size of opponent pool for self-play | ||
opponent_upgrade: 6000 # Epoch frequency to update opponent pool | ||
#eval_opponent: strong # 'random', 'weak' or 'strong' | ||
pretrained_path: models/DQN/lesson3_trained_agent.pt # Path to pretrained model weights | ||
save_path: models/DQN/lesson4_trained_agent.pt # Path to save trained model | ||
max_train_episodes: 1000 # Maximum number of training episodes in environment | ||
|
||
## Game specific: | ||
buffer_warm_up: false # Fill replay buffer with random experiences | ||
warm_up_opponent: # Difficulty level of warm up experiences | ||
agent_warm_up: 0 # Number of epochs to warm up agent by training on random experiences | ||
rewards: # Rewards for different outcomes | ||
score_goal: 1000 | ||
approach_ball: 0.2 | ||
dribble_or_kick: 10 | ||
off_field: -1.0 | ||
fall_down: -10 | ||
lose_point: -1000 | ||
play_continues: 0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
--- | ||
# AgileRL Soccer Lesson 1 | ||
|
||
opponent: self | ||
opponent_pool_size: 1 # Size of opponent pool for self-play | ||
opponent_upgrade: 6000 # Epoch frequency to update opponent pool | ||
#eval_opponent: strong # 'random', 'weak' or 'strong' | ||
pretrained_path: models/DQN/lesson3_trained_agent.pt # Path to pretrained model weights | ||
save_path: models/DQN/lesson4_trained_agent.pt # Path to save trained model | ||
max_train_episodes: 2000 # Maximum number of training episodes in environment | ||
|
||
## Game specific: | ||
buffer_warm_up: false # Fill replay buffer with random experiences | ||
warm_up_opponent: # Difficulty level of warm up experiences | ||
agent_warm_up: 0 # Number of epochs to warm up agent by training on random experiences | ||
rewards: # Rewards for different outcomes | ||
score_goal: 1000 | ||
approach_ball: 0.1 | ||
dribble_or_kick: 10 | ||
off_field: -1.0 | ||
fall_down: -10 | ||
lose_point: -1000 | ||
play_continues: 0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
--- | ||
# AgileRL Soccer Lesson 1 | ||
|
||
opponent: self | ||
opponent_pool_size: 1 # Size of opponent pool for self-play | ||
opponent_upgrade: 6000 # Epoch frequency to update opponent pool | ||
#eval_opponent: strong # 'random', 'weak' or 'strong' | ||
pretrained_path: models/DQN/lesson3_trained_agent.pt # Path to pretrained model weights | ||
save_path: models/DQN/lesson4_trained_agent.pt # Path to save trained model | ||
max_train_episodes: 7000 # Maximum number of training episodes in environment | ||
|
||
## Game specific: | ||
buffer_warm_up: false # Fill replay buffer with random experiences | ||
warm_up_opponent: # Difficulty level of warm up experiences | ||
agent_warm_up: 0 # Number of epochs to warm up agent by training on random experiences | ||
rewards: # Rewards for different outcomes | ||
score_goal: 1000 | ||
approach_ball: 0 | ||
dribble_or_kick: 10 | ||
off_field: -1.0 | ||
fall_down: -10 | ||
lose_point: -1000 | ||
play_continues: 0 |
Oops, something went wrong.