Sang Hun Kim

AI/Optimization/Scheduling/Etc

Study Log (2021.07)

less than 1 minute read

2021-07-01

S-K RL
- train_FT10_ppo_node_only.py
  - do_simulate_on_aggregated_state()
  - value_loss, action_loss, dist_entropy = agent.fit(eval=0, reward_setting=’utilization’, device=device, return_scaled=False)
  - eval_performance = evaluate_agent_on_aggregated_state(simulator=sim, agent=agent, device=’cpu’, mode=’node_mode’)
  - val_performance = validation(agent, path, mode=’node_mode’)
- pyjssp 버전 구분
  - GNN-MARL Lastest용
  - GNN-MARL Stable용

Template

Twitter Facebook LinkedIn

Comments

You May Also Enjoy

Study Log (2022.09)

less than 1 minute read

Study Log (2022.09)

1 minute read

2022-09-20 모델 성능 개선으로 익히는 강화학습 A-Z Part06. 모델 기반 강화학습 Ch 03. 최적제어와 모델기반 강화학습 07. pytorch 모델 MPC 구...

Study Log (2022.08)

1 minute read

2022-08-31 모델 성능 개선으로 익히는 강화학습 A-Z Part 5. 심층강화학습 Ch 01. 심층강화학습 논문 읽기 ...

Study Log (2022.07)

less than 1 minute read

2022-07-05 모델 성능 개선으로 익히는 강화학습 A-Z Part 5. 심층강화학습 Ch 01. 심층강화학습 논문 읽기 11. Asynchrnous Advantage A...