Hi, I’m from a university student RL team and we are going to make a new multi-agent cooperative-adversarial environment for learning optimal strategies for short-term scheduling of incoming bids (surgical procedures in our case). Please take a look at the current description of the environment. What is missing? Maybe there is something superfluous? Any experience you share with us would be appreciated! Of course, the results will be available under MIT license for the development of Julia’s MARL community.