A Unity ID allows you to buy and/or subscribe to Unity products and services, shop in the Asset Store and participate in the Unity community.
Separate names with a comma.
Every number type you send to the network will be converted to a float no matter what type you sent it as, neural networks are made of and only...
Buffer sensor observations pass through an attention layer before reaching the policy network so I wouldn't put anything the can't potentially be...
Stacked observations are just all observations from the previous n steps (n being the number of stacked obs you've set) being sent to the model on...
If there's no memory task and no stacked observations then there would likely not be much benefit to giving the agent observations between...
Yup, that's your answer. If you turn off automatic stepping and invoke the EnvironmentStep() method manually, you'll get the behavior you're...
There's probably a group of services you can restart if you don't want to do a full reboot, but I've never been bothered by the issue enough to...
This is an identified issue with how sockets are assigned by linux. ML Agents cant close the open socket connection (for whatever reason, I'm not...
I've done 35 environments and not had any issues so my guess would be something specific to your environment. How many cpu cores do you have?...
I also see you are encoding the goal position directly in the observations and not the enemy's position. You can try a sanity check by encoding...
I don't see the logic for the enemy cube, I'm guessing from your results that it is randomly placed. It seems the random placement doesn't...
MLAgent's implementation of self-play largely follows this openai paper - https://arxiv.org/pdf/1710.03748.pdf Here's the blog post too if you...
Yes your interpretation is correct, a larger buffer_size than team_change would mean the agent would change sides and continue adding experiences...
Yeah stacked observations have never been useful in my experiments. The technique I've previously used to avoid the overhead of a memory task...
Yeah those numbers make sense to me, that is an extremely large network and LSTMs tend to be poor performers. Running VR rendering and a large NN...
You're transforming the position into a local-relative directional vector which does not convey a local-relative position. You probably meant to...
That's your buffer hitting full size and the model weights being updated. When the buffer_size is hit you'll run through a gradient descent for...
Are you doing inference on the gpu or cpu?
AFAIK the stacked vector only contains observations from previous time steps and is fed into the current step along with current observations....
Yes but not through the rewards, just use statsrecorder when the reward is assigned -...
The agents needs to know everything you would need to know in order perform the same task yourself. This depends entirely on the complexity of...