How to get the total reward point

#1
by MonteXiaofeng - opened

Thanks for this great work. Tested with the example provided and the output is as follows. Is the average used as the total score (I found that the scores of each attribute vary greatly, such as -17.8157 42.4842)? What is the range of scores for each attribute and how are these attributes weighted to get the total score?

reward_quantiles: tensor([[ -5.9229, -17.8157, -7.9583, -1.7157, 6.2691, 11.7173, 3.8997,
-1.2744, 0.7031, 6.7963, 28.9960, 41.4233, 30.3169, 32.1501,
38.3505, 42.4842, 38.2890, 34.5722, 43.0714]])

Sign up or log in to comment