We should not draw conclusions from statistically insignificant results, and people don’t have an intuitive understanding of statistical significance, so we need to rely on objective numbers such as the p-value to guide our decisions. I’m personally always surprised by how many users need to join an experiment before the results become statistically significant.
The p-value can be computed from the numbers displayed on the experiment page, but since everyone should check it, it would be better to display it by default.
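To illustrate the manual computation, here is a minimal sketch. It assumes the experiment page shows, for each variant, the number of users and the number of conversions, and it uses a pooled two-proportion z-test, which is only one common choice and not necessarily what any given tool uses. The function name and the example counts are hypothetical.

```python
import math


def two_proportion_p_value(conv_a, users_a, conv_b, users_b):
    """Two-sided p-value for the difference between two conversion rates.

    Assumption: a pooled two-proportion z-test is appropriate, i.e. each
    user either converts or not and both groups are reasonably large.
    """
    rate_a = conv_a / users_a
    rate_b = conv_b / users_b
    # Conversion rate under the null hypothesis that both variants are equal.
    pooled = (conv_a + conv_b) / (users_a + users_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / users_a + 1 / users_b))
    if std_err == 0:
        # No variation at all (everyone or no one converted): no evidence of a difference.
        return 1.0
    z = (rate_b - rate_a) / std_err
    # Two-sided tail probability of the standard normal distribution.
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical counts: the same 10% vs 12% conversion rates at two sample sizes.
print(two_proportion_p_value(100, 1000, 120, 1000))
print(two_proportion_p_value(1000, 10000, 1200, 10000))
```

With these made-up numbers, the 10% vs 12% difference stays above the usual 0.05 threshold at 1,000 users per variant and only falls below it at 10,000 users per variant, which shows how many users an experiment can need.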
Firebase already displays the p-value by default in its experiment module: