--- title: Wilcoxon emoji: 🤗 colorFrom: blue colorTo: green sdk: gradio sdk_version: 3.0.2 app_file: app.py pinned: false tags: - evaluate - comparison description: >- Wilcoxon's test is a signed-rank test for comparing paired samples. --- # Comparison Card for Wilcoxon ## Comparison description Wilcoxon's test is a non-parametric signed-rank test that tests whether the distribution of the differences is symmetric about zero. It can be used to compare the predictions of two models. ## How to use The Wilcoxon comparison is used to analyze paired ordinal data. ## Inputs Its arguments are: `predictions1`: a list of predictions from the first model. `predictions2`: a list of predictions from the second model. ## Output values The Wilcoxon comparison outputs two things: `stat`: The Wilcoxon statistic. `p`: The p value. ## Examples Example comparison: ```python wilcoxon = evaluate.load("wilcoxon") results = wilcoxon.compute(predictions1=[-7, 123.45, 43, 4.91, 5], predictions2=[1337.12, -9.74, 1, 2, 3.21]) print(results) {'stat': 5.0, 'p': 0.625} ``` ## Limitations and bias The Wilcoxon test is a non-parametric test, so it has relatively few assumptions (basically only that the observations are independent). It should be used to analyze paired ordinal data only. ## Citations ```bibtex @incollection{wilcoxon1992individual, title={Individual comparisons by ranking methods}, author={Wilcoxon, Frank}, booktitle={Breakthroughs in statistics}, pages={196--202}, year={1992}, publisher={Springer} } ```