Spaces:

AI-Secure
/

llm-trustworthy-leaderboard

Running on CPU Upgrade

danielz02 commited on Jan 16, 2024

Commit

7e99ffa

unverified ·

1 Parent(s): 9672344

Add additional information about metrics

Files changed (1) hide show

src/display/about.py CHANGED Viewed

@@ -29,7 +29,7 @@ TITLE = """<h1 align="center" id="space-title">Trustworthy LLM leaderboard</h1>"
 INTRODUCTION_TEXT = """Powered by the DecodingTrust platform, which provides comprehensive safety and trustworthiness
 evaluation for LLMs, this leaderboard is designed to help researchers and practitioners better understand the
 capabilities, limitations, and potential risks of state-of-the-art Large Language Models (LLMs). See our paper for
-details. Access the DecodingTrust platform website [here](https://decodingtrust.github.io/)"""
 # Which evaluations are you running? how can people reproduce what you have?
 LLM_BENCHMARKS_TEXT = f"""
@@ -51,6 +51,8 @@ This project is organized around the following eight primary perspectives of tru
 + Machine Ethics
 + Fairness
 ## Reproducibility
 To reproduce our results, checkout https://github.com/AI-secure/DecodingTrust

 INTRODUCTION_TEXT = """Powered by the DecodingTrust platform, which provides comprehensive safety and trustworthiness
 evaluation for LLMs, this leaderboard is designed to help researchers and practitioners better understand the
 capabilities, limitations, and potential risks of state-of-the-art Large Language Models (LLMs). See our paper for
+details. Access the DecodingTrust platform website [here](https://decodingtrust.github.io/)."""
 # Which evaluations are you running? how can people reproduce what you have?
 LLM_BENCHMARKS_TEXT = f"""
 + Machine Ethics
 + Fairness
+We normalize the score of each perspective as 0-100, and these scores are the higher the better.
 ## Reproducibility
 To reproduce our results, checkout https://github.com/AI-secure/DecodingTrust