Mariusz Kossakowski commited on
Commit
a619100
·
1 Parent(s): acb3b1d

Adapt app to older Python version

Browse files
Files changed (1) hide show
  1. app.py +35 -36
app.py CHANGED
@@ -39,43 +39,42 @@ selected_dataset = st.sidebar.selectbox(
39
 
40
 
41
  def load_hf_dataset():
42
- match selected_dataset:
43
- case "clarin-pl/polemo2-official":
44
- data = load_dataset("clarin-pl/polemo2-official")
45
- DATA_DICT = {
46
- "train": data["train"].to_pandas(),
47
- "validation": data["validation"].to_pandas(),
48
- "test": data["test"].to_pandas(),
49
- }
50
- DATA_DESCRIPTION = """The PolEmo2.0 is a dataset of online consumer reviews from four domains: medicine,
51
- hotels, products, and university. It is human-annotated on a level of full reviews and individual
52
- sentences. Current version (PolEmo 2.0) contains 8,216 reviews having 57,466 sentences. Each text and
53
- sentence was manually annotated with sentiment in the 2+1 scheme, which gives a total of 197,
54
- 046 annotations. About 85% of the reviews are from the medicine and hotel domains. Each review is
55
- annotated with four labels: positive, negative, neutral, or ambiguous. """
56
- case "laugustyniak/abusive-clauses-pl":
57
- DATA_DICT = load_data()
58
- DATA_DESCRIPTION = """
59
- ''I have read and agree to the terms and conditions'' is one of the biggest lies on the Internet.
60
- Consumers rarely read the contracts they are required to accept. We conclude agreements over the Internet daily.
61
- But do we know the content of these agreements? Do we check potential unfair statements? On the Internet,
62
- we probably skip most of the Terms and Conditions. However, we must remember that we have concluded many more
63
- contracts. Imagine that we want to buy a house, a car, send our kids to the nursery, open a bank account,
64
- or many more. In all these situations, you will need to conclude the contract, but there is a high probability
65
- that you will not read the entire agreement with proper understanding. European consumer law aims to prevent
66
- businesses from using so-called ''unfair contractual terms'' in their unilaterally drafted contracts,
67
- requiring consumers to accept.
 
 
 
68
 
69
- Our dataset treats ''unfair contractual term'' as the equivalent of an abusive clause. It could be defined as a
70
- clause that is unilaterally imposed by one of the contract's parties, unequally affecting the other, or creating a
71
- situation of imbalance between the duties and rights of the parties.
72
-
73
- On the EU and at the national such as the Polish levels, agencies cannot check possible agreements by hand. Hence,
74
- we took the first step to evaluate the possibility of accelerating this process. We created a dataset and machine
75
- learning models to automate potentially abusive clauses detection partially. Consumer protection organizations and
76
- agencies can use these resources to make their work more effective and efficient. Moreover, consumers can automatically
77
- analyze contracts and understand what they agree upon.
78
- """
79
  return DATA_DICT, DATA_DESCRIPTION
80
 
81
 
 
39
 
40
 
41
  def load_hf_dataset():
42
+ if selected_dataset == "clarin-pl/polemo2-official":
43
+ data = load_dataset("clarin-pl/polemo2-official")
44
+ DATA_DICT = {
45
+ "train": data["train"].to_pandas(),
46
+ "validation": data["validation"].to_pandas(),
47
+ "test": data["test"].to_pandas(),
48
+ }
49
+ DATA_DESCRIPTION = """The PolEmo2.0 is a dataset of online consumer reviews from four domains: medicine,
50
+ hotels, products, and university. It is human-annotated on a level of full reviews and individual
51
+ sentences. Current version (PolEmo 2.0) contains 8,216 reviews having 57,466 sentences. Each text and
52
+ sentence was manually annotated with sentiment in the 2+1 scheme, which gives a total of 197,
53
+ 046 annotations. About 85% of the reviews are from the medicine and hotel domains. Each review is
54
+ annotated with four labels: positive, negative, neutral, or ambiguous. """
55
+ elif selected_dataset == "laugustyniak/abusive-clauses-pl":
56
+ DATA_DICT = load_data()
57
+ DATA_DESCRIPTION = """
58
+ ''I have read and agree to the terms and conditions'' is one of the biggest lies on the Internet.
59
+ Consumers rarely read the contracts they are required to accept. We conclude agreements over the Internet daily.
60
+ But do we know the content of these agreements? Do we check potential unfair statements? On the Internet,
61
+ we probably skip most of the Terms and Conditions. However, we must remember that we have concluded many more
62
+ contracts. Imagine that we want to buy a house, a car, send our kids to the nursery, open a bank account,
63
+ or many more. In all these situations, you will need to conclude the contract, but there is a high probability
64
+ that you will not read the entire agreement with proper understanding. European consumer law aims to prevent
65
+ businesses from using so-called ''unfair contractual terms'' in their unilaterally drafted contracts,
66
+ requiring consumers to accept.
67
+
68
+ Our dataset treats ''unfair contractual term'' as the equivalent of an abusive clause. It could be defined as a
69
+ clause that is unilaterally imposed by one of the contract's parties, unequally affecting the other, or creating a
70
+ situation of imbalance between the duties and rights of the parties.
71
 
72
+ On the EU and at the national such as the Polish levels, agencies cannot check possible agreements by hand. Hence,
73
+ we took the first step to evaluate the possibility of accelerating this process. We created a dataset and machine
74
+ learning models to automate potentially abusive clauses detection partially. Consumer protection organizations and
75
+ agencies can use these resources to make their work more effective and efficient. Moreover, consumers can automatically
76
+ analyze contracts and understand what they agree upon.
77
+ """
 
 
 
 
78
  return DATA_DICT, DATA_DESCRIPTION
79
 
80