NimaBoscarino commited on
Commit
a16e6a1
1 Parent(s): 4b07be6

Rename charters/bigscience.txt to charters/bigscience.md

Browse files
charters/{bigscience.txt → bigscience.md} RENAMED
@@ -1,51 +1,80 @@
1
- BigScience Ethical Charter
2
 
3
- Preamble
4
- Introduction
 
 
5
  The development and applications of research in NLP are advancing rapidly, with direct real-world consequences. As a result, possible societal benefits exist, but related risks also increase considerably. Aware of these potential challenges, BigScience drafted an ethical charter formalizing its core values and how they are articulated.
6
- Scope
 
 
7
  The scope of this ethical charter is threefold:
8
- 1. To establish the core values of BigScience in order to allow its contributors to commit to them, both individually and collectively.
9
- 2. To serve as a pivot for drafting BigScience documents intended to frame specific issues ethically and legally.
10
- 3. To enable Big Science to promote values within the research community through scientific publication, dissemination, and popularization.
11
- People concerned
 
 
 
12
  The members of BigScience hold the values stated in this ethical charter. As ethical guidelines, they apply to any activities and documents governing a specific aspect of the project.
13
- Limitations of this ethical charter
 
 
14
  Given the breadth of the scope of BigScience and thriving to seek progress in NLP research, we recognize that not all scientific research will have a positive impact on society. It is difficult to predict all the uses the scientific community will make of our artifacts. Therefore, we defer to our license and model card for further information.
15
- Relevance over time
 
 
16
  We interpret ethics as an ongoing process, not a time-fixed code with universal validity. For these reasons, when needed, BigScience will review, update and adapt the ethical charter from time to time.
17
- Legitimacy
 
 
18
  The elaboration of this ethical charter results from a bottom-up collaboration that tried to collect all the different thoughts and opinions of BigScience participants. Then, experts in applied ethics and law did a final revision. We aim for consensus: if any BigScience member individually does not feel aligned with one or more of the values inscribed in this ethical charter, the member will have the right to object at appropriate times and places to that end.
19
- Ethical approach
 
 
20
  We assume the basis of value pluralism within our community, and we cherish it. That is why the ethical notion of harmony (和) in Confucian moral theory seemed to be the appropriate approach for such an international and interdisciplinary scientific community as BigScience. “Harmony is by its very nature relational. It presupposes the coexistence of multiple parties; […] harmony is always contextual; epistemologically it calls for a holistic approach[1].”
21
- Ethical compliance
 
 
22
  We distinguish two levels of ethical compliance operating within the charter: individual and collective. We are held accountable for ethical compliance both as individual BigScience contributors and as a collective research entity.
23
- Other documents articulation
 
 
24
  Given the pivotal function of this ethical charter, we will refer to the other BigScience documents intended to govern specific issues directly where needed in the relevant paragraph.
25
- BigScience Values
 
26
 
27
  We apply the distinction between intrinsic and extrinsic values in the structure of this ethical charter. The former refers to “what is valuable for its own sake, in itself […], as an end[2]”; the latter is characterized as “what is valuable as a means, or for something else’s work[3]”. We distinguish between intrinsic and extrinsic values because the latter can vary more efficiently to achieve the former goals: the latter are substitutable. This structure will help the reader understand how the two types of values combine and allow the BigScience community to adapt this ethical charter over time.
28
- Intrinsic Values
 
29
 
30
- * Inclusivity
 
31
  We work to ensure welcomeness in the process and equal access to the BigScience artifacts without any form of discrimination (e.g., religion, ethnicity, sexual orientation, gender, political orientation, age, ability). We believe that “inclusivity” is not just non-discrimination, but also a sense of belonging.
32
- * Diversity
 
 
33
  The BigScience community has over 900 researchers and communities (see some listed collaborations here) from 50 countries covering over 20 languages. The collaborators bring together their expertise from various sources of knowledge, scientific fields, and institutional contexts (academia, industry, research institutions, etc).
34
- * Reproducibility
 
 
35
  The BigScience project was born with the clear intention of being a research initiative devoted to open science. BigScience aims at ensuring the reproduction of the research experiments and scientific conclusions developed under its aegis.
36
- * Openness
37
- Openness takes two dimensions, one focused on the process, and the other focused on its result. BigScience aims to be an open science framework whereby NLP, and broadly, AI-related researchers from all over the world can contribute and join the initiative.
38
- With regards to the results of our research, such as the future Large Language Model, these are created by the research community to the research community, and therefore will be released on an open basis, taking into account the risks derived from the use of the model.
39
- * Responsibility
40
- Each contributor has both an individual and a collective responsibility for their work within the BigScience project. This responsibility is both social and environmental. BigScience intends to positively impact stakeholders through its artifacts regarding the former. Concerning the latter, BigScience is committed to developing tools to monitor and lower its artifacts’ carbon footprint and energy consumption.
41
- Moreover, other tools such as an open legal playbook for NLP researchers guiding them regarding the use and respect of IP and privacy rights also seek to promote responsibility around the scientific community.
42
- Extrinsic Values
43
- * Accessibility
44
- As a means to achieve openness
45
 
 
46
 
47
- BigScience puts in its best efforts to make our research and technological outputs easily interpretable and explained to the wider public, outside the scientific community, especially to communities that have participated in data sharing.
 
 
 
 
48
 
 
 
 
 
 
 
 
49
 
50
  Currently instrumentalized in:
51
  * no-code tools for exploring the catalog, trained models, etc.
@@ -53,31 +82,29 @@ Currently instrumentalized in:
53
  * journalism (articles published on the project)
54
  * linked to multidisciplinarity - legal hackathon as a step toward “non-technical” presentation
55
 
56
- * Transparency
57
- As a means to achieve reproducibility
58
 
 
59
 
60
  BigScience work is actively promoted at various conferences, webinars, academic research, and scientific popularization so others can see our work.
61
- We have set up a management framework to oversee the use of BigScience models, datasets, and tools, e.g. through working groups.
62
- All BigScience internal meetings and work progress are publicly shared within the Community, e.g. through public episodes.
63
- We are committed to building tools to interpret, monitor, explain, and make intelligible the artifacts developed by BigScience.
64
- * Interdisciplinarity
65
- As a means to achieve inclusivity
66
-
67
-
68
- We are constantly building bridges among computer science, linguistics, law, sociology, philosophy, and other relevant disciplines in order to adopt a holistic approach in developing BigScience artifacts.
69
- * Multilingualism
70
- As a means to achieve diversity
71
 
 
72
 
73
- By having a system that is multilingual from its conception, with the immediate goal of covering the 20 most spoken languages in the world and a broad reach to include up to hundreds based on collaborations with native speakers, we aim to reduce existing disparities in language and foster a more equitable distribution of the benefits of our artifacts.
74
 
 
75
 
 
76
 
 
77
 
 
78
 
 
79
 
 
80
 
 
81
 
82
  ________________
83
  [1] Chenyang Li, “The Confucian Ideal of Harmony”, in Philosophy East and West, vol. 56, no. 4, 2006, p. 589.
 
1
+ # BigScience Ethical Charter
2
 
3
+ ## Preamble
4
+
5
+ ### Introduction
6
+
7
  The development and applications of research in NLP are advancing rapidly, with direct real-world consequences. As a result, possible societal benefits exist, but related risks also increase considerably. Aware of these potential challenges, BigScience drafted an ethical charter formalizing its core values and how they are articulated.
8
+
9
+ ### Scope
10
+
11
  The scope of this ethical charter is threefold:
12
+
13
+ 1. To establish the core values of BigScience in order to allow its contributors to commit to them, both individually and collectively.
14
+ 2. To serve as a pivot for drafting BigScience documents intended to frame specific issues ethically and legally.
15
+ 3. To enable Big Science to promote values within the research community through scientific publication, dissemination, and popularization.
16
+
17
+ ### People concerned
18
+
19
  The members of BigScience hold the values stated in this ethical charter. As ethical guidelines, they apply to any activities and documents governing a specific aspect of the project.
20
+
21
+ ### Limitations of this ethical charter
22
+
23
  Given the breadth of the scope of BigScience and thriving to seek progress in NLP research, we recognize that not all scientific research will have a positive impact on society. It is difficult to predict all the uses the scientific community will make of our artifacts. Therefore, we defer to our license and model card for further information.
24
+
25
+ ### Relevance over time
26
+
27
  We interpret ethics as an ongoing process, not a time-fixed code with universal validity. For these reasons, when needed, BigScience will review, update and adapt the ethical charter from time to time.
28
+
29
+ ### Legitimacy
30
+
31
  The elaboration of this ethical charter results from a bottom-up collaboration that tried to collect all the different thoughts and opinions of BigScience participants. Then, experts in applied ethics and law did a final revision. We aim for consensus: if any BigScience member individually does not feel aligned with one or more of the values inscribed in this ethical charter, the member will have the right to object at appropriate times and places to that end.
32
+
33
+ ### Ethical approach
34
+
35
  We assume the basis of value pluralism within our community, and we cherish it. That is why the ethical notion of harmony (和) in Confucian moral theory seemed to be the appropriate approach for such an international and interdisciplinary scientific community as BigScience. “Harmony is by its very nature relational. It presupposes the coexistence of multiple parties; […] harmony is always contextual; epistemologically it calls for a holistic approach[1].”
36
+
37
+ ### Ethical compliance
38
+
39
  We distinguish two levels of ethical compliance operating within the charter: individual and collective. We are held accountable for ethical compliance both as individual BigScience contributors and as a collective research entity.
40
+
41
+ ### Other documents articulation
42
+
43
  Given the pivotal function of this ethical charter, we will refer to the other BigScience documents intended to govern specific issues directly where needed in the relevant paragraph.
44
+
45
+ ## BigScience Values
46
 
47
  We apply the distinction between intrinsic and extrinsic values in the structure of this ethical charter. The former refers to “what is valuable for its own sake, in itself […], as an end[2]”; the latter is characterized as “what is valuable as a means, or for something else’s work[3]”. We distinguish between intrinsic and extrinsic values because the latter can vary more efficiently to achieve the former goals: the latter are substitutable. This structure will help the reader understand how the two types of values combine and allow the BigScience community to adapt this ethical charter over time.
48
+
49
+ ### Intrinsic Values
50
 
51
+ #### Inclusivity
52
+
53
  We work to ensure welcomeness in the process and equal access to the BigScience artifacts without any form of discrimination (e.g., religion, ethnicity, sexual orientation, gender, political orientation, age, ability). We believe that “inclusivity” is not just non-discrimination, but also a sense of belonging.
54
+
55
+ #### Diversity
56
+
57
  The BigScience community has over 900 researchers and communities (see some listed collaborations here) from 50 countries covering over 20 languages. The collaborators bring together their expertise from various sources of knowledge, scientific fields, and institutional contexts (academia, industry, research institutions, etc).
58
+
59
+ #### Reproducibility
60
+
61
  The BigScience project was born with the clear intention of being a research initiative devoted to open science. BigScience aims at ensuring the reproduction of the research experiments and scientific conclusions developed under its aegis.
 
 
 
 
 
 
 
 
 
62
 
63
+ #### Openness
64
 
65
+ Openness takes two dimensions, one focused on the process, and the other focused on its result. BigScience aims to be an open science framework whereby NLP, and broadly, AI-related researchers from all over the world can contribute and join the initiative. With regards to the results of our research, such as the future Large Language Model, these are created by the research community to the research community, and therefore will be released on an open basis, taking into account the risks derived from the use of the model.
66
+
67
+ #### Responsibility
68
+
69
+ Each contributor has both an individual and a collective responsibility for their work within the BigScience project. This responsibility is both social and environmental. BigScience intends to positively impact stakeholders through its artifacts regarding the former. Concerning the latter, BigScience is committed to developing tools to monitor and lower its artifacts’ carbon footprint and energy consumption. Moreover, other tools such as an open legal playbook for NLP researchers guiding them regarding the use and respect of IP and privacy rights also seek to promote responsibility around the scientific community.
70
 
71
+ ### Extrinsic Values
72
+
73
+ #### Accessibility
74
+
75
+ As a means to achieve openness.
76
+
77
+ BigScience puts in its best efforts to make our research and technological outputs easily interpretable and explained to the wider public, outside the scientific community, especially to communities that have participated in data sharing.
78
 
79
  Currently instrumentalized in:
80
  * no-code tools for exploring the catalog, trained models, etc.
 
82
  * journalism (articles published on the project)
83
  * linked to multidisciplinarity - legal hackathon as a step toward “non-technical” presentation
84
 
85
+ ##### Transparency
 
86
 
87
+ As a means to achieve reproducibility.
88
 
89
  BigScience work is actively promoted at various conferences, webinars, academic research, and scientific popularization so others can see our work.
 
 
 
 
 
 
 
 
 
 
90
 
91
+ We have set up a management framework to oversee the use of BigScience models, datasets, and tools, e.g. through working groups.
92
 
93
+ All BigScience internal meetings and work progress are publicly shared within the Community, e.g. through public episodes.
94
 
95
+ We are committed to building tools to interpret, monitor, explain, and make intelligible the artifacts developed by BigScience.
96
 
97
+ #### Interdisciplinarity
98
 
99
+ As a means to achieve inclusivity.
100
 
101
+ We are constantly building bridges among computer science, linguistics, law, sociology, philosophy, and other relevant disciplines in order to adopt a holistic approach in developing BigScience artifacts.
102
 
103
+ #### Multilingualism
104
 
105
+ As a means to achieve diversity.
106
 
107
+ By having a system that is multilingual from its conception, with the immediate goal of covering the 20 most spoken languages in the world and a broad reach to include up to hundreds based on collaborations with native speakers, we aim to reduce existing disparities in language and foster a more equitable distribution of the benefits of our artifacts.
108
 
109
  ________________
110
  [1] Chenyang Li, “The Confucian Ideal of Harmony”, in Philosophy East and West, vol. 56, no. 4, 2006, p. 589.