echarlaix HF staff committed on
Commit
07fe2fe
1 Parent(s): 529d42b

Adding model, graphs and metadata.

This view is limited to 50 files because it contains too many changes.
Files changed (50)
  1. README.md +45 -0
  2. config.json +161 -0
  3. eval/eval_results.json +8 -0
  4. eval/sparsity_report.json +1 -0
  5. model_card/density_info.js +174 -0
  6. model_card/images/model_decoder_layers_0_encoder_attn_k_proj.png +0 -0
  7. model_card/images/model_decoder_layers_0_encoder_attn_out_proj.png +0 -0
  8. model_card/images/model_decoder_layers_0_encoder_attn_q_proj.png +0 -0
  9. model_card/images/model_decoder_layers_0_encoder_attn_v_proj.png +0 -0
  10. model_card/images/model_decoder_layers_0_fc1.png +0 -0
  11. model_card/images/model_decoder_layers_0_fc2.png +0 -0
  12. model_card/images/model_decoder_layers_0_self_attn_k_proj.png +0 -0
  13. model_card/images/model_decoder_layers_0_self_attn_out_proj.png +0 -0
  14. model_card/images/model_decoder_layers_0_self_attn_q_proj.png +0 -0
  15. model_card/images/model_decoder_layers_0_self_attn_v_proj.png +0 -0
  16. model_card/images/model_decoder_layers_1_encoder_attn_k_proj.png +0 -0
  17. model_card/images/model_decoder_layers_1_encoder_attn_out_proj.png +0 -0
  18. model_card/images/model_decoder_layers_1_encoder_attn_q_proj.png +0 -0
  19. model_card/images/model_decoder_layers_1_encoder_attn_v_proj.png +0 -0
  20. model_card/images/model_decoder_layers_1_fc1.png +0 -0
  21. model_card/images/model_decoder_layers_1_fc2.png +0 -0
  22. model_card/images/model_decoder_layers_1_self_attn_k_proj.png +0 -0
  23. model_card/images/model_decoder_layers_1_self_attn_out_proj.png +0 -0
  24. model_card/images/model_decoder_layers_1_self_attn_q_proj.png +0 -0
  25. model_card/images/model_decoder_layers_1_self_attn_v_proj.png +0 -0
  26. model_card/images/model_decoder_layers_2_encoder_attn_k_proj.png +0 -0
  27. model_card/images/model_decoder_layers_2_encoder_attn_out_proj.png +0 -0
  28. model_card/images/model_decoder_layers_2_encoder_attn_q_proj.png +0 -0
  29. model_card/images/model_decoder_layers_2_encoder_attn_v_proj.png +0 -0
  30. model_card/images/model_decoder_layers_2_fc1.png +0 -0
  31. model_card/images/model_decoder_layers_2_fc2.png +0 -0
  32. model_card/images/model_decoder_layers_2_self_attn_k_proj.png +0 -0
  33. model_card/images/model_decoder_layers_2_self_attn_out_proj.png +0 -0
  34. model_card/images/model_decoder_layers_2_self_attn_q_proj.png +0 -0
  35. model_card/images/model_decoder_layers_2_self_attn_v_proj.png +0 -0
  36. model_card/images/model_decoder_layers_3_encoder_attn_k_proj.png +0 -0
  37. model_card/images/model_decoder_layers_3_encoder_attn_out_proj.png +0 -0
  38. model_card/images/model_decoder_layers_3_encoder_attn_q_proj.png +0 -0
  39. model_card/images/model_decoder_layers_3_encoder_attn_v_proj.png +0 -0
  40. model_card/images/model_decoder_layers_3_fc1.png +0 -0
  41. model_card/images/model_decoder_layers_3_fc2.png +0 -0
  42. model_card/images/model_decoder_layers_3_self_attn_k_proj.png +0 -0
  43. model_card/images/model_decoder_layers_3_self_attn_out_proj.png +0 -0
  44. model_card/images/model_decoder_layers_3_self_attn_q_proj.png +0 -0
  45. model_card/images/model_decoder_layers_3_self_attn_v_proj.png +0 -0
  46. model_card/images/model_decoder_layers_4_encoder_attn_k_proj.png +0 -0
  47. model_card/images/model_decoder_layers_4_encoder_attn_out_proj.png +0 -0
  48. model_card/images/model_decoder_layers_4_encoder_attn_q_proj.png +0 -0
  49. model_card/images/model_decoder_layers_4_encoder_attn_v_proj.png +0 -0
  50. model_card/images/model_decoder_layers_4_fc1.png +0 -0
README.md ADDED
@@ -0,0 +1,45 @@
+ ---
+ language: en
+ license: apache-2.0
+ tags:
+ - summarization
+ datasets:
+ - cnn_dailymail
+ metrics:
+ - R1
+ - R2
+ - RL
+ ---
+
+ ## facebook/bart-base model fine-tuned on CNN/DailyMail
+
+ This model was created using the [nn_pruning](https://github.com/huggingface/nn_pruning) Python library: the linear layers contain **23%** of the original weights.
+
+
+
+ The model contains **45%** of the original weights **overall** (the embeddings account for a significant part of the model, and they are not pruned by this method).
+
+ <div class="graph"><script src="/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/density_info.js" id="4348cd46-05bd-4e27-b565-6693f9e0b03e"></script></div>
+
+
+ ## Fine-Pruning details
+ This model was fine-tuned from the Hugging Face [facebook/bart-base](https://huggingface.co/facebook/bart-base) checkpoint.
+ A side effect of block pruning is that some attention heads are removed entirely: 61 of the 216 heads (28.2%) were removed.
+
+ <div class="graph"><script src="/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/pruning_info.js" id="733e6c04-6c58-482f-89b8-a196de6b6550"></script></div>
+
+ ## Details of the CNN/DailyMail dataset
+
+ | Dataset       | Split | # samples |
+ | ------------- | ----- | --------- |
+ | CNN/DailyMail | train | 287K      |
+ | CNN/DailyMail | eval  | 13K       |
+
+ ### Results
+
+
+ | Metric      | Value     |
+ | ----------- | --------- |
+ | **Rouge 1** | **41.43** |
+ | **Rouge 2** | **18.72** |
+ | **Rouge L** | **38.35** |
config.json ADDED
@@ -0,0 +1,161 @@
+ {
+   "_name_or_path": "/tmp/tmpdmtb8m2s",
+   "activation_dropout": 0.1,
+   "activation_function": "gelu",
+   "add_bias_logits": false,
+   "add_final_layer_norm": false,
+   "architectures": [
+     "BartForConditionalGeneration"
+   ],
+   "attention_dropout": 0.1,
+   "bos_token_id": 0,
+   "classif_dropout": 0.1,
+   "classifier_dropout": 0.0,
+   "d_model": 768,
+   "decoder_attention_heads": 12,
+   "decoder_ffn_dim": 3072,
+   "decoder_layerdrop": 0.0,
+   "decoder_layers": 6,
+   "decoder_start_token_id": 2,
+   "dropout": 0.1,
+   "early_stopping": true,
+   "encoder_attention_heads": 12,
+   "encoder_ffn_dim": 3072,
+   "encoder_layerdrop": 0.0,
+   "encoder_layers": 6,
+   "eos_token_id": 2,
+   "forced_eos_token_id": 2,
+   "gradient_checkpointing": false,
+   "heads_count": 216,
+   "heads_to_prune": {
+     "0": [
+       8,
+       10
+     ],
+     "1": [
+       0,
+       5,
+       6
+     ],
+     "10": [
+       3,
+       12,
+       14,
+       16,
+       18,
+       20,
+       21
+     ],
+     "11": [
+       10
+     ],
+     "2": [
+       4,
+       6,
+       9
+     ],
+     "3": [
+       0,
+       1,
+       2,
+       4,
+       6,
+       7,
+       8,
+       9,
+       11
+     ],
+     "4": [
+       1,
+       2,
+       5,
+       6,
+       9,
+       10,
+       11
+     ],
+     "5": [
+       2,
+       3,
+       4,
+       5,
+       6,
+       10
+     ],
+     "6": [
+       0,
+       12,
+       13,
+       14
+     ],
+     "7": [
+       2,
+       11,
+       16,
+       17,
+       18,
+       19,
+       23
+     ],
+     "8": [
+       2,
+       14,
+       16,
+       21
+     ],
+     "9": [
+       3,
+       12,
+       13,
+       15,
+       16,
+       18,
+       21,
+       22
+     ]
+   },
+   "id2label": {
+     "0": "LABEL_0",
+     "1": "LABEL_1",
+     "2": "LABEL_2"
+   },
+   "init_std": 0.02,
+   "is_encoder_decoder": true,
+   "label2id": {
+     "LABEL_0": 0,
+     "LABEL_1": 1,
+     "LABEL_2": 2
+   },
+   "max_position_embeddings": 1024,
+   "model_type": "bart",
+   "no_repeat_ngram_size": 3,
+   "normalize_before": false,
+   "normalize_embedding": true,
+   "num_beams": 4,
+   "num_hidden_layers": 6,
+   "pad_token_id": 1,
+   "scale_embedding": false,
+   "task_specific_params": {
+     "summarization": {
+       "length_penalty": 1.0,
+       "max_length": 128,
+       "min_length": 12,
+       "num_beams": 4
+     },
+     "summarization_cnn": {
+       "length_penalty": 2.0,
+       "max_length": 142,
+       "min_length": 56,
+       "num_beams": 4
+     },
+     "summarization_xsum": {
+       "length_penalty": 1.0,
+       "max_length": 62,
+       "min_length": 11,
+       "num_beams": 6
+     }
+   },
+   "transformers_version": "4.4.1",
+   "use_cache": true,
+   "vocab_size": 50265
+ }
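Reviewer note: the README claims that 61 of 216 heads (28.2%) were removed, and this can be cross-checked against the `heads_to_prune` and `heads_count` fields in this config. The following is a minimal Python sketch with the values copied by hand from the diff. Treating each key as one pruned-head group is an assumption about how nn_pruning lays out this field, and the variable names are illustrative only.

```python
# Values copied from the "heads_to_prune" field of config.json in this commit.
# Assumption: each key identifies one group of attention heads and each list
# holds the indices of the heads pruned in that group.
heads_to_prune = {
    "0": [8, 10],
    "1": [0, 5, 6],
    "2": [4, 6, 9],
    "3": [0, 1, 2, 4, 6, 7, 8, 9, 11],
    "4": [1, 2, 5, 6, 9, 10, 11],
    "5": [2, 3, 4, 5, 6, 10],
    "6": [0, 12, 13, 14],
    "7": [2, 11, 16, 17, 18, 19, 23],
    "8": [2, 14, 16, 21],
    "9": [3, 12, 13, 15, 16, 18, 21, 22],
    "10": [3, 12, 14, 16, 18, 20, 21],
    "11": [10],
}
heads_count = 216  # "heads_count" field of the same config

# Total number of pruned heads and its share of all heads.
pruned_heads = sum(len(indices) for indices in heads_to_prune.values())
pruned_percent = 100.0 * pruned_heads / heads_count

print(f"{pruned_heads} of {heads_count} heads pruned ({pruned_percent:.1f}%)")
```

The count and percentage agree with the figures quoted in README.md.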
eval/eval_results.json ADDED
@@ -0,0 +1,8 @@
+ {
+   "eval_gen_len": 81.1528,
+   "eval_loss": 2.02736759185791,
+   "eval_rouge1": 41.4282,
+   "eval_rouge2": 18.7209,
+   "eval_rougeL": 28.2805,
+   "eval_rougeLsum": 38.3536
+ }
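Reviewer note: the Results table in README.md appears to be derived from these values by rounding to two decimals. The sketch below reproduces that mapping in Python. The pairing of the README's "Rouge L" row with `eval_rougeLsum` (rather than `eval_rougeL`) is an assumption inferred from the numbers, not something the commit states, and `readme_rows` is a hypothetical name.

```python
# Values copied from eval/eval_results.json in this commit.
eval_results = {
    "eval_gen_len": 81.1528,
    "eval_loss": 2.02736759185791,
    "eval_rouge1": 41.4282,
    "eval_rouge2": 18.7209,
    "eval_rougeL": 28.2805,
    "eval_rougeLsum": 38.3536,
}

# Assumed mapping from README table rows to result keys. The README value
# 38.35 matches eval_rougeLsum, so that key is used for "Rouge L" here.
readme_rows = {
    "Rouge 1": "eval_rouge1",
    "Rouge 2": "eval_rouge2",
    "Rouge L": "eval_rougeLsum",
}

# Format each metric with two decimals, as in the README table.
table = {label: f"{eval_results[key]:.2f}" for label, key in readme_rows.items()}

for label, value in table.items():
    print(f"| **{label}** | **{value}** |")
```

Note that `eval_rougeL` (28.28) is not reported in the README under this reading.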
eval/sparsity_report.json ADDED
@@ -0,0 +1 @@
+ {"total": 139420416, "nnz": 63361439, "linear_total": 99090432, "linear_nnz": 23081472, "layers": {"0": {"total": 16539648, "nnz": 4338402, "linear_total": 16515072, "linear_nnz": 4321280, "linear_attention_total": 7077888, "linear_attention_nnz": 4170752, "linear_dense_total": 9437184, "linear_dense_nnz": 150528}, "1": {"total": 16539648, "nnz": 3697201, "linear_total": 16515072, "linear_nnz": 3680768, "linear_attention_total": 7077888, "linear_attention_nnz": 3261440, "linear_dense_total": 9437184, "linear_dense_nnz": 419328}, "2": {"total": 16539648, "nnz": 3582655, "linear_total": 16515072, "linear_nnz": 3566080, "linear_attention_total": 7077888, "linear_attention_nnz": 3076096, "linear_dense_total": 9437184, "linear_dense_nnz": 489984}, "3": {"total": 16539648, "nnz": 3230637, "linear_total": 16515072, "linear_nnz": 3214848, "linear_attention_total": 7077888, "linear_attention_nnz": 2703360, "linear_dense_total": 9437184, "linear_dense_nnz": 511488}, "4": {"total": 16539648, "nnz": 3432491, "linear_total": 16515072, "linear_nnz": 3416576, "linear_attention_total": 7077888, "linear_attention_nnz": 3104768, "linear_dense_total": 9437184, "linear_dense_nnz": 311808}, "5": {"total": 16539648, "nnz": 4899061, "linear_total": 16515072, "linear_nnz": 4881920, "linear_attention_total": 7077888, "linear_attention_nnz": 4456448, "linear_dense_total": 9437184, "linear_dense_nnz": 425472}}, "total_sparsity": 54.55368674269341, "linear_sparsity": 76.70665922619048, "pruned_heads": {"0": [8, 10], "1": [0, 5, 6], "2": [4, 6, 9], "3": [0, 1, 2, 4, 6, 7, 8, 9, 11], "4": [1, 2, 5, 6, 9, 10, 11], "5": [2, 3, 4, 5, 6, 10], "6": [0, 12, 13, 14], "7": [2, 11, 16, 17, 18, 19, 23], "8": [2, 14, 16, 21], "9": [3, 12, 13, 15, 16, 18, 21, 22], "10": [3, 12, 14, 16, 18, 20, 21], "11": [10]}}
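Reviewer note: the 23% and 45% density figures in README.md can be recomputed from this report as `nnz / total`. The Python sketch below does that with the counts copied by hand from the JSON above. Reading `linear_dense` as the fc1/fc2 feed-forward weights is an assumption based on the matrix sizes, and the helper `density` is illustrative rather than part of nn_pruning.

```python
# Top-level counts copied from eval/sparsity_report.json in this commit.
report = {
    "total": 139420416,
    "nnz": 63361439,
    "linear_total": 99090432,
    "linear_nnz": 23081472,
    "total_sparsity": 54.55368674269341,
    "linear_sparsity": 76.70665922619048,
}

# (linear_attention_nnz, linear_dense_nnz) for "layers" entries "0" to "5".
layer_nnz = [
    (4170752, 150528),
    (3261440, 419328),
    (3076096, 489984),
    (2703360, 511488),
    (3104768, 311808),
    (4456448, 425472),
]
ATTENTION_TOTAL = 7077888  # linear_attention_total, identical for every entry
DENSE_TOTAL = 9437184      # linear_dense_total, identical for every entry


def density(nnz: int, total: int) -> float:
    """Percentage of weights that are non-zero."""
    return 100.0 * nnz / total


linear_density = density(report["linear_nnz"], report["linear_total"])
overall_density = density(report["nnz"], report["total"])

# Split the linear weights into attention projections and the remaining
# "dense" linear layers (assumed to be the feed-forward fc1/fc2 weights).
attention_nnz = sum(attn for attn, _ in layer_nnz)
dense_nnz = sum(dense for _, dense in layer_nnz)
attention_density = density(attention_nnz, ATTENTION_TOTAL * len(layer_nnz))
dense_density = density(dense_nnz, DENSE_TOTAL * len(layer_nnz))

print(f"linear layers: {linear_density:.1f}% of weights kept")
print(f"whole model:   {overall_density:.1f}% of weights kept")
print(f"attention:     {attention_density:.1f}% kept")
print(f"dense:         {dense_density:.1f}% kept")
```

Density and the reported sparsity sum to 100%, so `linear_sparsity` of about 76.7 corresponds to the 23% quoted in the README, and `total_sparsity` of about 54.6 to the 45% overall figure.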
model_card/density_info.js ADDED
@@ -0,0 +1,174 @@
+ (function() {
+ var fn = function() {
+
+ (function(root) {
+ function now() {
+ return new Date();
+ }
+
+ var force = false;
+
+ if (typeof root._bokeh_onload_callbacks === "undefined" || force === true) {
+ root._bokeh_onload_callbacks = [];
+ root._bokeh_is_loading = undefined;
+ }
+
+
+
+
+ var element = document.getElementById("4348cd46-05bd-4e27-b565-6693f9e0b03e");
+ if (element == null) {
+ console.warn("Bokeh: autoload.js configured with elementid '4348cd46-05bd-4e27-b565-6693f9e0b03e' but no matching script tag was found.")
+ }
+
+
+ function run_callbacks() {
+ try {
+ root._bokeh_onload_callbacks.forEach(function(callback) {
+ if (callback != null)
+ callback();
+ });
+ } finally {
+ delete root._bokeh_onload_callbacks
+ }
+ console.debug("Bokeh: all callbacks have finished");
+ }
+
+ function load_libs(css_urls, js_urls, callback) {
+ if (css_urls == null) css_urls = [];
+ if (js_urls == null) js_urls = [];
+
+ root._bokeh_onload_callbacks.push(callback);
+ if (root._bokeh_is_loading > 0) {
+ console.debug("Bokeh: BokehJS is being loaded, scheduling callback at", now());
+ return null;
+ }
+ if (js_urls == null || js_urls.length === 0) {
+ run_callbacks();
+ return null;
+ }
+ console.debug("Bokeh: BokehJS not loaded, scheduling load and callback at", now());
+ root._bokeh_is_loading = css_urls.length + js_urls.length;
+
+ function on_load() {
+ root._bokeh_is_loading--;
+ if (root._bokeh_is_loading === 0) {
+ console.debug("Bokeh: all BokehJS libraries/stylesheets loaded");
+ run_callbacks()
+ }
+ }
+
+ function on_error(url) {
+ console.error("failed to load " + url);
+ }
+
+ for (let i = 0; i < css_urls.length; i++) {
+ const url = css_urls[i];
+ const element = document.createElement("link");
+ element.onload = on_load;
+ element.onerror = on_error.bind(null, url);
+ element.rel = "stylesheet";
+ element.type = "text/css";
+ element.href = url;
+ console.debug("Bokeh: injecting link tag for BokehJS stylesheet: ", url);
+ document.body.appendChild(element);
+ }
+
+ const hashes = {"https://cdn.bokeh.org/bokeh/release/bokeh-2.3.2.min.js": "XypntL49z55iwGVUW4qsEu83zKL3XEcz0MjuGOQ9SlaaQ68X/g+k1FcioZi7oQAc", "https://cdn.bokeh.org/bokeh/release/bokeh-tables-2.3.2.min.js": "bEsM86IHGDTLCS0Zod8a8WM6Y4+lafAL/eSiyQcuPzinmWNgNO2/olUF0Z2Dkn5i", "https://cdn.bokeh.org/bokeh/release/bokeh-widgets-2.3.2.min.js": "TX0gSQTdXTTeScqxj6PVQxTiRW8DOoGVwinyi1D3kxv7wuxQ02XkOxv0xwiypcAH"};
+
+ for (let i = 0; i < js_urls.length; i++) {
+ const url = js_urls[i];
+ const element = document.createElement('script');
+ element.onload = on_load;
+ element.onerror = on_error.bind(null, url);
+ element.async = false;
+ element.src = url;
+ if (url in hashes) {
+ element.crossOrigin = "anonymous";
+ element.integrity = "sha384-" + hashes[url];
+ }
+ console.debug("Bokeh: injecting script tag for BokehJS library: ", url);
+ document.head.appendChild(element);
+ }
+ };
+
+ function inject_raw_css(css) {
+ const element = document.createElement("style");
+ element.appendChild(document.createTextNode(css));
+ document.body.appendChild(element);
+ }
+
+
+ var js_urls = ["https://cdn.bokeh.org/bokeh/release/bokeh-2.3.2.min.js", "https://cdn.bokeh.org/bokeh/release/bokeh-widgets-2.3.2.min.js", "https://cdn.bokeh.org/bokeh/release/bokeh-tables-2.3.2.min.js"];
+ var css_urls = [];
+
+
+ var inline_js = [
+ function(Bokeh) {
+ Bokeh.set_log_level("info");
+ },
+
+ function(Bokeh) {
+ (function() {
+ var fn = function() {
+ Bokeh.safely(function() {
+ (function(root) {
+ function embed_document(root) {
+
+ var docs_json = '{"7a2e7246-2ff1-4aba-b6ac-1c411e98fc75":{"defs":[],"roots":{"references":[{"attributes":{},"id":"1180","type":"Selection"},{"attributes":{"data":{"density":["2.2%","2.2%","1.9%","1.9%","2.4%","2.4%","2.7%","2.7%","2.9%","2.9%","3.5%","3.5%","1.0%","1.0%","7.0%","7.0%","7.9%","7.9%","8.1%","8.1%","3.7%","3.7%","5.5%","5.5%"],"height":[0.052224,0.052224,0.043776,0.043776,0.0576,0.0576,0.064512,0.064512,0.067584,0.067584,0.082176,0.082176,0.02304,0.02304,0.165888,0.165888,0.187392,0.187392,0.191232,0.191232,0.08832,0.08832,0.13056,0.13056],"img_height":["384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px"],"img_width":["96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px","96px","384px"],"name":["encoder.layers.0.fc1","encoder.layers.0.fc2","encoder.layers.1.fc1","encoder.layers.1.fc2","encoder.layers.2.fc1","encoder.layers.2.fc2","encoder.layers.3.fc1","encoder.layers.3.fc2","encoder.layers.4.fc1","encoder.layers.4.fc2","encoder.layers.5.fc1","encoder.layers.5.fc2","decoder.layers.0.fc1","decoder.layers.0.fc2","decoder.layers.1.fc1","decoder.layers.1.fc2","decoder.layers.2.fc1","decoder.layers.2.fc2","decoder.layers.3.fc1","decoder.layers.3.fc2","decoder.layers.4.fc1","decoder.layers.4.fc2","decoder.layers.5.fc1","decoder.layers.5.fc2"],"parameters":["0.05","0.05","0.04","0.04","0.06","0.06","0.06","0.06","0.07","0.07","0.08","0.08","0.02","0.02","0.17","0.17","0.19","0.19","0.19","0.19","0.09","0.09","0.13","0.13"],"url":["/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_0_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_0_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_laye
rs_1_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_1_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_2_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_2_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_3_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_3_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_4_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_4_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_5_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_5_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_fc2.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_fc2.png","/echarlai
x/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_fc1.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_fc2.png"],"x":[0.45,0.55,1.45,1.55,2.4499999999999997,2.55,3.4499999999999997,3.55,4.45,4.55,5.45,5.55,6.85,6.95,7.85,7.95,8.850000000000001,8.950000000000001,9.850000000000001,9.950000000000001,10.850000000000001,10.950000000000001,11.850000000000001,11.950000000000001]},"selected":{"id":"1180"},"selection_policy":{"id":"1181"}},"id":"1153","type":"ColumnDataSource"},{"attributes":{"data":{"density":["64.1%","51.4%","40.6%","18.2%","27.4%","37.8%","63.0%","49.0%","43.6%","45.3%","46.7%","41.3%","60.1%","35.2%","63.2%","43.4%","52.4%","94.4%"],"height":[0.377856,0.303104,0.239616,0.10752,0.161792,0.223232,0.371712,0.288768,0.257024,0.267264,0.275456,0.243712,0.354304,0.207872,0.372736,0.256,0.309248,0.557056],"img_height":["96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px"],"img_width":["96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px"],"name":["encoder.layers.0_attn.q_proj","encoder.layers.1_attn.q_proj","encoder.layers.2_attn.q_proj","encoder.layers.3_attn.q_proj","encoder.layers.4_attn.q_proj","encoder.layers.5_attn.q_proj","decoder.layers.0_attn.q_proj","decoder.layers.0.encoder_attn.q_proj","decoder.layers.1_attn.q_proj","decoder.layers.1.encoder_attn.q_proj","decoder.layers.2_attn.q_proj","decoder.layers.2.encoder_attn.q_proj","decoder.layers.3_attn.q_proj","decoder.layers.3.encoder_attn.q_proj","decoder.layers.4_attn.q_proj","decoder.layers.4.encoder_attn.q_proj","decoder.layers.5_attn.q_proj","decoder.layers.5.encoder_attn.q_proj"],"parameters":["0.38","0.30","0.24","0.11","0.16","0.22","0.37","0.29","0.26","0.27","0.28","0.24","0.35","0.21","0.37","0.26","0.31","0.56"],"url":["/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw
/main/model_card/images/model_encoder_layers_0_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_1_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_2_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_3_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_4_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_5_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_encoder_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_encoder_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_encoder_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_encoder_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_encoder_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_self_attn_q_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid
/raw/main/model_card/images/model_decoder_layers_5_encoder_attn_q_proj.png"],"x":[0.25,1.25,2.25,3.25,4.25,5.25,6.25,6.6499999999999995,7.25,7.6499999999999995,8.25,8.65,9.25,9.65,10.25,10.65,11.25,11.65]},"selected":{"id":"1176"},"selection_policy":{"id":"1177"}},"id":"1141","type":"ColumnDataSource"},{"attributes":{"source":{"id":"1129"}},"id":"1134","type":"CDSView"},{"attributes":{},"id":"1169","type":"AllLabels"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#aa69f7"},"line_alpha":{"value":0.1},"line_color":{"value":"#aa69f7"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1156","type":"VBar"},{"attributes":{"fill_color":{"value":"#6573f7"},"line_color":{"value":"#6573f7"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1131","type":"VBar"},{"attributes":{"callback":null,"tooltips":"\\n &lt;div&gt;\\n &lt;div style=\\"margin-bottom:10px\\"&gt;\\n &lt;span style=\\"font-size: 15px;\\"&gt;&lt;b&gt;@name&lt;/b&gt;&lt;br/&gt;density=@density&lt;/span&gt;\\n &lt;/div&gt;\\n &lt;div&gt; \\n &lt;img\\n src=\\"@url\\" height=\\"@img_height\\" width=\\"@img_width\\" alt=\\"@url\\"\\n style=\\"float: left; margin: 0px 15px 15px 0px;\\"\\n border=\\"0\\"\\n /&gt;\\n &lt;/div&gt;\\n &lt;/div&gt;\\n "},"id":"1107","type":"HoverTool"},{"attributes":{},"id":"1177","type":"UnionRenderers"},{"attributes":{"start":0},"id":"1113","type":"DataRange1d"},{"attributes":{"label":{"value":"Wq"},"renderers":[{"id":"1145"}]},"id":"1162","type":"LegendItem"},{"attributes":{},"id":"1174","type":"Selection"},{"attributes":{},"id":"1120","type":"BasicTicker"},{"attributes":{"text":"Transformer 
Layers"},"id":"1109","type":"Title"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#6573f7"},"line_alpha":{"value":0.1},"line_color":{"value":"#6573f7"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1132","type":"VBar"},{"attributes":{"data_source":{"id":"1135"},"glyph":{"id":"1137"},"hover_glyph":null,"muted_glyph":null,"name":"Wv","nonselection_glyph":{"id":"1138"},"view":{"id":"1140"}},"id":"1139","type":"GlyphRenderer"},{"attributes":{"data_source":{"id":"1129"},"glyph":{"id":"1131"},"hover_glyph":null,"muted_glyph":null,"name":"Wk","nonselection_glyph":{"id":"1132"},"view":{"id":"1134"}},"id":"1133","type":"GlyphRenderer"},{"attributes":{"axis":{"id":"1123"},"dimension":1,"ticker":null},"id":"1126","type":"Grid"},{"attributes":{"data_source":{"id":"1147"},"glyph":{"id":"1149"},"hover_glyph":null,"muted_glyph":null,"name":"Wo","nonselection_glyph":{"id":"1150"},"view":{"id":"1152"}},"id":"1151","type":"GlyphRenderer"},{"attributes":{},"id":"1168","type":"BasicTickFormatter"},{"attributes":{},"id":"1115","type":"LinearScale"},{"attributes":{"data_source":{"id":"1153"},"glyph":{"id":"1155"},"hover_glyph":null,"muted_glyph":null,"name":"FFN","nonselection_glyph":{"id":"1156"},"view":{"id":"1158"}},"id":"1157","type":"GlyphRenderer"},{"attributes":{"label":{"value":"Wo"},"renderers":[{"id":"1151"}]},"id":"1163","type":"LegendItem"},{"attributes":{"active_multi":null,"tools":[{"id":"1107"}]},"id":"1127","type":"Toolbar"},{"attributes":{},"id":"1173","type":"UnionRenderers"},{"attributes":{"fill_color":{"value":"#aa69f7"},"line_color":{"value":"#aa69f7"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1155","type":"VBar"},{"attributes":{},"id":"1175","type":"UnionRenderers"},{"attributes":{},"id":"1166","type":"AllLabels"},{"attributes":{},"id":"1124","type":"BasicTicker"},{"attributes":{},"id":"1171","type":"BasicTickFormatter"},{"attributes":{"label":{"value":"Wk"},"
renderers":[{"id":"1133"}]},"id":"1160","type":"LegendItem"},{"attributes":{"source":{"id":"1141"}},"id":"1146","type":"CDSView"},{"attributes":{"source":{"id":"1153"}},"id":"1158","type":"CDSView"},{"attributes":{},"id":"1176","type":"Selection"},{"attributes":{"data_source":{"id":"1141"},"glyph":{"id":"1143"},"hover_glyph":null,"muted_glyph":null,"name":"Wq","nonselection_glyph":{"id":"1144"},"view":{"id":"1146"}},"id":"1145","type":"GlyphRenderer"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#20cb97"},"line_alpha":{"value":0.1},"line_color":{"value":"#20cb97"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1144","type":"VBar"},{"attributes":{"label":{"value":"Wv"},"renderers":[{"id":"1139"}]},"id":"1161","type":"LegendItem"},{"attributes":{"data":{"density":["68.2%","45.0%","31.1%","16.0%","21.2%","35.8%","72.9%","37.2%","60.4%","39.8%","60.8%","46.4%","75.0%","31.1%","73.8%","42.0%","64.2%","98.8%"],"height":[0.402432,0.265216,0.183296,0.094208,0.124928,0.210944,0.43008,0.219136,0.356352,0.234496,0.3584,0.273408,0.442368,0.183296,0.4352,0.247808,0.37888,0.582656],"img_height":["96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px"],"img_width":["96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px"],"name":["encoder.layers.0_attn.out_proj","encoder.layers.1_attn.out_proj","encoder.layers.2_attn.out_proj","encoder.layers.3_attn.out_proj","encoder.layers.4_attn.out_proj","encoder.layers.5_attn.out_proj","decoder.layers.0_attn.out_proj","decoder.layers.0.encoder_attn.out_proj","decoder.layers.1_attn.out_proj","decoder.layers.1.encoder_attn.out_proj","decoder.layers.2_attn.out_proj","decoder.layers.2.encoder_attn.out_proj","decoder.layers.3_attn.out_proj","decoder.layers.3.encoder_attn.out_proj","decoder.layers.4_attn.out_proj","decoder.layers.4.encoder_attn.out_pro
j","decoder.layers.5_attn.out_proj","decoder.layers.5.encoder_attn.out_proj"],"parameters":["0.40","0.27","0.18","0.09","0.12","0.21","0.43","0.22","0.36","0.23","0.36","0.27","0.44","0.18","0.44","0.25","0.38","0.58"],"url":["/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_0_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_1_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_2_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_3_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_4_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_5_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_encoder_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_encoder_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_encoder_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_encoder_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_s
elf_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_encoder_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_self_attn_out_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_encoder_attn_out_proj.png"],"x":[0.35,1.35,2.3499999999999996,3.3499999999999996,4.35,5.35,6.35,6.75,7.35,7.75,8.350000000000001,8.75,9.350000000000001,9.75,10.350000000000001,10.75,11.350000000000001,11.75]},"selected":{"id":"1178"},"selection_policy":{"id":"1179"}},"id":"1147","type":"ColumnDataSource"},{"attributes":{},"id":"1179","type":"UnionRenderers"},{"attributes":{},"id":"1181","type":"UnionRenderers"},{"attributes":{"fill_color":{"value":"#20cb97"},"line_color":{"value":"#20cb97"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1143","type":"VBar"},{"attributes":{"items":[{"id":"1160"},{"id":"1161"},{"id":"1162"},{"id":"1163"},{"id":"1164"}],"location":[10,0],"orientation":"horizontal"},"id":"1159","type":"Legend"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#ed5642"},"line_alpha":{"value":0.1},"line_color":{"value":"#ed5642"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1138","type":"VBar"},{"attributes":{"fill_color":{"value":"#f7b269"},"line_color":{"value":"#f7b269"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1149","type":"VBar"},{"attributes":{},"id":"1178","type":"Selection"},{"attributes":{"fill_color":{"value":"#ed5642"},"line_color":{"value":"#ed5642"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1137","type":"VBar"},{"attributes":{"axis":{"id":"1119"},"grid_line_color":null,"ticker":null},"id":"1122","type":"Grid"},{"attributes":{"data":{"density":["67.5%","44.6%","28.8%","15.6%","17.4%","32.5%","71.9%","3
7.8%","59.2%","35.6%","55.7%","41.8%","75.9%","29.7%","73.8%","42.5%","62.5%","99.1%"],"height":[0.398336,0.263168,0.169984,0.09216,0.1024,0.191488,0.423936,0.223232,0.349184,0.20992,0.328704,0.246784,0.447488,0.175104,0.4352,0.25088,0.36864,0.584704],"img_height":["96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px"],"img_width":["96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px"],"name":["encoder.layers.0_attn.v_proj","encoder.layers.1_attn.v_proj","encoder.layers.2_attn.v_proj","encoder.layers.3_attn.v_proj","encoder.layers.4_attn.v_proj","encoder.layers.5_attn.v_proj","decoder.layers.0_attn.v_proj","decoder.layers.0.encoder_attn.v_proj","decoder.layers.1_attn.v_proj","decoder.layers.1.encoder_attn.v_proj","decoder.layers.2_attn.v_proj","decoder.layers.2.encoder_attn.v_proj","decoder.layers.3_attn.v_proj","decoder.layers.3.encoder_attn.v_proj","decoder.layers.4_attn.v_proj","decoder.layers.4.encoder_attn.v_proj","decoder.layers.5_attn.v_proj","decoder.layers.5.encoder_attn.v_proj"],"parameters":["0.40","0.26","0.17","0.09","0.10","0.19","0.42","0.22","0.35","0.21","0.33","0.25","0.45","0.18","0.44","0.25","0.37","0.58"],"url":["/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_0_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_1_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_2_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_3_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_4_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_5_self_attn_v_proj.png","/
echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_encoder_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_encoder_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_encoder_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_encoder_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_encoder_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_self_attn_v_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_encoder_attn_v_proj.png"],"x":[0.15000000000000002,1.1500000000000001,2.15,3.15,4.1499999999999995,5.1499999999999995,6.1499999999999995,6.55,7.1499999999999995,7.55,8.15,8.55,9.15,9.55,10.15,10.55,11.15,11.55]},"selected":{"id":"1174"},"selection_policy":{"id":"1175"}},"id":"1135","type":"ColumnDataSource"},{"attributes":{"source":{"id":"1147"}},"id":"1152","type":"CDSView"},{"attributes":{"above":[{"id":"1159"}],"below":[{"id":"1119"}],"center":[{"id":"1122"},{"id":"1126"}],"height":300,"left":[{"id":"1123"}],"outline_line_color":null,"renderers":[{"id":"1133"},{"id":"1139"},{"id":"1145"},{"id":"
1151"},{"id":"1157"}],"title":{"id":"1109"},"toolbar":{"id":"1127"},"width":505,"x_range":{"id":"1111"},"x_scale":{"id":"1115"},"y_range":{"id":"1113"},"y_scale":{"id":"1117"}},"id":"1108","subtype":"Figure","type":"Plot"},{"attributes":{},"id":"1172","type":"Selection"},{"attributes":{"source":{"id":"1135"}},"id":"1140","type":"CDSView"},{"attributes":{"label":{"value":"FFN"},"renderers":[{"id":"1157"}]},"id":"1164","type":"LegendItem"},{"attributes":{"axis_label":"Parameters (M)","formatter":{"id":"1168"},"major_label_policy":{"id":"1166"},"minor_tick_line_color":null,"ticker":{"id":"1124"}},"id":"1123","type":"LinearAxis"},{"attributes":{},"id":"1117","type":"LinearScale"},{"attributes":{"fill_alpha":{"value":0.1},"fill_color":{"value":"#f7b269"},"line_alpha":{"value":0.1},"line_color":{"value":"#f7b269"},"top":{"field":"height"},"width":{"value":0.07142857142857142},"x":{"field":"x"}},"id":"1150","type":"VBar"},{"attributes":{"axis_label":"Layer","formatter":{"id":"1171"},"major_label_policy":{"id":"1169"},"minor_tick_line_color":null,"ticker":{"id":"1120"}},"id":"1119","type":"LinearAxis"},{"attributes":{},"id":"1111","type":"DataRange1d"},{"attributes":{"data":{"density":["64.6%","54.0%","46.5%","18.1%","27.4%","36.5%","68.6%","42.4%","41.0%","33.2%","43.8%","38.0%","54.0%","29.5%","54.9%","39.4%","48.3%","93.2%"],"height":[0.380928,0.318464,0.274432,0.106496,0.161792,0.21504,0.40448,0.249856,0.241664,0.195584,0.258048,0.224256,0.318464,0.17408,0.323584,0.232448,0.284672,0.549888],"img_height":["96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px"],"img_width":["96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px","96px"],"name":["encoder.layers.0_attn.k_proj","encoder.layers.1_attn.k_proj","encoder.layers.2_attn.k_proj","encoder.layers.3_attn.k_proj","encoder.layers.4_attn.k_proj","encoder.layers.5_attn.k_proj","decoder.laye
rs.0_attn.k_proj","decoder.layers.0.encoder_attn.k_proj","decoder.layers.1_attn.k_proj","decoder.layers.1.encoder_attn.k_proj","decoder.layers.2_attn.k_proj","decoder.layers.2.encoder_attn.k_proj","decoder.layers.3_attn.k_proj","decoder.layers.3.encoder_attn.k_proj","decoder.layers.4_attn.k_proj","decoder.layers.4.encoder_attn.k_proj","decoder.layers.5_attn.k_proj","decoder.layers.5.encoder_attn.k_proj"],"parameters":["0.38","0.32","0.27","0.11","0.16","0.22","0.40","0.25","0.24","0.20","0.26","0.22","0.32","0.17","0.32","0.23","0.28","0.55"],"url":["/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_0_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_1_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_2_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_3_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_4_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_encoder_layers_5_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_0_encoder_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_1_encoder_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_2_encoder_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18
.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_3_encoder_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_4_encoder_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_self_attn_k_proj.png","/echarlaix/bart-base-cnn-r2-18.7-d23-hybrid/raw/main/model_card/images/model_decoder_layers_5_encoder_attn_k_proj.png"],"x":[0.05,1.05,2.05,3.05,4.05,5.05,6.05,6.45,7.05,7.45,8.05,8.450000000000001,9.05,9.450000000000001,10.05,10.450000000000001,11.05,11.450000000000001]},"selected":{"id":"1172"},"selection_policy":{"id":"1173"}},"id":"1129","type":"ColumnDataSource"}],"root_ids":["1108"]},"title":"Bokeh Application","version":"2.3.2"}}';
+ var render_items = [{"docid":"7a2e7246-2ff1-4aba-b6ac-1c411e98fc75","root_ids":["1108"],"roots":{"1108":"4348cd46-05bd-4e27-b565-6693f9e0b03e"}}];
+ root.Bokeh.embed.embed_items(docs_json, render_items);
+
+ }
+ if (root.Bokeh !== undefined) {
+ embed_document(root);
+ } else {
+ var attempts = 0;
+ var timer = setInterval(function(root) {
+ if (root.Bokeh !== undefined) {
+ clearInterval(timer);
+ embed_document(root);
+ } else {
+ attempts++;
+ if (attempts > 100) {
+ clearInterval(timer);
+ console.log("Bokeh: ERROR: Unable to run BokehJS code because BokehJS library is missing");
+ }
+ }
+ }, 10, root)
+ }
+ })(window);
+ });
+ };
+ if (document.readyState != "loading") fn();
+ else document.addEventListener("DOMContentLoaded", fn);
+ })();
+ },
+ function(Bokeh) {
+
+
+ }
+ ];
+
+ function run_inline_js() {
+
+ for (var i = 0; i < inline_js.length; i++) {
+ inline_js[i].call(root, root.Bokeh);
+ }
+
+ }
+
+ if (root._bokeh_is_loading === 0) {
+ console.debug("Bokeh: BokehJS loaded, going straight to plotting");
+ run_inline_js();
+ } else {
+ load_libs(css_urls, js_urls, function() {
+ console.debug("Bokeh: BokehJS plotting callback run at", now());
+ run_inline_js();
+ });
+ }
+ }(window));
+ };
+ if (document.readyState != "loading") fn();
+ else document.addEventListener("DOMContentLoaded", fn);
+ })();
model_card/images/model_decoder_layers_0_encoder_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_0_encoder_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_0_encoder_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_0_encoder_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_0_fc1.png ADDED
model_card/images/model_decoder_layers_0_fc2.png ADDED
model_card/images/model_decoder_layers_0_self_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_0_self_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_0_self_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_0_self_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_1_encoder_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_1_encoder_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_1_encoder_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_1_encoder_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_1_fc1.png ADDED
model_card/images/model_decoder_layers_1_fc2.png ADDED
model_card/images/model_decoder_layers_1_self_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_1_self_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_1_self_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_1_self_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_2_encoder_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_2_encoder_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_2_encoder_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_2_encoder_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_2_fc1.png ADDED
model_card/images/model_decoder_layers_2_fc2.png ADDED
model_card/images/model_decoder_layers_2_self_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_2_self_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_2_self_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_2_self_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_3_encoder_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_3_encoder_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_3_encoder_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_3_encoder_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_3_fc1.png ADDED
model_card/images/model_decoder_layers_3_fc2.png ADDED
model_card/images/model_decoder_layers_3_self_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_3_self_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_3_self_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_3_self_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_4_encoder_attn_k_proj.png ADDED
model_card/images/model_decoder_layers_4_encoder_attn_out_proj.png ADDED
model_card/images/model_decoder_layers_4_encoder_attn_q_proj.png ADDED
model_card/images/model_decoder_layers_4_encoder_attn_v_proj.png ADDED
model_card/images/model_decoder_layers_4_fc1.png ADDED