SqueezeBERT model version for a course project on CLR and PM

This is a variant of the SqueezeBERT model, trained from scratch for unsupervised anomaly detection in logs from the .NET CLR environment. The model is part of my third-year course project at HSE FCS: https://github.com/mastavtsev/PM_NLP/tree/main
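One common way to use a masked language model for unsupervised log anomaly detection is to mask tokens and look at how much probability the model assigns to the token that was actually there; sequences with many poorly predicted tokens get flagged. The description above does not say which scoring rule the project uses, so the following is only a minimal sketch under that assumption. The function name `anomaly_score`, the probability lists, and the threshold are hypothetical, and the probabilities are hard-coded stand-ins for real model output.

```python
import math


def anomaly_score(true_token_probs):
    """Mean negative log-likelihood of the true tokens.

    `true_token_probs` holds, for each masked position, the probability
    the model assigned to the token that was actually at that position.
    """
    if not true_token_probs:
        raise ValueError("need at least one masked position")
    return -sum(math.log(p) for p in true_token_probs) / len(true_token_probs)


# Hypothetical stand-ins for model output (not real measurements).
normal_seq = [0.9, 0.8, 0.95, 0.85]
odd_seq = [0.9, 0.01, 0.02, 0.85]

THRESHOLD = 1.0  # hypothetical; in practice tuned on held-out normal logs

for name, probs in [("normal", normal_seq), ("odd", odd_seq)]:
    score = anomaly_score(probs)
    label = "anomalous" if score > THRESHOLD else "ok"
    print(name, round(score, 3), label)
```

A higher score means the model was more surprised by the sequence. Other rules, such as counting positions where the true token falls outside the top-k predictions, fit the same pattern.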

Files changed (3)

config.json +28 -0
model.safetensors +3 -0
training_args.bin +3 -0

config.json ADDED

	@@ -0,0 +1,28 @@

+{
+  "architectures": [
+    "SqueezeBertForMaskedLM"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "embedding_size": 768,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "initializer_range": 0.02,
+  "intermediate_groups": 4,
+  "intermediate_size": 3072,
+  "k_groups": 4,
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 512,
+  "model_type": "squeezebert",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "output_groups": 4,
+  "pad_token_id": 0,
+  "post_attention_groups": 1,
+  "q_groups": 4,
+  "torch_dtype": "float32",
+  "transformers_version": "4.38.2",
+  "type_vocab_size": 2,
+  "v_groups": 4,
+  "vocab_size": 20000
+}
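The group settings in this config can be sanity-checked with a few lines of arithmetic. The sketch below assumes, based on my understanding of the Transformers SqueezeBERT implementation, that each `*_groups` value is the `groups` argument of a kernel-size-1 grouped convolution, which requires both channel counts to be divisible by the group count. The `layers` table and variable names are illustrative, not part of the commit.

```python
import json

# Subset of the config.json values shown in the diff above.
config = json.loads("""
{
  "hidden_size": 768,
  "intermediate_size": 3072,
  "num_attention_heads": 12,
  "q_groups": 4, "k_groups": 4, "v_groups": 4,
  "post_attention_groups": 1,
  "intermediate_groups": 4,
  "output_groups": 4
}
""")

hidden = config["hidden_size"]
inter = config["intermediate_size"]

# Width of each attention head.
head_dim = hidden // config["num_attention_heads"]

# (config key, input channels, output channels) per grouped layer.
# Assumed layout; see the hedge in the paragraph above.
layers = [
    ("q_groups", hidden, hidden),
    ("k_groups", hidden, hidden),
    ("v_groups", hidden, hidden),
    ("post_attention_groups", hidden, hidden),
    ("intermediate_groups", hidden, inter),
    ("output_groups", inter, hidden),
]

# Grouped convolutions need both channel counts divisible by `groups`.
divisible = all(
    cin % config[key] == 0 and cout % config[key] == 0
    for key, cin, cout in layers
)

# Weights in a kernel-size-1 grouped conv: cin * cout / groups.
grouped_weights = hidden * inter // config["intermediate_groups"]
dense_weights = hidden * inter

print("head_dim:", head_dim)
print("groups divide channels:", divisible)
print("intermediate weights, grouped vs dense:", grouped_weights, dense_weights)
```

With four groups, the intermediate projection holds a quarter of the weights that an ungrouped layer of the same shape would.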

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e78c311a87435e9b1089ba862b99393440f56c34887fbfcb21d8de23e07d71dd
+size 174508672
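The three lines above are a Git LFS pointer, not the weights themselves: the repository stores only the hash and byte size of the real file. As a rough illustration, the pointer can be parsed and combined with `"torch_dtype": "float32"` from config.json to bound the parameter count. The helper name `parse_lfs_pointer` is hypothetical, and the estimate is an upper bound because a safetensors file also contains a small header.

```python
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e78c311a87435e9b1089ba862b99393440f56c34887fbfcb21d8de23e07d71dd
size 174508672
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line)
    fields["size"] = int(fields["size"])  # size is given in bytes
    return fields


pointer = parse_lfs_pointer(pointer_text)

BYTES_PER_FLOAT32 = 4
# Upper bound: ignores the safetensors header stored in the same file.
max_params = pointer["size"] // BYTES_PER_FLOAT32

print(f"{pointer['size'] / 1e6:.1f} MB on disk")
print(f"at most about {max_params / 1e6:.1f}M float32 values")
```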

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:30bdd15fab99e68c6c4c82019ea86ca1941c15f607b2b7f263b7b7468a3d0203
+size 5048