Model save

Files changed (4)

README.md +134 -0
adapter_model.safetensors +1 -1
chat_template.json +3 -0
preprocessor_config.json +29 -0

README.md ADDED

	@@ -0,0 +1,134 @@

+---
+library_name: peft
+license: apache-2.0
+base_model: ben81828/CADICA_qwenvl_direction
+tags:
+- llama-factory
+- generated_from_trainer
+model-index:
+- name: qwenvl-2B-cadica-direction-then-detect-and-classify-scale6
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# qwenvl-2B-cadica-direction-then-detect-and-classify-scale6
+This model is a fine-tuned version of [ben81828/CADICA_qwenvl_direction](https://huggingface.co/ben81828/CADICA_qwenvl_direction) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4078
+- Num Input Tokens Seen: 35305984
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 1
+- eval_batch_size: 1
+- seed: 42
+- distributed_type: multi-GPU
+- num_devices: 4
+- gradient_accumulation_steps: 6
+- total_train_batch_size: 24
+- total_eval_batch_size: 4
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_ratio: 0.05
+- training_steps: 3400
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Input Tokens Seen |
+|:-------------:|:------:|:----:|:---------------:|:-----------------:|
+| 1.3918        | 0.0148 | 50   | 1.0422          | 516240            |
+| 0.8208        | 0.0295 | 100  | 0.8917          | 1030696           |
+| 0.8125        | 0.0443 | 150  | 0.9009          | 1550792           |
+| 0.7675        | 0.0591 | 200  | 0.9007          | 2071176           |
+| 0.7558        | 0.0739 | 250  | 0.8108          | 2587272           |
+| 0.78          | 0.0886 | 300  | 0.8194          | 3107200           |
+| 0.6602        | 0.1034 | 350  | 0.7663          | 3625752           |
+| 0.6739        | 0.1182 | 400  | 0.7039          | 4142592           |
+| 0.5661        | 0.1329 | 450  | 0.7133          | 4663320           |
+| 0.6283        | 0.1477 | 500  | 0.6505          | 5183664           |
+| 0.5957        | 0.1625 | 550  | 0.6883          | 5703016           |
+| 0.6331        | 0.1773 | 600  | 0.5883          | 6222736           |
+| 0.5483        | 0.1920 | 650  | 0.6101          | 6743120           |
+| 0.477         | 0.2068 | 700  | 0.5884          | 7262832           |
+| 0.514         | 0.2216 | 750  | 0.4666          | 7779872           |
+| 0.4239        | 0.2363 | 800  | 0.4822          | 8301976           |
+| 0.4949        | 0.2511 | 850  | 0.6122          | 8822832           |
+| 0.4852        | 0.2659 | 900  | 0.5606          | 9345160           |
+| 0.4737        | 0.2806 | 950  | 0.4791          | 9863168           |
+| 0.4005        | 0.2954 | 1000 | 0.5501          | 10379136          |
+| 0.3991        | 0.3102 | 1050 | 0.4378          | 10897528          |
+| 0.4624        | 0.3250 | 1100 | 0.5301          | 11413120          |
+| 0.4432        | 0.3397 | 1150 | 0.4249          | 11933632          |
+| 0.3296        | 0.3545 | 1200 | 0.2966          | 12456040          |
+| 0.335         | 0.3693 | 1250 | 0.3185          | 12972696          |
+| 0.3594        | 0.3840 | 1300 | 0.4716          | 13493264          |
+| 0.3731        | 0.3988 | 1350 | 0.5566          | 14014736          |
+| 0.388         | 0.4136 | 1400 | 0.3866          | 14532288          |
+| 0.3131        | 0.4284 | 1450 | 0.4740          | 15050992          |
+| 0.2928        | 0.4431 | 1500 | 0.4049          | 15572048          |
+| 0.3588        | 0.4579 | 1550 | 0.2871          | 16091960          |
+| 0.3879        | 0.4727 | 1600 | 0.3136          | 16609960          |
+| 0.2698        | 0.4874 | 1650 | 0.4020          | 17130896          |
+| 0.3904        | 0.5022 | 1700 | 0.3297          | 17650984          |
+| 0.3173        | 0.5170 | 1750 | 0.4491          | 18169344          |
+| 0.3127        | 0.5318 | 1800 | 0.3499          | 18691928          |
+| 0.2828        | 0.5465 | 1850 | 0.3781          | 19212992          |
+| 0.306         | 0.5613 | 1900 | 0.3766          | 19735976          |
+| 0.2992        | 0.5761 | 1950 | 0.3468          | 20253288          |
+| 0.2341        | 0.5908 | 2000 | 0.3366          | 20770728          |
+| 0.2931        | 0.6056 | 2050 | 0.3386          | 21291664          |
+| 0.1826        | 0.6204 | 2100 | 0.5386          | 21813984          |
+| 0.2387        | 0.6352 | 2150 | 0.2581          | 22332144          |
+| 0.2662        | 0.6499 | 2200 | 0.4840          | 22849552          |
+| 0.2332        | 0.6647 | 2250 | 0.4966          | 23366784          |
+| 0.2481        | 0.6795 | 2300 | 0.2418          | 23883032          |
+| 0.2313        | 0.6942 | 2350 | 0.1870          | 24401256          |
+| 0.262         | 0.7090 | 2400 | 0.3471          | 24921872          |
+| 0.2412        | 0.7238 | 2450 | 0.3456          | 25439896          |
+| 0.2382        | 0.7386 | 2500 | 0.2543          | 25961056          |
+| 0.2364        | 0.7533 | 2550 | 0.3871          | 26477208          |
+| 0.2082        | 0.7681 | 2600 | 0.3406          | 26997904          |
+| 0.1736        | 0.7829 | 2650 | 0.2697          | 27521088          |
+| 0.2225        | 0.7976 | 2700 | 0.4155          | 28042992          |
+| 0.2501        | 0.8124 | 2750 | 0.4115          | 28561248          |
+| 0.2507        | 0.8272 | 2800 | 0.3223          | 29079576          |
+| 0.1928        | 0.8419 | 2850 | 0.2828          | 29600536          |
+| 0.2029        | 0.8567 | 2900 | 0.3943          | 30118072          |
+| 0.1692        | 0.8715 | 2950 | 0.2034          | 30637448          |
+| 0.234         | 0.8863 | 3000 | 0.2556          | 31159736          |
+| 0.2303        | 0.9010 | 3050 | 0.2253          | 31679080          |
+| 0.1999        | 0.9158 | 3100 | 0.2710          | 32196176          |
+| 0.2069        | 0.9306 | 3150 | 0.2029          | 32713824          |
+| 0.2135        | 0.9453 | 3200 | 0.3564          | 33235872          |
+| 0.1964        | 0.9601 | 3250 | 0.3081          | 33752488          |
+| 0.2131        | 0.9749 | 3300 | 0.3541          | 34269496          |
+| 0.1779        | 0.9897 | 3350 | 0.2255          | 34784784          |
+| 0.2173        | 1.0044 | 3400 | 0.4078          | 35305984          |
+### Framework versions
+- PEFT 0.12.0
+- Transformers 4.47.0.dev0
+- Pytorch 2.5.1+cu121
+- Datasets 3.1.0
+- Tokenizers 0.20.3
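
The derived values in the hyperparameter list can be checked with a little arithmetic. The following is a minimal sketch, not the trainer's own code: it assumes the total batch sizes are simple products of the per-device size, device count, and accumulation steps, that the warmup length is the warmup ratio applied to the total step count, and that the `cosine` scheduler means linear warmup followed by a half-cosine decay to zero. The function name `lr_at` is hypothetical.

```python
import math

# Values copied from the hyperparameter list in the README diff above.
train_batch_size = 1
eval_batch_size = 1
num_devices = 4
gradient_accumulation_steps = 6
learning_rate = 1e-4
warmup_ratio = 0.05
training_steps = 3400

# Assumption: totals are plain products of the per-device settings.
total_train_batch_size = train_batch_size * num_devices * gradient_accumulation_steps
total_eval_batch_size = eval_batch_size * num_devices

# Assumption: warmup length is the ratio applied to the total step count.
warmup_steps = round(training_steps * warmup_ratio)


def lr_at(step: int) -> float:
    """Assumed schedule: linear warmup, then half-cosine decay to zero."""
    if step < warmup_steps:
        return learning_rate * step / warmup_steps
    progress = (step - warmup_steps) / (training_steps - warmup_steps)
    return learning_rate * 0.5 * (1.0 + math.cos(math.pi * progress))


print("total train batch size:", total_train_batch_size)
print("total eval batch size:", total_eval_batch_size)
print("warmup steps:", warmup_steps)
for step in (0, warmup_steps, training_steps // 2, training_steps):
    print(step, lr_at(step))
```

Under these assumptions the products reproduce the listed `total_train_batch_size` of 24 and `total_eval_batch_size` of 4, and the learning rate peaks at the configured 0.0001 once warmup ends.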

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:302b9ce048dad99dc59725f9bc543136929b03fd0548bef61608ff98a26b885e
+oid sha256:e895c1cddd40eb0dbe387456240309b06d66fe014fd793905368c3b37bbbff4a
 size 29034840

chat_template.json ADDED

	@@ -0,0 +1,3 @@

+{
+  "chat_template": "{% set image_count = namespace(value=0) %}{% set video_count = namespace(value=0) %}{% for message in messages %}{% if loop.first and message['role'] != 'system' %}<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n{% endif %}<|im_start|>{{ message['role'] }}\n{% if message['content'] is string %}{{ message['content'] }}<|im_end|>\n{% else %}{% for content in message['content'] %}{% if content['type'] == 'image' or 'image' in content or 'image_url' in content %}{% set image_count.value = image_count.value + 1 %}{% if add_vision_id %}Picture {{ image_count.value }}: {% endif %}<|vision_start|><|image_pad|><|vision_end|>{% elif content['type'] == 'video' or 'video' in content %}{% set video_count.value = video_count.value + 1 %}{% if add_vision_id %}Video {{ video_count.value }}: {% endif %}<|vision_start|><|video_pad|><|vision_end|>{% elif 'text' in content %}{{ content['text'] }}{% endif %}{% endfor %}<|im_end|>\n{% endif %}{% endfor %}{% if add_generation_prompt %}<|im_start|>assistant\n{% endif %}"
+}
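
To make the single-line Jinja template above easier to read, here is a plain-Python sketch that mirrors its branches. It is a hand-written re-implementation for illustration only, not the code that the tokenizer or processor actually runs; the function name `render_chat` and the sample question are hypothetical.

```python
def render_chat(messages, add_generation_prompt=False, add_vision_id=False):
    """Mirror the branches of the chat template using plain string building."""
    parts = []
    image_count = 0
    video_count = 0
    for index, message in enumerate(messages):
        # A default system turn is injected when the first message is not a system message.
        if index == 0 and message["role"] != "system":
            parts.append("<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n")
        parts.append(f"<|im_start|>{message['role']}\n")
        content = message["content"]
        if isinstance(content, str):
            parts.append(content + "<|im_end|>\n")
            continue
        for item in content:
            if item.get("type") == "image" or "image" in item or "image_url" in item:
                image_count += 1
                if add_vision_id:
                    parts.append(f"Picture {image_count}: ")
                parts.append("<|vision_start|><|image_pad|><|vision_end|>")
            elif item.get("type") == "video" or "video" in item:
                video_count += 1
                if add_vision_id:
                    parts.append(f"Video {video_count}: ")
                parts.append("<|vision_start|><|video_pad|><|vision_end|>")
            elif "text" in item:
                parts.append(item["text"])
        parts.append("<|im_end|>\n")
    if add_generation_prompt:
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


# Hypothetical single-turn request with one image and one text segment.
example = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "Which direction is shown?"},
        ],
    }
]
prompt = render_chat(example, add_generation_prompt=True)
print(prompt)
```

Each image or video item contributes one placeholder wrapped in `<|vision_start|>` and `<|vision_end|>`, and the optional `Picture N: ` or `Video N: ` prefix appears only when `add_vision_id` is set.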

preprocessor_config.json ADDED

	@@ -0,0 +1,29 @@

+{
+  "do_convert_rgb": true,
+  "do_normalize": true,
+  "do_rescale": true,
+  "do_resize": true,
+  "image_mean": [
+    0.48145466,
+    0.4578275,
+    0.40821073
+  ],
+  "image_processor_type": "Qwen2VLImageProcessor",
+  "image_std": [
+    0.26862954,
+    0.26130258,
+    0.27577711
+  ],
+  "max_pixels": 12845056,
+  "merge_size": 2,
+  "min_pixels": 3136,
+  "patch_size": 14,
+  "processor_class": "Qwen2VLProcessor",
+  "resample": 3,
+  "rescale_factor": 0.00392156862745098,
+  "size": {
+    "max_pixels": 12845056,
+    "min_pixels": 3136
+  },
+  "temporal_patch_size": 2
+}
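
The numbers in this config relate to each other in a way that is easy to verify. The sketch below assumes that one visual token corresponds to a `merge_size` x `merge_size` group of `patch_size` x `patch_size` patches, and that rescaling is applied before mean/std normalization; both are assumptions about how the image processor uses these fields, not something the diff itself states. The helper `normalize_pixel` is hypothetical.

```python
# Values copied from preprocessor_config.json above.
patch_size = 14
merge_size = 2
min_pixels = 3136
max_pixels = 12845056
rescale_factor = 0.00392156862745098
image_mean = [0.48145466, 0.4578275, 0.40821073]
image_std = [0.26862954, 0.26130258, 0.27577711]

# Assumption: each visual token covers a (patch_size * merge_size)-pixel square.
pixels_per_token = (patch_size * merge_size) ** 2
min_tokens = min_pixels // pixels_per_token
max_tokens = max_pixels // pixels_per_token


def normalize_pixel(rgb):
    """Rescale 0-255 channel values to 0-1, then standardize per channel."""
    return [
        (channel * rescale_factor - mean) / std
        for channel, mean, std in zip(rgb, image_mean, image_std)
    ]


print("pixels per visual token:", pixels_per_token)
print("token budget per image:", min_tokens, "to", max_tokens)
print("rescale factor times 255:", rescale_factor * 255)
print("normalized mid-gray pixel:", normalize_pixel((128, 128, 128)))
```

Under that assumption the pixel bounds divide evenly by the per-token area, and `rescale_factor` is 1/255, which maps 8-bit channel values into the 0 to 1 range before normalization.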