SentenceTransformer based on vinai/phobert-base-v2

This is a sentence-transformers model finetuned from vinai/phobert-base-v2. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

Model Type: Sentence Transformer
Base model: vinai/phobert-base-v2
Maximum Sequence Length: 256 tokens
Output Dimensionality: 768 tokens
Similarity Function: Cosine Similarity

Model Sources

Documentation: Sentence Transformers Documentation
Repository: Sentence Transformers on GitHub
Hugging Face: Sentence Transformers on Hugging Face

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: RobertaModel 
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("meandyou200175/phobert-finetune-512")
# Run inference
sentences = [
    'Bác sĩ cho em hỏi, em bị rạn nứt xương gót chân bên phải. Em bị hơn 1 tháng nay rồi. Em bỏ thuốc lá. Em muốn hỏi bác sĩ thông thường bó bột hơn hay thuốc lá hơn? Như của em khoảng bao lâu thì khỏi? Và giờ em vẫn chưa đi được bác sĩ ạ. Em cảm ơn.',
    'Chào em, Thứ nhất, bắt buộc phải có phim Xquang để biết em có thực sự nứt xương gót hay bị gãy phức tạp hơn, vì nhiều trường hợp tưởng chỉ nứt xương thôi nhưng thật ra là vỡ phức tạp, phải phẫu thuật mới nhanh ổn được. Thứ hai, theo nguyên tắc điều trị nứt gãy xương là phải cố định tốt để can xương mọc ra, chỗ nứt gãy mới được nối liền. Do đó, nếu bó bột thì chân sẽ được cố định liên tục trong 4-6 tuần, còn bó lá thì phải thay thường xuyên, mỗi lần thay là 1 lần xê dịch nên xương khó lành. Tốt hơn hết em nên đến Bệnh viện Chấn thương Chỉnh hình để được kiểm tra và điều trị thích hợp, em nhé. Thân mến.',
    'Chào bạn, Qua hình ảnh sang thương và mô tả triệu chứng, bệnh lý của bạn có khả năng là chàm hay còn gọi là viêm da dị ứng với đặc điểm là viêm và nổi mụn nhỏ, ngứa ngáy. Nguyên nhân của chàm hiện nay chưa rõ nhưng có thể do cơ địa dị ứng (người mắc hen, viêm mũi dị ứng có nguy cơ cao mắc chàm), do kích thích của hóa chất như nước rửa chén, bột giặt, cao su, kim loại, chất liệu giày dép (chàm tiếp xúc),... Thời tiết lạnh, stress, đổ mồ hôi nhiều và phấn hoa... cũng là những nguyên nhân có thể khiến da bị chàm. Chàm cũng có thể gặp ở người bị suy van tĩnh mạch, giãn tĩnh mạch chân khiến tình trạng bệnh dai dẳng, kém đáp ứng điều trị. Điều trị chàm thường phải sử dụng một số loại thuốc bôi da kéo dài, có thể để lại tác dụng phụ, do đó bạn nên khám BS Da liễu để kê toa loại thuốc phù hợp. Ngoài ra, bạn nên chú ý xem có yếu tố nào thường kích thích khởi phát chàm để tránh cho bệnh tái phát bạn nhé! Thân mến.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]

Evaluation

Metrics

Information Retrieval

Evaluated with InformationRetrievalEvaluator

Metric	Value
cosine_accuracy@1	0.7239
cosine_accuracy@3	0.8503
cosine_accuracy@5	0.8928
cosine_accuracy@10	0.935
cosine_precision@1	0.7239
cosine_precision@3	0.2834
cosine_precision@5	0.1786
cosine_precision@10	0.0935
cosine_recall@1	0.7239
cosine_recall@3	0.8503
cosine_recall@5	0.8928
cosine_recall@10	0.935
cosine_ndcg@10	0.8297
cosine_mrr@10	0.7959
cosine_map@100	0.7987
dot_accuracy@1	0.7033
dot_accuracy@3	0.8397
dot_accuracy@5	0.8853
dot_accuracy@10	0.9321
dot_precision@1	0.7033
dot_precision@3	0.2799
dot_precision@5	0.1771
dot_precision@10	0.0932
dot_recall@1	0.7033
dot_recall@3	0.8397
dot_recall@5	0.8853
dot_recall@10	0.9321
dot_ndcg@10	0.8174
dot_mrr@10	0.7807
dot_map@100	0.7837

Training Details

Training Hyperparameters

Non-Default Hyperparameters

eval_strategy: steps
per_device_train_batch_size: 32
per_device_eval_batch_size: 32
learning_rate: 2e-05
num_train_epochs: 5
warmup_ratio: 0.1
fp16: True
batch_sampler: no_duplicates

All Hyperparameters

Click to expand

overwrite_output_dir: False
do_predict: False
eval_strategy: steps
prediction_loss_only: True
per_device_train_batch_size: 32
per_device_eval_batch_size: 32
per_gpu_train_batch_size: None
per_gpu_eval_batch_size: None
gradient_accumulation_steps: 1
eval_accumulation_steps: None
torch_empty_cache_steps: None
learning_rate: 2e-05
weight_decay: 0.0
adam_beta1: 0.9
adam_beta2: 0.999
adam_epsilon: 1e-08
max_grad_norm: 1.0
num_train_epochs: 5
max_steps: -1
lr_scheduler_type: linear
lr_scheduler_kwargs: {}
warmup_ratio: 0.1
warmup_steps: 0
log_level: passive
log_level_replica: warning
log_on_each_node: True
logging_nan_inf_filter: True
save_safetensors: True
save_on_each_node: False
save_only_model: False
restore_callback_states_from_checkpoint: False
no_cuda: False
use_cpu: False
use_mps_device: False
seed: 42
data_seed: None
jit_mode_eval: False
use_ipex: False
bf16: False
fp16: True
fp16_opt_level: O1
half_precision_backend: auto
bf16_full_eval: False
fp16_full_eval: False
tf32: None
local_rank: 0
ddp_backend: None
tpu_num_cores: None
tpu_metrics_debug: False
debug: []
dataloader_drop_last: False
dataloader_num_workers: 0
dataloader_prefetch_factor: None
past_index: -1
disable_tqdm: False
remove_unused_columns: True
label_names: None
load_best_model_at_end: False
ignore_data_skip: False
fsdp: []
fsdp_min_num_params: 0
fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
fsdp_transformer_layer_cls_to_wrap: None
accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
deepspeed: None
label_smoothing_factor: 0.0
optim: adamw_torch
optim_args: None
adafactor: False
group_by_length: False
length_column_name: length
ddp_find_unused_parameters: None
ddp_bucket_cap_mb: None
ddp_broadcast_buffers: False
dataloader_pin_memory: True
dataloader_persistent_workers: False
skip_memory_metrics: True
use_legacy_prediction_loop: False
push_to_hub: False
resume_from_checkpoint: None
hub_model_id: None
hub_strategy: every_save
hub_private_repo: False
hub_always_push: False
gradient_checkpointing: False
gradient_checkpointing_kwargs: None
include_inputs_for_metrics: False
eval_do_concat_batches: True
fp16_backend: auto
push_to_hub_model_id: None
push_to_hub_organization: None
mp_parameters:
auto_find_batch_size: False
full_determinism: False
torchdynamo: None
ray_scope: last
ddp_timeout: 1800
torch_compile: False
torch_compile_backend: None
torch_compile_mode: None
dispatch_batches: None
split_batches: None
include_tokens_per_second: False
include_num_input_tokens_seen: False
neftune_noise_alpha: None
optim_target_modules: None
batch_eval_metrics: False
eval_on_start: False
use_liger_kernel: False
eval_use_gather_object: False
batch_sampler: no_duplicates
multi_dataset_batch_sampler: proportional

Training Logs

Epoch	Step	Training Loss	Validation Loss	cosine_map@100
0	0	-	-	0.1388
0.0730	100	2.3608	-	-
0.1461	200	0.574	-	-
0.2191	300	0.3766	-	-
0.2922	400	0.3073	-	-
0.3652	500	0.2951	-	-
0.4383	600	0.2491	-	-
0.5113	700	0.228	-	-
0.5844	800	0.2352	-	-
0.6574	900	0.201	-	-
0.7305	1000	0.1851	0.1534	0.7402
0.8035	1100	0.1944	-	-
0.8766	1200	0.1638	-	-
0.9496	1300	0.1846	-	-
1.0226	1400	0.1682	-	-
1.0957	1500	0.1643	-	-
1.1687	1600	0.1366	-	-
1.2418	1700	0.1272	-	-
1.3148	1800	0.1195	-	-
1.3879	1900	0.0939	-	-
1.4609	2000	0.0792	0.1085	0.7773
1.5340	2100	0.0653	-	-
1.6070	2200	0.0719	-	-
1.6801	2300	0.0644	-	-
1.7531	2400	0.0553	-	-
1.8262	2500	0.0552	-	-
1.8992	2600	0.0505	-	-
1.9722	2700	0.0671	-	-
2.0453	2800	0.0551	-	-
2.1183	2900	0.0515	-	-
2.1914	3000	0.0462	0.0984	0.7859
2.2644	3100	0.0464	-	-
2.3375	3200	0.0401	-	-
2.4105	3300	0.0294	-	-
2.4836	3400	0.0277	-	-
2.5566	3500	0.0226	-	-
2.6297	3600	0.024	-	-
2.7027	3700	0.0241	-	-
2.7757	3800	0.0214	-	-
2.8488	3900	0.018	-	-
2.9218	4000	0.0237	0.0927	0.7909
2.9949	4100	0.0251	-	-
3.0679	4200	0.0208	-	-
3.1410	4300	0.0207	-	-
3.2140	4400	0.0176	-	-
3.2871	4500	0.0193	-	-
3.3601	4600	0.0173	-	-
3.4332	4700	0.0137	-	-
3.5062	4800	0.0115	-	-
3.5793	4900	0.0113	-	-
3.6523	5000	0.0107	0.0900	0.7952
3.7253	5100	0.009	-	-
3.7984	5200	0.0099	-	-
3.8714	5300	0.01	-	-
3.9445	5400	0.0122	-	-
4.0175	5500	0.0105	-	-
4.0906	5600	0.0107	-	-
4.1636	5700	0.0094	-	-
4.2367	5800	0.0105	-	-
4.3097	5900	0.0088	-	-
4.3828	6000	0.0084	0.0860	0.7987
4.4558	6100	0.0078	-	-
4.5289	6200	0.0067	-	-
4.6019	6300	0.0078	-	-
4.6749	6400	0.0059	-	-
4.7480	6500	0.0056	-	-
4.8210	6600	0.0061	-	-
4.8941	6700	0.0067	-	-
4.9671	6800	0.0082	-	-

Framework Versions

Python: 3.10.14
Sentence Transformers: 3.2.1
Transformers: 4.45.1
PyTorch: 2.4.0
Accelerate: 0.34.2
Datasets: 3.0.1
Tokenizers: 0.20.0

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

meandyou200175
/

phobert-finetune-512