Erik Novak

eriknovak

AI & ML interests

AI; NLP; text mining

Recent Activity

Reacted to tomaarsen's post with 🔥 14 days ago
I just released Sentence Transformers v3.3.0 & it's huge! 4.5x speedup for CPU with OpenVINO int8 static quantization, training with prompts for a free perf. boost, PEFT integration, evaluation on NanoBEIR, and more!

Details:

1. We integrate Post-Training Static Quantization using OpenVINO, a very efficient solution for CPUs that processes 4.78x as many texts per second on average, while only hurting performance by 0.36% on average. There's a new `export_static_quantized_openvino_model` method to quantize a model.

2. We add the option to train with prompts, e.g. strings like "query: ", "search_document: " or "Represent this sentence for searching relevant passages: ". It's as simple as using the `prompts` argument in `SentenceTransformerTrainingArguments`. Our experiments show that you can easily reach 0.66% to 0.90% relative performance improvement on NDCG@10 at no extra cost by adding "query: " before each training query and "document: " before each training answer.

3. Sentence Transformers now supports training PEFT adapters via 7 new methods for adding new adapters or loading pre-trained ones. You can also directly load a trained adapter with SentenceTransformer as if it's a normal model. Very useful for e.g. 1) training multiple adapters on 1 base model, 2) training bigger models than otherwise possible, or 3) cheaply hosting multiple models by switching multiple adapters on 1 base model.

4. We added easy evaluation on NanoBEIR, a subset of BEIR a.k.a. the MTEB Retrieval benchmark. It contains 13 datasets with 50 queries and up to 10k documents each. Evaluation is fast, and can easily be done during training to track your model's performance on general-purpose information retrieval tasks.

Additionally, we also deprecate Python 3.8, add better compatibility with Transformers v4.46.0, and more.

Read the full release notes here: https://github.com/UKPLab/sentence-transformers/releases/tag/v3.3.0
upvoted a collection 18 days ago
SmolLM2
liked a model 19 days ago
tencent/Tencent-Hunyuan-Large

eriknovak's activity

New activity in E3-JSI/gliner-multi-pii-domains-v1 2 months ago

False positives

2
#2 opened 2 months ago by abpani1994
New activity in E3-JSI/gliner-multi-pii-domains-v1 3 months ago

model max_seq_length?

13
#1 opened 3 months ago by abpani1994
