McGill-NLP
/

LLM2Vec-Mistral-7B-Instruct-v2-mntp-supervised

Model card Files Files and versions Community

Configuration Parsing Warning: In adapter_config.json: "peft.task_type" must be a string

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

LLM2Vec is a simple recipe to convert decoder-only LLMs into text encoders. It consists of 3 simple steps: 1) enabling bidirectional attention, 2) masked next token prediction, and 3) unsupervised contrastive learning. The model can be further fine-tuned to achieve state-of-the-art performance.

Repository: https://github.com/McGill-NLP/llm2vec
Paper: https://arxiv.org/abs/2404.05961

Installation

pip install llm2vec

Usage

from llm2vec import LLM2Vec

import torch
from transformers import AutoTokenizer, AutoModel, AutoConfig
from peft import PeftModel

# Loading base Mistral model, along with custom code that enables bidirectional connections in decoder-only LLMs. MNTP LoRA weights are merged into the base model.
tokenizer = AutoTokenizer.from_pretrained(
    "McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp"
)
config = AutoConfig.from_pretrained(
    "McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp", trust_remote_code=True
)
model = AutoModel.from_pretrained(
    "McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp",
    trust_remote_code=True,
    config=config,
    torch_dtype=torch.bfloat16,
    device_map="cuda" if torch.cuda.is_available() else "cpu",
)
model = PeftModel.from_pretrained(
    model,
    "McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp",
)
model = model.merge_and_unload()  # This can take several minutes on cpu

# Loading supervised model. This loads the trained LoRA weights on top of MNTP model. Hence the final weights are -- Base model + MNTP (LoRA) + supervised (LoRA).
model = PeftModel.from_pretrained(
    model, "McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp-supervised"
)

# Wrapper for encoding and pooling operations
l2v = LLM2Vec(model, tokenizer, pooling_mode="mean", max_length=512)

# Encoding queries using instructions
instruction = (
    "Given a web search query, retrieve relevant passages that answer the query:"
)
queries = [
    [instruction, "how much protein should a female eat"],
    [instruction, "summit define"],
]
q_reps = l2v.encode(queries)

# Encoding documents. Instruction are not required for documents
documents = [
    "As a general guideline, the CDC's average requirement of protein for women ages 19 to 70 is 46 grams per day. But, as you can see from this chart, you'll need to increase that if you're expecting or training for a marathon. Check out the chart below to see how much protein you should be eating each day.",
    "Definition of summit for English Language Learners. : 1  the highest point of a mountain : the top of a mountain. : 2  the highest level. : 3  a meeting or series of meetings between the leaders of two or more governments.",
]
d_reps = l2v.encode(documents)

# Compute cosine similarity
q_reps_norm = torch.nn.functional.normalize(q_reps, p=2, dim=1)
d_reps_norm = torch.nn.functional.normalize(d_reps, p=2, dim=1)
cos_sim = torch.mm(q_reps_norm, d_reps_norm.transpose(0, 1))

print(cos_sim)
"""
tensor([[0.5485, 0.0551],
        [0.0565, 0.5425]])
"""

Questions

If you have any question about the code, feel free to email Parishad (parishad.behnamghader@mila.quebec) and Vaibhav (vaibhav.adlakha@mila.quebec).

Downloads last month: 1,468

Inference Providers NEW

Sentence Similarity

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The HF Inference API does not support sentence-similarity models for peft library.

Spaces using McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp-supervised 5

Collection including McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp-supervised

LLM2Vec

Collection

16 items • Updated Oct 8, 2024 • 40

Evaluation results

accuracy on MTEB AmazonCounterfactualClassification (en)
test set self-reported

77.582
ap on MTEB AmazonCounterfactualClassification (en)
test set self-reported

41.455
f1 on MTEB AmazonCounterfactualClassification (en)
test set self-reported

71.761
accuracy on MTEB AmazonPolarityClassification
test set self-reported

91.120
ap on MTEB AmazonPolarityClassification
test set self-reported

88.010
f1 on MTEB AmazonPolarityClassification
test set self-reported

91.105
accuracy on MTEB AmazonReviewsClassification (en)
test set self-reported

49.966
f1 on MTEB AmazonReviewsClassification (en)
test set self-reported

48.908
map_at_1 on MTEB ArguAna
test set self-reported

32.788
map_at_10 on MTEB ArguAna
test set self-reported

48.665

View on Papers With Code