Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published 8 days ago • 40
McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised Sentence Similarity • Updated Apr 30, 2024 • 21.5k • 48
Robust Multi-bit Text Watermark with LLM-based Paraphrasers Paper • 2412.03123 • Published Dec 4, 2024 • 5